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• 
Variable Length Sequence Decoder 
• 
Compatible with the 82750PA 


Intel's 82750PB is a 25 MHz wide instruction processor that generates and manipulates pixels. When paired 
with its companion chip, the 827500B, and used to implement a OVI@Technology video subsystem, the 
82750PB provides real time (30 images/sec) pixel processing, real time video compression, interactive motion 
video playback and real time video effects. 


Real time pixel manipulations, including 30 images/sec video compression, are supported by the 25 MHz 
instruction rate. On-chip instruction RAM provides programmability for execution of a wide range of algorithms 
that support motion video decompression, text, and 20 and 3D graphics. Inner loops are optimized with the 
integration of sixteen 16-bit quad ported registers, on-chip DRAM, and two loop counters that provide zero 
delay two-way branching "free" in any instruction. Two, 16-bit internal buses enable two parallel register 
transfers on each 82750PB instruction, contributing to the real time performance of the video processing. 
Another feature that adds to the processing power of the 82750PB is the 16-bit ALU, which includes an 8-bit 
dual-add-with-saturate operation critical for pixel arithmetic. Other specialized features for pixel processing 
include a 20 pixel interpolator for image processing functions and a variable length sequence decoder for 
decoding compressed data. 


The 82750PB is implemented using Intel's low-power CHMOS IV Technology and is packaged in a 132-lead 
space-saving, plastic quad flat pack (PQFP) package. 
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33 
Vcc 
35 
Vcc 
45 
Vcc 
51 
Vcc 
65 
Vcc 
67 
Vcc 
75 
Vcc 
82 
Vcc 
91 
Vcc 
98 
..- 


Pin 
Location 
Name 


Vcc 
100 
Vcc 
104 
Vcc 
109 
Vcc 
116 
Vcc 
123 
Vcc 
127 
Vcc 
132 
Vss 
1 
Vss 
32 
Vss 
34 
Vss 
39 
Vss 
48 
Vss 
57 
Vss 
66 
Vss 
68 
Vss 
76 
Vss 
89 
Vss 
94 
Vss 
99 
Vss 
101 
Vss 
108 
Vss 
115 
Vss 
117 
Vss 
124 
Vss 
131 
Vss 
10 
Vss 
17 
Vss 
21 
Vss 
25 
Vss 
29 
Vss 
46 
Vss 
64 
WE# 
49 


I 


Location 
Pin 
Name 


1 
VSS 
2 
VCC 
3 
021 
4 
020 
5 
019 
6 
018 
7 
017 
8 
016 
9 
015 
10 
VSS 
11 
014 
12 
013 
13 
012 
14 
011 
15 
010 
16 
09 
17 
VSS 
18 
08 
19 
07 
20 
06 
21 
VSS 
22 
05 
23 
04 
24 
03 
25 
VSS 
26 
02 
27 
01 
28 
00 
29 
VSS 
30 
HINT# 
31 
HALT# 
32 
VSS 
33 
VCC 


I 


Table 1·2. Pin Cross Reference 
by Location 


Location 
Pin 
Name 


34 
VSS 
35 
Vcc 
36 
HBUSEN# 
37 
TRNFR# 
38 
HROY# 
39 
VSS 
40 
HREG# 
41 
BE3# 
42 
BEU 
43 
BE1# 
44 
BEO# 
45 
VCC 
46 
VSS 
47 
CLKIN 
48 
VSS 
49 
WE# 
50 
VBUS[3] 
51 
VCC 
52 
VBUS[2] 
53 
VBUS[1] 
54 
VBUS[O] 
55 
HALEN# 
56 
HREQ# 
57 
VSS 
58 
HRAM# 
59 
MREQ# 
60 
MROY# 
61 
NXTFST# 
62 
RFSH# 
63 
RESET# 
64 
Vss 
65 
Vcc 
66 
VSS 


Location 
Pin 
Name 


67 
VCC 
68 
VSS 
69 
TEST# 
70 
PMFRZ# 
71 
A2 
72 
A3 
73 
A4 
74 
A5 
75 
VCC 
76 
VSS 
77 
A6 
78 
A7 
79 
A8 
80 
A9 
81 
A10 
82 
VCC 
83 
A11 
84 
A12 
85 
A13 
86 
A14 
87 
A15 
88 
A16 
89 
VSS 
90 
A17 
91 
Vcc 
92 
A18 
93 
A19 
94 
VSS 
95 
A20 
96 
A21 
97 
A22 
98 
VCC 
99 
Vss 


82750PB 


Location 
Pin 
Name 


100 
VCC 
101 
Vss 
102 
A23 
103 
A24 
104 
Vcc 
105 
A25 
106 
A26 
107 
A27 
108 
Vss 
109 
VCC 
110 
A28 
111 
A29 
112 
A30 
113 
A31 
114 
CLKO UT 
115 
VSS 
116 
VCC 
117 
VSS 
118 
031 
119 
030 
120 
029 
121 
028 
122 
027 
123 
VCC 
124 
VSS 
125 
026 
126 
025 
127 
VCC 
128 
024 
129 
023 
130 
022 
131 
VSS 
132 
VCC 


• 
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A(31:t).~ 


ADORES! 
IUS 
-- 
COON 


MllEO# 
RES!T# 
r--'~ 
TRHFR# 
CLKOUT 
I 
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• 
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- 
-- 
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HRDY# 
IC DATAIIUS 
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..• 
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HINT# 
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~ 
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Figure 1-2. 82750PB Functional Signal Groupings 
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HREG,# 
I 
The HOST REGISTER 
and HOST RAM signals, when validated 
by HREQ #, 
HRAM# 
are used to define three host access cycles. HRAM # active indicates 
the host 
is requesting 
a VRAM read or write cycle. HREG # active indicates 
that the 
host is requesting 
a 82750PB 
register read or register write cycle. When both 
signals are inactive, a host external 
cycle is requested. 


HBUSEN# 
0 
HOST BUS ENABLE is asserted 
by the 82750PB 
at the start of a host access 
to indicate that the 82750PB 
Address 
and Data buses (A[31 :2], BE # [3:0], and 
D[31 :0]) have been tri-stated. 
This allows the host to drive the same buses 
either for accessing 
shared VRAM or the 82750PB 
internal registers. 


HALEN# 
I 
The HOST ADDRESS 
LATCH ENABLE signal is used to indicate 
to the 
82750PB 
that the host has asserted 
a valid address 
(A[31 :2], BE # [3:0]) and 
write enable (WE#). 


HRDY# 
0 
HOST READY is asserted 
by the 82750PB 
at the end of a host access to 
indicate 
that the access cycle is ready for data transfer. 
For a host write cycle, 
HRDY # indicates 
that the 82750PB 
is ready to accept data from the host. For 
a host VRAM write cycle, HRDY # indicates 
that the VRAM has latched the 
data from the host. For a host read cycle, HRDY # indicates 
that output data 
from the 82750PB 
or VRAM is ready to be latched by the host. 


HINT# 
0 
HOST INTERRUPT: 
This output is asserted 
when an interrupt 
condition 
is 
detected 
by the 82750PB, 
and the enable bit in the PROCESSOR 
CONTROL 
register corresponding 
to that interrupt 
condition 
is set to a ONE. HINT # stays 
active until the host CPU reads the INTERRUPT 
STATUS 
register. 
If an 
interrupt 
condition 
that is enabled occurs during the same cycle that the 
INTERRUPT 
STATUS 
register is being read, HINT# 
remains active. 


D[31:0] 
1/0 
The DATA BUS is used to transfer 
data between: 
1. The 82750PB 
and VRAM, and 
2. The Host CPU and internal 82750PB 
registers. 
During host VRAM accesses, 
this bus is tri-stated 
to allow the host to share the same VRAM data bus. During 
host accesses 
to internal 82750PB 
registers all 32 bits are used for data 
transfer. 


A[31:9] 
0 
The ADDRESS 
BUS is shared between 
the 82750PB 
and the host for 
A[8:2] 
liD 
addressing 
VRAM. This 30-pin bus addresses 
32-bit double words in VRAM. 


Byte Enable signals are used to address individual 
bytes or words within a 
double word in VRAM. In addition, 
the address for host accesses 
to internal 
82750PB 
registers are communicated 
to the 82750PB 
using the lower seven 
pins, A[8:2], 
and the BE# 
pins. During host access cycles to either VRAM or 
82750PB 
internal 
registers, 
A[31 :2] are tri-stated. 
For internal register 
accesses, 
as indicated 
by HREG # being low, the lower seven bits, A[8:2], 
are 
used as the host address 
input. 


CLKOUT 
0 
The CLOCK OUTPUT 
signal is one of the two internal clocks and is 
synchronized 
with CLKIN. It is always driven and will have a 50% duty cycle. 


I 
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Symbol 
Type 
Name and Fu'nctlon 


BEll 
[3:0] 
1/0 
The BYTE ENABLE BUS is shared by the 82750PB 
and the host for 
addressing 
VRAM down to the byte level. The correspondence 
between 
the four Byte Enable pins and the 0[31:0] 
pins is: BEll [3]-0[31 
:24], 
BEll [2]-0[23:16], 
BEll 
[1]-0[15:8], 
and BEll [0]-0[7:0]. 
During VRAM 
read cycles, the 82750PB 
enables all four bytes. During write cycles the 
82750PB 
only enables those bytes that are to be written. 
Bytes that are 
not enabled are not to be altered in VRAM. During host accesses 
to 
82750PB 
on-chip registers, 
the BE 11 [0] pin is used as an input to select 
whether the even or odd word is being accessed; 
the double word 
address 
is provided 
by the host on the A[8:2] 
pins. BE 11 [0] = 0 indicates 
that data is transferred 
on 0[15:0]. 
BEll [0] = 1 indicates 
that data is 
transferred 
on 0[31:16]. 


MREQII 
0 
MEMORY 
REQUEST 
is asserted 
for the first cycle, T1, of each VRAM 
cycle. 


TRNFRII, 
0 
The MEMORY 
CYCLE DEFINITION 
SIGNALS: Transfer, 
Refresh and 
RFSHII 
Write Enable are asserted 
at the same time as MREQII, 
but stay active 
for the entire VRAM cycle. TRNFR 11 active indicates 
a VRAM transfer 
cycle. RFSH 11 active indicates 
a VRAM refresh cycle. If neither TRNFR 11 
nor RFSH 11 are active, a VRAM data read or write cycle is requested. 


WEll 
1/0 
The WRITE 
ENABLE pin is used as an output during an 82750PBIVRAM 
cycle to drive the WE 11 signal, which defines the access as a VRAM read 
cycle (when inactive) 
or write cycle (when active). During HostlVRAM 
and Host External cycles, the 82750PB 
tri-states 
this pin to allow the host 
to drive the VRAM write enable signals directly. 
During Host/register 
cycles, this pin is used as an input for the Host Write Enable signal to 
determine 
whether the host is reading or writing the 82750PB 
register. 


NXTFSTII 
0 
The NEXT FAST signal indicates 
that the following 
vram cycle can be 
performed 
with a page-mode 
or bank-interleaved 
access. This signal is 
asserted 
during the first of a pair of VRAM cycles that is guaranteed 
to be 
within the same VRAM page and in opposite 
banks-a 
pair of accesses 
to two sequential 
double words in VRAM at addresses 
Even Address 
and 
Even Address + 1. In other words, A[2] is a zero for the first cycle and a 
one for the second cycle. 


MROYII 
I 
The MEMORY 
READY input indicates 
that the VRAM cycle has 
progressed 
to the point where it is ready to perform the data transfer. 
For 
a VRAM read cycle, the VRAM data can be latched by the transition 
of 
MROY 11 to an active state. For a VRAM write cycle, MROY 11 indicates 
that the data has been latched into the VRAMs. 


VBUS[3:0] 
I 
The VDP COMMUNICATION 
BUS is used to communicate 
from the 
827500B 
to the 82750PB. 
Codes sent over this bus indicate 
interrupt 
requests, 
transfer 
requests, 
and status information. 
Since the 827500B 
and 82750PB 
run asynchronously, 
the VBUS signals are sampled 
on the 
falling edge of CLKIN and compared 
with the previous 
sample. 
For a 
VB US code to be detected 
by the 82750PB, 
it must be valid for two 
successive 
samples. 


HALT 11 
I 
The HALT signal causes the microcode 
processor 
on the 82750PB 
to 
halt prior to executing 
the next instruction. 
This signal does not halt the 
VRAM interface. 
The Halt signal will allow the design of a hardware 
emulator 
for the 82750PB 
based on an 82750PB 
chip. 


TEST 11 
I 
The TEST signal is used for test purposes 
only and must remain high for 
normal operation. 


Table 1·3. Pin Descriptions 
(Continued) 
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Table 
1-3. Pin Descriptions 
(Continued) 


Symbol 
Type 
Name and Function 


PMFRZ# 
0 
The PERFORMANCE 
MONITORING 
AND FREEZE signal is toggled 
by 
specific 
microcode 
instructions 
and can be used to determine 
the time 
required to execute 
certain sections 
of microcode. 


Vcc 
I 
POWER pins provide the + 5V D.C. supply input. 


Vss 
I 
GROUND 
pins provide the OV connection 
to which all inputs and outputs 
are referenced. 


Table 
1-4. Output 
Pins 
Table 
1-5. Input 
Pins 


Name 
Active 
When 
Level 
Floated 


CLKOUT 
High 
Always Driven 


A[31:9] 
High 
Reset', 
Host Cycle 


HBUSEN# 
Low 
Reset' 


HRDY# 
Low 
Reset' 


HINT# 
Low 
Reset' 


MREQ# 
Low 
Reset' 


TRNFR#, 
Low 
Reset' 
RFSH# 


NXTFST# 
Low 
Reset' 


PMFRZ# 
Low 
Reset' 


Name 
Active 
Synchronousl 
Level 
Asynchronous 


CLKIN 
High 
Synchronous 


RESET# 
Low 
Asynchronous 


HREQ# 
Low 
Asynchronous' 


HREG# 
Low 
Synchronous 


HRAM# 
Low 
Synchronous 


MRDY# 
Low 
Synchronous 


VBUS[3:0] 
High 
Asynchronous 


HALT# 
Low 
Synchronous 


HALEN# 
Low 
Asynchronous' 


'Can be programmed to accept synchronous inputs. 


'The reset state IS caused by RESET# being active low. 


Table 
1-6. Input/Output 
Pins 


Name 
Active 
Level 
When 
Floated 
Synch/Async 


D[31:0] 
High 
Reset', 
Host Cycle 
Synchronous 


A[8:2] 
High 
Reset', 
Host Cycle 
Synchronous 


BE#[3:0] 
Low 
Reset', 
Host Cycle 
Synchronous 


WE# 
Low 
Reset', 
Host Cycle 
Synchronous 


'The reset state IS caused by RESET# being active low. 


All output 
pins are floated 
when RESET is active 
low. 
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2.0 ARCHITECTURE 


Overview 


The 
82750PB 
includes 
a 
wide 
instruction 
word 
processor 
that 
comprises 
a number 
of processing, 


storage, 
and 
input! output 
elements. 
The 
wide 
in- 


struction 
word architecture 
allows a number of these 
elements 
to operate 
in parallel. 
The 82750PB 
exe- 
cutes 
one 
instruction 
every 
internal 
clock 
cycle 
or 
T-cycle. 
The 
various 
elements 
are 
connected 
via 
two 16-bit buses, the A bus and B bus, as shown in 
Figure 2-1. During each instruction 
execution 
cycle, 


data can be transferred 
from a bus source 
to a bus 
destination 
element 
on both buses. 


Registers 


(rN; N = 0- 15) 


There 
are 16 general-purpose 
data registers, 
each 
16 bits wide, that are connected 
to both the A bus 
and B bus as both sources 
and destinations. 
These 
registers 
are designated 
rO-r1S. 
All the registers 
are 


intel~ 


functionally 
identical 
except 
rO, which 
also includes 
logic for bit shifting 
and byte swapping. 
A register 
can 
source 
both 
the 
A bus and the 
B bus in the 
same cycle. A reqister 
cannot 
be the destination 
of 
both the A bus and the B bus in a single instruction. 
Because 
the regi::;ters are doubly 
latched, 
the same 
register 
may be both a source and destination 
in the 
same cycle. The result is that the data in the register 
prior to the curren t cycle will be driven on the source 
bus, 
and 
the, data 
on the 
destination 
bus will 
be 
latched 
into the mgister 
at the end of the cycle. 


Register 
rO has additionat 
logic to allow 
bit shifting 
and byte swapping. 
The value 
in rO can be shifted 
left or right one bit position 
per instruction 
cycle. For 
a right shift, the nl:lW MSB is equal to the old MSB; in 
other 
words, 
the 
value 
is sign-extended. 
For 
left 
shifting, the new LSB is equal to zero. RO cannot 
be 
shifted 
and 
loaded 
in the 
same 
instruction. 
Byte 
swapping, 
on the other hand, only occurs when rO is 
being loaded 
with a value from the A bus or B bus. 
Byte swapping 
causes the most significant 
byte and 
the 
least 
significa nt byte of the 
16-bit value 
being 
loaded 
into rO to be interchanged. 
Refer to Chapter 
4 for a description 
of the SHFT microcode 
field that 
controls 
the shifting 
and swapping 
operations 
in rO. 


SEOUENCER 


IoAICROCODE 
RAI.A 


t.tICROCOOE 
INSTRUCTION 


AlU 


REGISTER 
fiLE 
. 


BARREL 
SHlflER 


COUNTERS 


PIXEL 
INTERPOLATOR 


DATA 
RAI.A 


0[31:0] 


r------, A[31:2] 
BEI[3:0J 
VRAI.A 
POINTERS 


HOST/VRAI.A 
~ 
INTERfACE 
....-- 
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Figure 2-1_ 82750PB 
Block Diagram 
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ALU 


(atu, cc) 


The ALU performs 16-bit arithmetic and logic opera- 
tions, and can also be operated as two independent 
8-bit ALUs for the Dual-Add-with-Saturate operation. 
There are two fields in the microcode instruction that 
affect the operation of the ALU: the ALUOP field 
specifies the operation to be performed, and the 
ALUSS field specifies the source of the two ALU 
inputs. Refer to Chapter 4 for further information on 
these fields. 


The two ALU operands either come from values 
held in the ALU input latches or from "eavesdrop- 
ping" on the A or B buses. The result of any ALU 
operation is latched in the ALU output register, alu. 
In a subsequent instruction this result can be trans- 
ferred to any A or B destination. 


The ALU has four condition flag outputs: CarryOut, 
Sign, Overflow, and Zero. CarryOut is the carry out 
of the most significant bit position. Sign is equal to 
the value of the most significant bit of the result. 
Overflow is the exclusive-OR of CarryOut and the 
Carryln to the most significant bit position of the re- 
sult. Zero is true (a value of 1) if all 16 bits of the 
ALU result are equal to zero. CarryOut and Overflow 
are defined as equal to zero for all logical opera- 
tions. For most ALU operations, the state of these 
four condition flags are latched when the operation 
is complete. There are eight operations (nop, a', b", 
+], 
-], 
0', prof and int) that are exceptions. These 
operations are performed without disturbing the con- 
dition state of the previous ALU operation. 


Microcode routines can read and write the ALU con- 
dition flag register, cc. This can be used to save and 
restore the state of these flags. The bit ordering of 
the ALU condition flags within cc are given in Table 
2-1. A complete list of ALU opcodes is given in Ta- 
ble 2-2. 


Table 2-1. Bit Assignments 
for cc Register 


Bit 
Condition 
Bit 0 
False (This bit of the cc is always read as 
a zero.)" 
Bit 1 
ALU Carry Out 
Bit 2 
ALU Overflow 
Bit3 
ALU Sign 
Bit4 
ALU Zero 
Bit 5 
Loop Counter Zero' 
Bit 6 
ROLSB' 
Bit? 
ROMSB' 
Bit 15:8 
RESERVED-,The state of these bits is 
undefined when read; write as zeros. 


'These are read-only values and are not affected by wntes to the cc 
register. 
I 
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Table 2-2. ALU Opcodes 


Operation 
Mnemonic 
No Operation 
nop 


pass a 
a 
pass b 
b 
1's Compliment of a 
-a 


1's Compliment of b 
-b 


aANDb 
& 
(NOTa) AND b 
-& 
aAND (NOT b) 
&- 
aORb 
1 
aXORb 
• 


a+b 
+ 
a + b + 1 
++ 
a-b 
- 


-a 
+ b 
-+ 
2's Compliment of a 
-a 


2's Compliment of b 
-b 


Incrementa 
a++ 
Increment b 
b++ 
Decrement a 
a-- 


Decrement b 
b-- 
Dual Add with Sat. 
+] 
a + b + (Prev. Carry) 
+< 


a - 
b - 
(Prev. Borrow) 
-< 


-a 
+ b - 
(Prev. Borrow) 
-+< 


Interrupt Host 
int 
Zero 
0' 
Pass a, Don't Latch Flags 
a' 


Pass b, Don't Latch Flags 
a' 


(NOTa) OR b 
-I 
aOR (NOT b) 
1- 
Dual Sub. with Sat. 
-] 
Performance Monitor 
prof 


The Dual-Add-with-Saturate operation performs in- 
dependent 8-bit ADDs on the upper and lower bytes 
of the two ALU operands. The two bytes of the A 
operand are treated as unsigned binary numbers 
(00:FF16corresponds to 0:25510)' The two bytes of 
the B operand are treated as offset binary numbers 
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with an offset of + 128 (00:FF16 corresponds to 
-12810:12710). The upper and lower byte results 
are treated as 9-bit offset binary, including the carry 
output of each byte, with a + 128 offset (000:1FF16 
corresponds to -12810:38310) and are saturated to 
a range of 0-25510' A result that is less than zero is 
set equal to zero or 0016and a result that is greater 
than + 255 is set equal to + 255 or FF16. 


In fact, this operation is symmetric. Either the A op- 
erand or the B operand can be defined as the un- 
signed binary value, and the other operand will be 
treated as the offset signed binary value. 


Dual-subtract-with-saturate is similar to dual-add- 
with-saturate. It calculates A - 
B + 128 on each 
8-bit half of the two 16-bit inputs, and clamps the 
results to 0 and 255. This can be viewed as subtract- 
ing an offset-binary signed byte (-128 
to 127) from 
an unsigned byte (0 to 255). 


The ALU opcode INT generates the MCINT (micro- 
code interrupt) condition. When this condition is de- 
tected by interrupt logic in the host CPU interface, 
and if the Enable MCINT bit in the PROCESSOR 
CONTROL register is set to a ONE, the host inter- 
rupt output, HINT#, will be asserted. Refer to Chap- 
ter 3 for further information on host interface. 


The 'prof' opcode activates the PMFRZ# pin, and is 
primarily used for performance monitoring and/or 
debugging. 


Barrel Shifter 


(shift, 
shltt-r, 
shift-r/, 
shift-/l 


The barrel shifter performs a single cycle, n-bit left or 
right shift. The barrel shifter operates independent of 
the ALU. The three barrel shifter operations are: 
Shift-r for a right shift with sign extend; Shift-rl for 
right shift with zero fill; and Shift-I for a left shift with 
zero fill. The shift operation is invoked by writing a 
4-bit value (the shift amount) to one of three A bus 
registers, depending on which of the three opera- 
tions is to be performed. The operand is taken from 
the B bus, and the result is stored in the barrel shift- 
er output register, Shift. Like the ALU result register, 
the value in Shift can be read onto the A bus or B 
bus in the following instru~tion cycle. 


A barrel shifter operation does not affect any of the 
condition flags. 


1-14 


intel~ 


Data RAM 


[dramN, 'dramN, + +, - -; N = 1-4l 


The Data RAM holds 512, 16-bit words that are ac- 
cessed using four pointers. To access a value in a 
particular locatio\"',the microcode routine must first 
load a pointer with the address to be accessed, and 
then perform a read or write using the same pointer. 
In parallel with t le data RAM access, the pointer 
can optionally be post-incremented or post-decre- 
mented. The four pointers, referred to as dram1- 
dram4, can be written and read via the A bus. When 
a dram pointer, which is only 9 bits wide, is read onto 
the A bus, its upper seven bits are set to zeros. 


NOTE: 


The width of tne dram pointers may change in 
later versions 01 the 82750PB. Software should 
not rely on the width of a pointer to, for exam- 
ple, mask the upper seven bits of a value to 
zero. 


All four pointers can be used to read or write the 
Data RAM from either the A or B bus. Only one Data 
RAM access can 
e performed in any cycle. A Data 
RAM access is reierred to, using C language syntax, 
as "drsmt, The • means "the value pointed to by". 
As another example, "drems+ + means access the 
Data RAM using the pointer dram3 and increment 
dram3. The symbol - - 
in place of the + + would 
indicate autodecmment. 


Loop Counters 


(cnt,cnt2l 


Two 16-bit loop counters are available to microcode 
programs for automatically counting iterations of a 
microcode loop. In parallel with other operations 
performed in an instruction, either loop counter can 
be decremented, and a conditional branch can be 
made based on the loop counter value being equal 
or not equal to zero. Since the two loop counters 
can be written and read on the A bus, as cnt and 
cnt2 respectively, they can also be used for variable 
storage when not being used as loop counters. The 
loop counters can be written to arid decremented 
during the same instruction cycle. The value in the 
counter at the start of the next cycle will be the value 
written to the counter minus one. 


The LC microcode bit determines the loop counter 
that is selected for decrementing and/or branching 
in an instruction. The LC microcode bit does not af- 
fect the loop counter that is written or read over the 
A bus, since each loop counter is separately ad- 
dressable as an A bus source or destination. Refer 
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to Chapter 4 for a description of the CNT - - 
micro- 
code bit that causes the select loop counter to be 
decremented, and for a description of the CFSEL 
microcode field that is used to perform a conditional 
branch based on the selected loop counter's value. 


Microcode 
RAM 


imcoaer-e, 
maddr, pc} 


The 82750PB executes instructions stored in an on- 
chip microcode RAM. This RAM holds 512 instruc- 
tions and each instruction is 48 bits wide. Normally, 
to start the microcode processor, the host CPU will 
load a microcode program into the microcode RAM, 
point the program counter, pc, to the start of the 
program, and then release the HALT bit to start exe- 
cuting the microcode program. The microcode proc- 
essor can also load its own microcode RAM to over- 
lay new routines and therefore, does not require 
constant intervention by the host to perform multiple 
operations. 


Writing an instruction into Microcode RAM is done 
by first loading the three registers mcode3, mcode2, 
and mcode1 with the three 16-bit words of the in- 
struction 
(the 
most 
significant 
word 
goes 
into 
mcode1), and then loading the address.where the 
instruction should be written into maddr. 
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The host CPU can also read the Microcode RAM by 
first loading the pc with the address of the instruc- 
tion to be read and then reading the three 16-bit 
words of the instruction from the mcode1-mcode3 
registers. Normally, this would be done by the Host 
CPU while the 82750PB is halted. Since mcode1- 
mcode3 hold the instruction pointed to by the pc (Le. 
the instruction that is about to be executed), normal- 
ly reading these three registers from a microcode 
routine is not useful. 


The read registers named mcode1-mcode3 and the 
write registers also named mcode1-mcode3 
are in 
fact different registers. Writing values into mcode1- 
mcode3 and then reading the values of mcode1- 
mcode3 will not read back the same values just writ- 
ten. The read registers hold the instruction stored in 
the instruction latch (the instruction to be executed). 
The write registers hold an instruction that is about 
to be written into microcode RAM. 


After writing to maddr to load an instruction into mi- 
crocode RAM, a one cycle freeze occurs and during 
the freeze a write to the microcode RAM takes 
place. The instruction following the write to maddr 
can either jump to the address just loaded or start 
loading the mcode1-mcode3 registers with the next 
instruction to be written. 


Here are two examples that illustrate the fact that 
the 82750PB requires at least one instruction be- 
tween the write to maddr and the execution of the 
instruction that is loaded by the write to maddr. 


Example 
1: 


maddr 
= ADDRl 
jmp 
addrl 
'* load 
instruction *' 
'* jump 
to 
it, 
this 
is the 
extra 
inst. 
required 
between *' 
'* writing 
to 
maddr 
and 
executing 
the 
loaded 
inst. *' 


ADDR1: 
??????????? 


Example 
2: 
'* here's 
where 
new 
instruction 
gets 
loaded *' 


maddr 
= INS! 
nop 
INS!: 
??????????? 
'* extra 
instruction *' 
'* instruction 
gets 
loaded 
here *' 


When 
a microcode 
routine 
writes 
to pc, one more 
instruction 
is executed 
before 
the jump to the new address 
takes 
effect. 
For example: 


pc 
= 
ADDRl 
rO = rl 
jmp 
ADDR2 '* this 
instructIon 
gets 
executed 
but *' 
'* its 
jump 
to ADDR2 
is 
ignored. *' 


ADDR1: 
r3 = rO 
'* after 
this 
instruction 
executes 
r3 = rO = rl *' 
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When the host CPU writes to the pc, the instruction 
at the address that was written is loaded into the 
mcode1-mcode3 
registers and, when the micro- 
code processor is released from its Halt condition, 
this is the first instruction that will be executed. 


When the host CPU reads the pc, the result returned 
is the address of the instruction that will be executed 
when Halt is released, that is, the address of the 
instruction held in the mcode1-mcode3 
registers. 


Horizontal 
Line Counter 


{lent) 


The 12-bit Horizontal Line Counter is updated by 
VBUS codes from the 82750DB to track the horizon- 
tal display line that is currently being scanned by the 
82750DB. The counter is reset by a VODD code and 
incremented each time an HlINE code is received. 
A value can also be written into a Horizontal Line 
'Counter but this is used primarily for testing the 
82750PB. The upper four bits will always read zeros. 


Field Counter 


!tent) 


The 4-bit field counter is updated by VBUS codes 
from the display processor to keep track of the field 
count being displayed by 82750DB. The counter is 
incremented each time a VODDcode or VEVENcode 
is received. When reading the field counter, the up- 
per 12 bits will read zeros. This counter will not be 
initialized upon reset. 


Input FIFOs 


[inN-lo, inN-hi, inN-e, 'inN; N = I, 2) 


There are two input channels, referred to as input 
FIFOs, through which the processor can read pixels 
or data from VRAM. Each channel automatically 
fetches 64-bit quad words from VRAM and breaks 
them into 8-bit bytes or 16-bit words that are read by 
microcode. Each input FIFO operates independently 
and can be programmed to automatically increment 
or decrement through bytes or words in VRAM. The 
FIFOs are double buffered so that while values are 
being extracted from one quad word (64 bits), the 
next quad word is being prefetched from VRAM. 
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Thj:!mode control register for each input FIFO, des- 
ignated in1-c 01' in2-c, contains four mode bits as 
seen in Figure 2-2. The WORD/BYTE bit (bit 0) de- 
termines whether the input FIFO is in word mode 
(WORD/BYTE = 0) or byte mode (WORD/BYTE = 
1). In byte mode, the FIFO can start reading on any 
byte boundary and in word mode on any word 
boundary. 


The INC/DEC bit (bit 1) determines the order that 
bytes' or words are read from VRAM. In INCRE- 
MENT mode, with INC/DEC = 0, the FIFO reads 
from the least significant byte or word to the most 
significant byte or word of each double word and 
increments through double words in VRAM. In DEC- 
REMENT mode, with INC/DEC = 1, the FIFO reads 
from most significant byte or word to least significant 
byte or word within a double word and decrements 
through double words in VRAM. 


The AHOLD bit (Bit 2) is used by the address hold 
mode. When asserted, (bit 2 = 1) the automatic ad- 
dress increment! decrement function will be disabled 
and input FIFOs will not double buffer VRAM data. In 
other words, at the end of a VRAM cycle, when the 
FIFO has been updated with 64 bits of VRAM data, 
the input FIFO will not issue another MREQ# until 
there is a write to the address-Io registers OR a roll- 
over/roll-under read access of the input FIFO. If a 
roll-over/roll-under occurs, then a memory request 
will be issued to fetch data from the same VRAM 
location. If there is a write to the address-Io register, 
the FIFO will then fetch data from the new location. 


The PREFETCH OFF bit (bit 3) specifies whether 
the FIFO will automatically prefetch successive quad 
words from VRAM or will only fetch a new quad word 
when a value from that quad word is requested. In 
PREFETCH-ONmode, bit 3 = 0, the input FIFO pre- 
fetches successive quad words from VRAM as nec- 
essary to keep its buffer full (either from ascending 
or descending addresses, depending on the state of 
the INC/DEC bit). In PREFETCH-OFF mode, the 
FIFO will still prefetch the first two quad words to fill 
its buffer (when started at a new address location), 
but will only fetch a new quad word when a read 
request is made to the FIFO for a value in the next 
unfetched quad word. 


The CB bit (bit 4) allows circular buffers of sizes 
64 kBytes, 128 kBytes, or 256 kBytes to be created 
in VRAM memory. The choice of different sizes of 
buffers are determined by programming the least 
significant 3 bits of the circular buffer register (cir- 


bits: 
15 ... 4 
Set to Zeros 
1 
INC/DEC 
o 
WORD/BYTE 
5 
4 
3 
2 
BY-32 MODE 
CB 
PF OFF 
AHOLD 


Figure 2-2. Input FIFO Control 
Register 
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cbuf). To enable this feature, the CB bit has to be set 
to a 1, then depending on the buffer size selected, 
the appropriate address pin that goes off chip will be 
forced to a 0 (register pointers remain unchanged). 
Table 2-3 shows the programming combinations of 
the circular buffer register. 


It is important to note that the internal address coun- 
ters themselves are not affected by the circbuf func- 
tion. Only the selected external address pin is forced 
to '0'. 


Table 2·3. Circular Buffer Register (clrcbuf) 


Bits [2:0] Buffer Size Effect on PB Address Bus 
(If Function Enabled) 


000 
Disabled 
None 


100 
256 kBytes Address Pin 18 Forced to 0 


010 
128 kBytes Address Pin 17 Forced to 0 


001 
64 kBytes Address Pin 16 Forced to 0 


In "BY-32" MODE (bit 3), the pointer increments or 
decrements by 32 bits, independent of whether the 
FIFO is in 8-bit pixel mode or 16-bit pixel mode. This 
mode was added to facilitate microcode that oper- 
ates on one component of a 32-bit per pixel image. 


The standard sequence for initializing an input FIFO 
is to write to the control register (in-c), the high ad- 
dress (in-hi), and then the low address (in-Io) of the 
appropriate FIFO. Refer to the access state diagram 
in Chapter 3. The write to in-Io causes the FIFO to 
start reading from VRAM. A byte or word is then 
read from -in. Successive reads from -in will read 
sequential bytes or words from VRAM. Writing to the 
control register each time the FIFO is started at a 
new address is not necessary, except to change the 
FIFO's mode. Also, if the new address is within the 
same 64 kByte page of VRAM, only the lo-address 
needs to be written in order to start the FIFO reading 
from the new address. 


If microcode attempts to read a value from an empty 
input FIFO, the processor is frozen prior to the exe- 
cution of the instruction, until the FIFO's control log- 
ic has fetched another double word from VRAM and 
extracted the next value. At this point, the processor 
is released from the frozen state, and the instruction 
that reads the value is executed. When the proces- 
sor is frozen waiting for a particular FIFO that isn't 
yet ready, that FIFO's VRAM access priority is raised 
above all other FIFOs. 
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Output FIFOs 


(outN-lo, outN-hi, outN-c, 'outN, outN+ +; N = 1,2l 


There are two output channels, referred to as output 
FIFOs, through which the graphics processor writes 
pixels or data to VRAM. Each channel automatically 
collects bytes or words into 64-bit quad words and 
writes the quad words to VRAM. Each output FIFO 
operates independently and can be programmed to 
write bytes or words into sequential addresses in 
VRAM (either incrementing or' decrementing). The 
FIFOs are double buffered so that while one quad 1 
word is waiting to be written to VRAM, the next quad 
word can be assembled from individual bytes or 
words. 


The mode control .reqister for each output FIFO, 
designated outt-c or ootz-o, contains six mode bits 
as shown in the Figure 2-3. The WORD/BYTE bit 
(bit 0) determines whether the output FIFO is in word 
mode (WORD/BYTE = 0) or byte mode (WORDI 
BYTE = 1). In byte mode the FIFO can start writing 
on any byte boundary in VRAM and in word mode on 
any word boundary. 


The INC/DEC bit (bit 1) determines the order that 
bytes or words are written to VRAM. In INCREMENT 
mode, with INC/DEC = 0, the FIFO writes from the 
least significant byte or word to the most significant 
byte or word in a double word and increments 
through double words in VRAM. In DECREMENT 
mode, with INC/DEC = 
1, the FIFO writes from 
most significant byte or word to least significant byte 
or word within a double word and decrements 
through double words in VRAM. 


When the AHOLD bit (bit 2) is set, the output FIFO 
quad word address is not incremented or decre- 
mented. In this mode, the FIFO continues to output 
to a single quad word in VRAM. 


The FORCE-LSBbits (bits 3 and 4) are used to force 
the least significant bit of each byte written to VRAM 
to either a zero or a one. This can be used, for ex- 
ample, to force the LSB to the correct polarity when 
writing to the U bitmap during motion video decom- 
pression. In certain display modes for the 82750DB, 
the LSB of the 8-bit samples in the U or Y bitmap are 
used to select VIDEO or GRAPHICS display mode 
for the n x n group of display pixels corresponding to 
the particular U or Y sample. A one in the FORCE- 


bits: 
15-6 
5 
4 
3 
2 
1 
0 
Set to Zeros BY-32 MODE FORCE-LSB FORCE-LSB AHOLD INC/DEC WORD/BYTE 
ENABLE 
VALUE 


Figure 2·3. Output FIFO Control Register 
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LSB ENABLE bit (bit 4) enables the forcing; a zero 
results in normal operation. The FORCE-LSB VAL- 
UE bit (bit 3) is used as the value to which the LSB is 
forced. Whether in byte mode or word mode, the 
LSB of each byte is forced to the FORCE-LSBvalue. 


In "BY-32" MODE (bit 5), the pointer increments or 
decrements by 32 bits, independent of whether the 
FIFO is in 8-bit pixel mode or 16-bit pixel mode. This 
mode is used to facilitate microcode that operates 
on one component of a 32-bit per pixel image. The 
bytes or words that are skipped over will be un- 
changed in VRAM. 


The standard sequence for initializing an output 
FIFO is to write to the control register (out-c), the 
low address (out-to), and then the high address (out- 
hi) of the appropriate FIFO. A series of bytes or 
words is then written to "out. Refer to the access 
state diagram in Chapter 3 (Figure 3-1). 


In order to flush any remaining data in an output 
FIFO before changing its VRAM pointer, it is neces- 
sary to write to the control register. When pointing to 
a new location in VRAM, if the new address is within 
the same 64 kByte page of VRAM, only the lo-ad- 
dress needs to be written. 


There must be one instruction between the write to 
the output FIFOs low address and the first write to 
·outN. Therefore, it is recommended that outN-lo be 
written before outN-hi. The write to outN-hi insures 
that this requirement is met. If only the outN-lo value 
is being changed, it is still necessary to have one 
additional instruction before the first write to ·outN. 


When writing bytes or words to VRAM through an 
output FIFO, a byte or word can be skipped over by 
writing to outN+ + instead of ·outN. When the val- 
ues are written to VRAM, any byte or word that was 
skipped will retain its original value in VRAM, and its 
value is not altered by the VRAM write. This can be 
used when writing a series of pixels, some of which 
are "transparent", 
allowing whatever was behind 
them to show through. 


If the microcode routine attempts to write a value to 
a full output FIFO, the processor is frozen prior to 
the execution of the instruction. The processor re- 
mains frozen until the FIFO has a chance to write 
one of the buffered quad words to VRAM. At that 
point, the processor is released from the frozen 
state, and the instruction that writes the value is exe- 
cuted. When the processor is frozen, waiting for a 
particular FIFO that isn't yet ready, that FIFO's 
VRAM access priority is raised above all other 
FIFOs. 
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Statistical 
Decoder 


(stat-Io, stet-hi 
stet-c. stet-rem, 
'stat, 
'stat# I 


The Statistical Decoder (also referred to as the Huff- 
man Decoder) is a specialized input channel that 
can read a variable-length bit sequence from VRAM 
and convert it into a fixed-length bit sequence that is 
read by the microcode processor. In image com- 
pression, as well as in other applications such as 
text compression, certain values occur more fre- 
quently than others. 
A means of compressing this 
data is to use fewer bits to encode more frequently 
occurring values and more bits to encode less fre- 
quently occurring values. This type of encoding re- 
sults in a variable-length sequence in which the 
length of a symbol (the group of bits used to encode 
a single value) can range for example, from one bit 
to sixteen bits. 


The statistical code that the statistical decoder can 
decode is of either of the two forms: 


Ox 
10x 
110xxx 
1110xxxxx 


1x 
01x 
001xxx 
0001xxxxx 
or 
11111110XXXXXI( 
111111110xxxxxx 
00000001xxxxxx 
000000001xxxxxx 


Each symbol of a given length (one per line as 
shown here) cons sts of a run-in sequence followed 
by some number of x-bits. The run-in sequence is 
defined as a series of zero or more ONEs followed 
by a ZERO or, as in the code on the right above, 
zero or more ZEnOs followed by a ONE. The re- 
mainder of this dascrlption will use examples of the 
code on the left. ~,bit in the decoder's control regis- 
ter determines th-i~polarity of the run-in sequence 
bits. 


In the example on the left, there would be two sym- 
bols of length two: 00 and 01. Each x-bit can take on 
a ZERO or ONE value. The number of x-bits follow- 
ing a run-in sequence can range from zero to six. 
Since the goal, in general, is to have a few short 
codes and a larger number of long codes, typically, 
codes with fewer run-in bits will have fewer x's fol- 
lowing. However, this is not a hardware constraint. A 
code of this form is completely described by a code 
description table indicating: for each length of run-in 
sequence, R = the number of ONEs in the run-in, 
and how many x-bits follow the ZERO. The value of 
R is used as an index into the code description table. 
Due to the hardware implementation, the number 
actually stored in the table is 2x, where x is the num- 
ber of x-bits. 


For the example above, the corresponding code de- 
scription values are given in Table 2-4. 
I 
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Table 2-4. Sample Code Description Table 


R 
X 
2X(dec.) 
2X(bln.) 


0 
1 
2 
0000010 
1 
1 
2 
0000010 
2 
3 
8 
0001000 
3 
5 
32 
0100000 
... 
7 
6 
64 
1000000 


Note that the table only goes up to symbols with 
seven ONEs in the run-in. For symbols with more 
than seven ONEs, the value of X and 2X for seven 
ONEs is used for all symbols having seven or more 
ONEs in the run-in sequence. For example, in the 
code above a symbol with eight or more ONEs in the 
run-in sequence has six x-bits following the ZERO, 
which is the same as symbols having seven ONEs. 


For each different symbol, including all symbols of 
the same run-in length with different x-bit values, the 
decoder generates a unique fixed-length, 16-bit val- 
ue. Some of the decoded values for the sample 
code given above are provided in Table 2-5. 


Table 2-5 Decoded Values 


Symbol" 
Decoded Value 


00 
0 


01 
1 


100 
2 


101 
3 


110000 
4 


110001 
5 


110010 
6 


... 
. .. 


110111 
11 


111000000 
12 


... 
... 


111011111 
43 


... 
... 
.The x-bits of the symbol are onboldface for clanty. 


The algorithm for generating a decoded value from a 
symbol is as follows: all symbols of a given run-in 
length are assigned a base value, 8; the value corre- 
sponding to a particular symbol is equal to 8 plus the 
binary value of the x-bits in the symbol. The base 
valule 8 for a symbol with a run-in length of R is 
calculated by: 


S(A) = SUM[2X(r)) 
with r = 0 to A - 
1, 
I 
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where X(r) corresponds to the X value in the table 
entry corresponding to R = r. 


For example, in the above code: 
8(0) = 0, 
8 (0) is always zero 
8(1) = 0 + 2 = 2 
8(2) = 0 + 2 + 2 = 4 
8(3) = 0 + 2 + 2 + 8 = 12 
8(4) = 0 + 2 + 2 + 8 + 32 = 44 


This is one of the reasons that the table holds 2x 
instead of X. The calculation of 8(R) are easier to 
implement in logic. 


There are two enhancements that are made to this 
coding 
scheme 
in 
the 
implementation 
on 
the 
82750P8. These two modes are referred to as END 
mode and SHORT mode. If neither END nor SHORT 
mode are enabled, the decoding is performed as de- 
scribed above. SHORT mode allows the decoder to 
be switched easily to a simpler code format without 
having to reload the code description table. In the 
SHORT form, all symbols have the same number of 
x-bits, as though all entries in the table had been 
filled with the same value of 2x. When SHORT mode 
is invoked, this value of 2x is obtained from a field in 
the statistical decoder's CONTROL word, instead of 
from the individual table entries. 


END mode is added in recognition of the fact that, 
for codes with few symbols, some increase in effi- 
ciency is possible by not having to place a zero at 
the end of the longest run-in sequence. For exam- 
ple, consider the code: 


o 
10x 
110x 


The END mode allows us to shorten the last symbol 
to 11x instead of 11Ox.The trailing ZERO is not re- 
quired because the decoder has been told that the 
maximum length of a run-in is two ONEs. The result- 
ing symbol set and corresponding decoded values 
are given in Table 2-6. 


Table 2-6. END Mode Decoded Values 


Symbol 
Decoded Value 


0 
0 


100 
1 


101 
2 


110 
3 


111 
4 
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The number 
of x-bits must be constant 
for all sym- 
bols 
of the 
same 
run-in 
length. 
Therefore, 
a code 
such as: 


o 
10xx 
11xxx +- 
NOT CORRECT! 
... 
Must be 11xx. 


is not allowed. 
The last symbol 
(11xxx, in this case) 
uses the same table entry for 2x as the next to last 
symbol 
(10xx) and, therefore, 
the last symbol will be 
11xx. 


The maximum 
length of the run-in sequence 
in END 
mode is specified 
by placing an END flag in the code 
description 
table. 
For example, 
a code and the cor- 
responding 
table is shown 
in Table 2-7. 


Table 
2-7. END Flag Decoded 
Values 


Code 
Table 
Entries 


Index 
END Bit 
2X 


0 
0 
0 
0 


10xx 
1 
0 
4 


110xxx 
2 
1 
8 


111xxx 
3 
- 
- 


4 
- 
- 


5 
- 
- 


6 
- 
- 


7 
- 
- 


The hyphens 
indicate 
that those table entries 
aren't 
used 
to 
decode 
this 
code. 
Note 
that 
the 
symbol 
111xxx has three x-bits because 
of the value of 2x in 
Index 2; it is not based on the 2X value in Index 3. 


The 
SHORTED 
and 
END 
modes 
can 
be invoked 
simultaneously, 
resulting 
in a code such as: 


Ox 
10x 
110x 
111x 


with 
a SHORT - zx value 
= 2 (for 1 x-bit in each 
symbol) 
and the END bit set in Index 2. 


Packed 
binary fields with one to seven bits per field 
can be read using the statistical 
decoder 
by setting 
the END bit in Index 0 and by programming 
the X 
value to be N -1, 
where 
N is the number of bits per 
field. 
For example, 
packed 
three-bit 
fields 
could 
be 
decoded 
as shown 
in Table 2-8. 
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Table 
2-8. Packed 
3-Blt 
Field Decoded 
Values 


Code 
Table 
Entries 


Index 
END Bit 
2x 


Oxx 
0 
1 
4 {N = 3, so X = 2} 


1xx 
1 
- 
- 


2 
- 
- 


3 
- 
- 


4 
- 
- 


5 
- 
- 


6 
- 
- 


7 
- 
- 


The unpacked 
bits are in reverse 
order 
relative 
to 
how they are stored in VRAM. 
For example, 
if three- 
bit values 
are packed 
in VRAM, 
the pattern 
110 in 
VRAM 
is read 
from 
right to left and 
gives 
an un- 
packed 
or decoded 
value of 3. 


The 
CONTROL 
register 
for the 
statistical 
decoder 
(stet-c) is used to specify the mode to use for decod- 
ing, as well as to invoke 
certain 
modes 
for writing 
and reading the code description 
table. Refer to the 
bit assignments 
for this register 
below. 
To write 
to 
the code 
description 
table, 
the WRITE 
bit (bit 4) is 
set to a ONE; 
the 
starting 
table 
index 
is reset 
to 
zero. 
Each 
write 
to the 
table 
causes 
the 
index 
to 
increment 
by one. This index will wrap around 
from 
seven 
back to zero. For example, 
to write all eight 
table entries the user would write a value of Ox10 to 
stat-c register and then write eight 8-bit values to the 
register 
stat-ram. The 
most 
significant 
bit of each 
8-bit value is the END bit, and the lower seven 
bits 
are the values 
of 2x. To read the code 
description 
table, the TEST bit (bit 4) of the CONTROL 
register 
is set to a one. The table entries 
are then read from 
the decoder's 
data register (·stat). Reads and writes 
always start at table entry zero. 


NOTE: 


When reading the code description table, it is 
necessary to wait one instruction time between 
the write to stat-c and the first read from "stet. 
An access diagram showing al/ legal sequences 
for read and write FIFO registers is shown in 
Chapter 3 (Figure 3-1). 
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t'lgure 2-4. Statistical 
Decode 
CONTROL 
Register 


END mode is enabled by setting the END bit (bit 6) 
in the CONTROL register to a ONE. The SHORT 
mode is enabled by setting the SHORT bit (bit 7) in 
the CONTROL register to a ONE. When in SHORT 
mode, the five SVAl 
bits (bits 12:8) in the CON- 


TROL register are used as the SHORT- 2x value. 


The POl bit (bit 15) determines the polarity of the 
run-in sequence bits. If bit 15 = 0, then ONEs end- 
ing in ZERO (e.g., 111Oxxx)sequence is selected. If 
bit 15= 1, the ZEROs ending in ONE (e.g., 0001xxx) 
sequence is selected. 


The CB bit (bit 13) allows circular buffers of sizes 
64 kBytes, 128 kBytes, or 256 kBytes to be created 
in memory, as in the case of the input FIFO. The 
choice of different sizes of buffers are determined 
by programming the least significant 3 bits of the 
circular buffer register (circbuf). To enable this fea- 
ture, the CB bit has to be set to a 1, then depending 
on the buffer size selected, the appropriate address 
pin that goes off chip will be forced to a 0 (register 
pointers remain unchanged). Table 2-3 shows the 
programming combination of the circular buffer reg· 
ister. 


The decoding parameters may be changed between 
symbols by writing to the CONTROL register and, if 
necessary, writing new values into the code descrip- 
tion table. The correct procedure for changing the 
code type or decode mode is to read the last value 
from the decoder prior to the change, using rstet« 
instead of "stet. This keeps the decoder from auto- 
matically starting to decode the next symbol. At this 
point, the code description table and the SHORT 
and END mode bits can be changed as desired. The 
next time the CONTROL register is written with both 
TEST = 0 and WRITE = 0, the decoder will begin 
to decode the next symbol using the new parame- 
ters. 


The statistical decoder buffers one quad word read 
from VRAM so that the decoding of bits in one 32-bit 
I 


word and the fetch of the next 32-bit word may over- 
lap. As with the input and output FIFOs, the decoder 
has a VRAM pointer associated with it that points to 
the location in VRAM from which it is reading data. 
This pointer increments twice each time a new quad 
word is read; there is no decrement mode. When the 
least significant word of the decoder's pointer (stat- 
Io) is written, any data that had previously been pre- 
fetched from VRAM is ignored, and the decoder 
fetches one quad word starting from this new loca- 
tion. 


The 82750PB assumes that the statistically encoded 
bitstream in VRAM starts with the least significant bit 
of a double word. That is, the two lSBs of the ad- 
dress written to start-to are ignored. 


The statistical decoder decodes data at a rate of 
one bit per T-cycle. To a first approximation, the de- 
code time for an N-bit symbol is: 


decode 
time (in T-cycles) 
= N + 1 


Since it takes at least 64 T-cycles to decode data 
from one quad word, which is the time required for 
eight quad word reads from VRAM, the decoder 
should rarely run out of data. Therefore, the above 
estimate should very accurately model the actual 
decoding rate of the statistical decoder. 


The statistical decoder always begins to read the 
bitstream from the least significant bit of the double 
word found at the starting location in VRAM. That is, 
the decoder does not start on a byte or word bound- 
ary as an input FIFO or output FIFO does, but only 
on double word boundaries. The bitstream moves 
from the least significant bit to the most significant 
bit of a double word and then to the least significant 
bit of the next double word (at the next higher ad- 
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dress location). For the x-bits, the first x-bit read 
from the bitstream becomes the most significant bit 
of the x-bit field when it is interpreted as a binary 
number. The example below shows a code defini- 
tion, a bitstream stored in VRAM, and the resulting 
decoded values. 


The code definition and range of values for each 
symbol length are indicated in Table 2-9. 


Table 2-9. VRAM Bltstream Decode Values 


Symbol Values 
Comments 


0 
0 


10x 
1,2 
100 = 1, 101 = 2 


110xx 
3-6 
11000 = 3, ... ,11011 = 6 


1110xxx 7-14 
1110000 = 7, ... ,1110111 = 14 


Decoding starts at address 0 in this example. The 
two double words at addresses 0 and 1 are: 
0: OxAC98E14D 
1: Ox372E74CB 


The bitstream in VRAM, with colons dividing the 
symbols (read from right to left starting at LSB of 
address 0) is shown in Figure 2-5. 


Table 2-10 lists the symbols, in the order they are 
encountered in the bitstream, and the corresponding 
decoded values. 


Table 2-10. Decoding Symbols 


Symbol 
Value 
Comments 
101 
2 
Starts at LSB, 
Address 0, 
Scanning Left 
100 
1 
101 
2 
0 
0 
0 
0 
0 
0 
0 
0 
1110001 
8 
100 
1 
100 
1 
11010 
5 
1110100 
11 
Spans First and 
Second Double Word 
11001 
4 
0 
0 
1110011 
10 
101 
2 
0 
0 
0 
0 
1110110 
13 
... 
... 


0:0110111: 
0: 
0:1.01: 
1100111: 
0: 
10011 
:001011 


240854-5 


. Address MSe 
0( 
Read bltstream from LSe to MSe 
0( 
LSB 
Start 
o 
1: .2...!. 0 1 1 :~ 0 1 : 0 0 1 : 1 0 0 0 1 1 1 : 0 : 0 : 0 : 0 : 1 0 1 : 0 0 1 : 1 0 1'" 
Here 


First bit of a symbol, continued at LSe of next double word 


Figure 2-5. VRAM Bltstream Decoding Addresses 
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Figure 2-6. Plxel Interpolation 


Pixel Interpolator 


(Pixint·c, 
Pixintl 


The pixel interpolator performs bilinear interpolation 
on four 8-bit pixels to generate,in 
effect, a pixel 
shifted by a fraction of a pixel position. See Figure 
2-6. If the four pixels have values of A, B, C, and D; 
and the horizontal weight and vertical weight are h 
and v, respectively, the interpolated value W, ignor- 
ing any quantization effects, is given by: 


W = A*(l-h)(l-v)+ 
S*h(l-v)+C*(l-h)v+ 
O*hv 


The values of h and v are even multiples of 1/16. 
Figure 2-6 illustrates pixel interpolation with an h 
weight of 6/16 or 3/8 and a v weight of 10/16 or 
5/8. 


The pixel interpolar can operate in two modes: se- 
quential-2D and random-2D. Sequential-2D mode is 
used for motion video decoding and when an array 
of pixels are interpolated with a common weighting. 
Random-2D mode is used either when the pixel ar- 
rays to be interpolated are not adjacent pixels in two 
rows or when the weight is changed for each inter- 
polation. (The word random is used here to mean 
non-sequential.) 
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The example in Figure 2-7 shows a single row of 
pixels being interpolated in SequentiaJ-2Dmode us- 
ing two rows from the original (source) bitmap. The h 
and v weighting are constant for all the interpolated 
pixels. In this case, the weights appear to be approx- 
imately h = 10/16 and v = 6/16. 


A 
B 
E 
F 
-First 
Input Row 
W 
X 
Y 
Z 
-Interpolated 
Row 
C 
D 
G 
H 
K 
-Second 
Input Row 


Figure 2-7. Sequentlal-2D 
Plxellnterpolatlon 


The pixel interpolator is pipelined and requires some 
startup sequence to fill the pipeline. Once filled, the 
pixel interpolator generates a new interpolated pixel 
every two T-cycles when in Sequential-2D mode. 
Source pixels are written into the interpolator as pix- 
el pairs. In the case above, the pixel pair BA would 
be written first, followed by the pixel pair DC. It would 
seem more natural to refer to the pixel pair as AB, 
but because of the way 8-bit pixels are arranged in 
16-bit words in VRAM, the left-most pixel on the 
screen is the least significant byte position. For ex- 
ample, if pixel A had a hex value of OxAAand B had 
a value of OxBB,the 16-bit word containing pixels A 
and B would have a value of OxBBAA. 


Then, two pixels are read from the interpolator. Be- 
cause the pipeline isn't full yet, these pixels are read 
and discarded. This loop of writing two pixel pairs 
and reading two output pixels continues four times. 
The two pixels that are read this fourth time are the 
first two valid output pixels: Wand X. The interpola- 
tor may also collect output (interpolated) pixels into 
pixel pairs. For example, pixels Wand X, instead of 
being output separately, would be combined into a 
16-bit pixel pair XW. Since there are two possible 
phase relationships between the input pixel pairs 
and output pixel pairs, the desired phasing (either X 
and W paired or Y and X paired) can be specified. 


•• 
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bits 
15 
14 
13 
12 
11 
10 
9 
8 
7:4 
3:0 
'RESERVED-Write 
as ZERO 
'Pipelining Select (1 = Fast,O = Standard) 
'Phase (0 = In Phase, 1 = Opposite Phase) 
'RESERVED-Write 
as ZERO 


'Pairing (1 = Output Pixel Pairs,0 = Single Pixels) 
'Reset Bit (1 = Reset, 0 = Normal) 
Mode Select Bits - - - - - - - - - - - - - - - - - - -' - - - - - - -' 
Vertical Weight - - - -' 
Horizontal Weight - - - - - -' 


Figure 2-8. Pixellnterpolator 
Control Register 


Random-2D interpolation is used either when the 
pixels to be interpolated are not in horizontal rows or 
when the weight is changed for each interpolated 
pixel. Examples for this are smooth warping or 
smooth scaling operations. In the case of Random- 
20, the processing for successive interpolated pix- 
els cannot take advantage of pipelining; each pixel is 
considered to be the first pixel of a Sequential mode 
interpolation. The weight and the two input pixel- 
pairs are written into the interpolator. After waiting at 
least 10 T-cycles, the one interpolated pixel can be 
read. (The delay is 10 cycles when in the standard 
mode (bit 14 = 0) and 6 T-cycles when in the fast 
mode (bit 14 = 1).) Then, the next two input pixel- 
pairs and if necessary, the new weight value, are 
written, and 10 cycles later the next interpolated pix- 
el can be read. 


The h and v weight values, the mode selection, and 
other control bits are written to the pixel interpolator 
control register (avg-c). The bit assignment for this 
register is in Figure 2-8. The least significant byte 
holds the 4-bit v value (bits7:4) and the 4-bit h value 
(bits 3:0). 


NOTE: 


The values used for h and v here are numerators 
of the fraction 
where the implied 
denominator 
is 
16. 


MODE SELECT 


Bits 8 and 9 are used to select one of four operating 
modes, of which only two are presently defined. 
These modes are given in Table 2-11. 


Table 2-11. Mode Select Operating Modes 


Bits 9:8 
Mode 


00 
RANDOM-2D 


01 
Sequential-2D 


10 
RESERVED 


11 
RESERVED 
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RESET 


Writing a ONE to bit 10 resets the pixel interpolator. 
The pixel interpolator must be reset prior to chang- 
ing modes. 


PAIRING 


A ZERO in bit 11 causes the pixel interpolator to 
output individual pixels. A ONE causes the interpola- 
tor to collect 
adjacent pixels (in Sequential-2D 
mode) into 16-bit pixel pairs. This feature assists in 
motion video decoding, when combined with the 
ALU's dual-add-with-saturate operation, by allowing 
two pixels to be processed each cycle. The phasing 
used in collecting the pixel pairs is determined by the 
Phase bit described below. 


PHASE 


When output pixels are collected into pixel pairs, 
there are two possible alignments of the input pixel 
pairs to the output pixel pairs. The Phase bit (bit 13) 
selects the alignment to be used, based on the rela- 
tive word alignment of the source and destination 
bitmaps in VRAM. When the Phase bit is set to a 
ZERO, this indicates that the bitmaps are in-phase. 
In this case, the first two output pixels are grouped 
into one 16-bit pixel pair (with the first pixel in the 
least significant byte). When the Phase bit is set to a 
ONE, the bitmaps are out-of-phase. In this case, the 
first pixel is placed in the most significant byte of the 
first pixel pair, with invalid data in the least significant 
byte, and the second and third output pixels are col- 
lected into the second pixel pair. This is illustrated in 
Figure 2-9. 


PIPELlNING 


A ZERO in bit 14 causes the pixel interpolator to use 
the standard amount of pipeline delay. A ONE in this 
field will select the fast mode that has less pipeline 
delay. Table 2-12 shows the pipelining delay for both 
modes. Note that the effect 'Ofthe phase bit is to add 
an extra pixel delay. 
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~B 
E__ 
F 
I__ 
J 
1st Row of Input Pixels Pairs 


In-Phase: 


C__ 
O 
G__ 
H 


Out-of-Phase: 
~B 
~F 


??__ 
W 


C__ 
O 
G__ 
H 
K-L 
2nd Row of Input Pixel Pairs 


Y--Z 
Output Pixel Pairs 


K-L 
2nd Row of Input Pixel Pairs 


I__ 
J 
1st Row of Input Pixels Pairs 


~Y 
Z__ 
?? 
Output Pixel Pairs 


Figure 
2-9. Plxel Pair Phases 


Table 
2-12. Plpellnlng 
Delay for 
Sequentlal-2D 
NON-PAIR 
Mode 


Plpellnlng 
Phase 
Pipeline 
Delay 
Bit 
Bit 
In Output 
(Bit 14) 
(Bit 13) 
Plxels 


0 
0 
6 


0 
1 
7 


1 
0 
2 


1 
1 
3 


When in PAIR mode (with bit 11 = one), the amount 
of pixel delay 
does 
not change, 
but half as many 


reads and writes are required 
to fill the pipeline 
be- 
cause 
each 
read or write of the averager 
transfers 


two pixels. For example, 
when in the standard 
mode 
(bit 14 = 0), with zero phase 
(bit 13 = 0) and pair 
mode 
(bit 11 = 1), three 
indeterminate 
pixel pairs 


must be read before the first good pixel pair is read. 
In the same 
case 
but with the phase 
bit = 1, the 
fourth 
pixel pair read contains 
one good 
pixel and 
one indeterminate 
pixel, and the fifth pixel pair read 


contains 
two good pixels. 


RESERVED 


Bits 
15 and 
12 are reserved 
for future 
use. Write 
ZEROs 
into these bit positions. 


Signature 
Register 


(hwid) 


The signature 
register can be read either by the host 


CPU or by microcode 
to determine 
the version of the 


82750PB. 
The value of the signature 
register 
can be 


used 
to 
distinguish 
between 
the 
82750PB 
in the 


I 


82750PA 
emulation 
mode, and the 82750PB 
in na- 


tive 
mode. 
The 
currently 
defined 
signature 
values 


given in Table 2-13. 


Table 
2-13. Signature 
Values 


Value 
Definition 


OxFFFE 
The 82750PB 
Emulating 
the 82750PA 


OXFFFC 
The 82750PB 
in Native Mode 


All other 
signature 
values 
are presently 
undefined 


but may be used in the future 
to denote 
other 
ver- 


sions of the 82750 architecture. 


Display Format Registers 


(yeven, 
yodd, 
vu, vptrl 


The 82750PB's 
processor 
can write 
to the display 


registers 
in the VRAM interface. 
These registers 
are 


pointers 
and pitch 
values 
that 
address 
display 
bit- 


maps and 8275009 
register 
loads in VRAM. 
Point- 


ers 
are 
32-bit 
values 
that 
specify 
the 
specify 
the 


starting 
byte 
address 
of a bitmap 
or register 
load 


within a 4GByte 
address 
space. The bottom 
two ad- 


dress bits are ignored since display bitmaps and reg- 
ister loads 
must 
start on a double 
word 
boundary. 


Therefore, 
the internal 
representation 
of a pointer 
is 


a 30-bit value. The pitch value associated 
with each 


pointer 
indicates 
the number 
of bytes 
between 
the 


start of two lines of a display bitmap or between 
the 


start of two register loads. The pitch is a single 16-bit 
value with its two least significant 
bits ignored, 
since 


the 
pitch 
must 
be 
an 
integer 
number 
of 
double 


words. 
Currently, 
there 
is also 
a restriction 
in the 


827500B 
limiting 
all display 
bitmap 
pitches 
to pow- 


ers of two; so, the maximum 
display 
bitmap 
pitch is 


± 214 Bytes 
= ± 16 kBytes. 
The 
display 
registers 


are described 
in Table 2-14. 
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Table 2-14. Display Registers 


Register 
Description 


yeven-Io, 
hi 
This register pair points to the start of the Y bitmap or main bitmap that 
is to be displayed 
during an even field scan. 


yodd-Io, hi 
This register pair points to the start of the Y bitmap or main bitmap that 
is to be displayed 
during the odd field scan. 


ypitch 
The value in this register is added to the current Y bitmap pointer value 
each time a Y transfer 
is performed. 


vu-le, hi 
This register pair points to the start of the VU bitmap. This bitmap is 
read to generate 
the VU values for both odd and even field scans. 


vupitch 
This value is added to the current VU bitmap pointer value each time a 
VU transfer 
is performed. 


vptr-lo, 
hi 
This register pair points to the start of a series of 8275008 
register 
loads stored in VRAM. 


vpitch 
This value is added to the current 8275008 
register load pointer each 
time a 8275008 
register load is performed. 
The pitch is equal to the 
number of bytes from the start of one register load to the start of the 
next register load. 


3.0 
HARDWARE 
INTERFACE 
• 
Arbitrates 
VRAM accesses 
between 
the two input 
FIFOs, 
the two output 
FIFOs, 
the statistical 
de- 
coder, 
the transfer 
request 
logic, the VRAM 
re- 
fresh logic, and the external 
VRAM access 
logic. 


• 
Ouring a memory 
cycle, performs 
appropriate 
ad- 
dress 
arithmetic 
on the VRAM 
pointer 
used 
for 
that memory 
cycle. 


• 
As a result 
of certain 
V8US 
codes, 
performs 
a 
shadow 
copy that consists 
of copying 
display-re- 


lated VRAM pointer values from shadow 
registers 
(that are loaded 
by the 
host CPU or the micro- 
code 
processor) 
to working 
registers 
where 
the 
various 
pointers 
are 
used 
for 
transfer 
cycles 
when 
the 
8275008 
is 
refreshing 
the 
display 
screen. 


VRAM Interface 


The VRAM 
interface 
performs 
the following 
opera- 
tions: 


• 
Maintains 
VRAM pointers for the two input FIFOs, 
the two output 
FIFOs, the statistical 
decoder, 
the 
Y 
(main) 
bitmap, 
the 
VU 
bitmap, 
and 
the 
8275008 
register 
load. 


• 
Oecodes 
V8US 
codes and takes appropriate 
ac- 


tions such as generating 
a transfer 
cycle, sched- 


uling refresh cycles, or generating 
interrupt condi- 
tions. 


Table 3-1. VRAM Interface 
Signals 


Signal 
Description 


MREQ# 
MEMORY 
REQUEST 
is asserted 
during the first cycle of a VRAM 
memory access. 


TRNFR# 
The TRANSFER 
output indicates 
the current memory cycle is a result 
of a 8275008 
transfer 
request. 


RFSM# 
The REFRESH 
output indicates 
the current memory cycle is a result of 
a 8275008 
refresh request. 


NXTFST# 
The NEXT FAST output indicates 
the next memory access will use the 
same row address as the current memory access. This facilitates 
the 
use of page mode memory accesses. 


MROY# 
The MEMORY 
READY input indicates 
the availability 
of valid data on 
the 0[31 :0] pins. 
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VRAM 
ACCESSES 


The 82750PB can initiate five different types of 
memory accesses: FIFO read, FIFO write, transfer 
read, transfer write, and refresh. In addition, the 
82750PB supports VRAM accesses by external log- 
ic. During an external access VRAM cycle, the 
82750PB tri-states its VRAM address and data bus- 
es and performs a host VRAM read or host VRAM 
write cycle. There is another operation performed by 
the 82750PB, a shadow copy, that is not a VRAM 
cycle but is arbitrated as though it were, since no 
VRAM cycles can take place during a shadow copy. 


The seven types of VRAM cycles initiated by the 
82750PB, including host VRAM read and host 
VRAM write, begin with the 82750PB asserting a 
combination of its three VRAM cycle definition out- 
puts: TRNFR#, RFSH#, and WE#. External logic 
detects the state of these signals, validated by 
MREQ#, and produces the appropriate sequence of 
VRAM control signals (RAS, CAS, etc.) to perform 
the type of memory cycle the 82750PB has request- 
ed. The 82750PB requires that each of these VRAM 
cycles take a minimum of two T-cycles, or T-states, 
denoted T1 and T2. External logic can insert addi- 
tional T2 states in order to stretch the VRAM cycle 
to more than two T-cycles. The start of a new VRAM 
access cycle is signaled by the assertion of MREQ# 
for the first T-cycle, T1. The VRAM access cycle 


82750PB 


definition signals, TRNFR #, RFSH#, and WE#, are 
asserted at the start of T1 and remain asserted until 
the end of the last T2. Other VRAM operations can 
be described similarly by sequences of T-states. Re- 
fer to Figures 3-4 and 3-5 on page 42 for timing dia- 
grams. 


Table 3-2 defines the states used for all VRAM ac- 
cess operations. A state diagram for the VRAMI 
Host Interface is provided in Figure 3-1. This dia- 
gram includes the FIFO access states 


Table 3-2. 82750PB 
VRAM Access States 


State 
Description 


Ti 
Idle State, No VRAM Activity 


T1, TF1 
First State of a VRAM FIFO Cycle 


T2, TF2 
Last State of a VRAM FIFO Cycle 


TSC 
The T-State required to perform a 
shadow copy 


nX1 
First State of a VRAM Transfer Cycle 


nX2 
Last State of a VRAM Transfer Cycle 


TRF1 
First State of a VRAM Refresh Cycle 


TRF2 
Last State of a VRAM Refresh Cycle 


nro ACCESS 
HOST ACCESS 


RErR[SH 
C'IClE 


240654-7 


Figure 3-1. Access State Diagram 
I 
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Note that during successive VRAM cycles it is not 
necessary to go back to the idle state, Ti, between 
each cycle; the TF2state can be followed directly by 
a T1 state, starting at the next VRAM cycle. This 
results in efficient utilization of the 82750PBIVRAM 
bandwidth by allowing a VRAM cycle time of 2 
T-states. 


FAST VRAM 
CYCLES 


When the 82750PB performs Data Read or Data 
Write VRAM cycles for the input or output FIFOs, it 
performs two 32-bit accesses to read or write one 
64-bit value. These accesses are always performed 
in a sequence of EvenAddress followed by EvenAd- 
dress + 1, which guarantees both that the two se- 
quential accesses will be in opposite banks and that 
the two accesses will be within the same VRAM 
page. This allows external logic to use either bank- 
interleaving or a page-mode access to complete the 
second access of the sequence and improve the 
VRAM bandwidth. However, the second access 
does not need to be handled differently from the 
first. Except for the assertion of the NXTFST# sig- 
nal, both accesses are treated as standard VRAM 
accesses. External logic can ignore the NXTFST# 
signal, though, and treat the two accesses as two 
normal data read or data write cycles. Note that 
NXTFST# is not asserted for transfer, refresh, or 
host memory accesses. 
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The NXTFST# output signal is provided for cases 
when external logic can generate a faster access for 
the second access of the two sequential accesses. 
During such a pair of accesses, NXTFST# is assert- 
ed during the first of the two accesses in order to 
provide sufficient time for the external logic to gener- 
ate the appropriate fast memory cycle for the sec- 
ond access. Refer to the timing diagrams in Figures 
3-4 and 3-5 (page 42) for examples illustrating the 
use of the NXTFST# signal. 


VBUS CODES 


Transfer 
request, 
interrupt, 
and 
synchronization 
codes are sent over the BUS from the 827500B to 
the 
82750PB. 
The 
codes 
recognized 
by 
the 
82750PB are listed in Table 3-3, along with the ac- 
tions taken by the 82750PB as a result of receiving 
each code. Codes that cause TRANSFER cycles 
must be asserted for at least two clock cycles of the 
82750PB to insure that, in the worst case, the 
82750PB completes the transfer cycle before the 
code is released and the 82750DB starts shifting 
data from the VRAM shift registers. Other codes 
must also be asserted for a minimum of two 
82750PB clock cycles. Only the codes given in the 
Table 3-3 are valid codes for the VBUS. Other codes 
are reserved for future use and should not be used. 
Once a transfer cycle code is sent to the 82750PB, 
any non-transfer code may be sent immediately. A 
subsequent transfer cycle code should be sent only 
after the current transfer cycle is completed. 
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Table 
3-3. VB US Codes 


Binary 
Name 
Action 


0000 
YBMX 
TXRO Cycle Using Yc; Yc = Yc + Yp' 


0001 
VUBMX 
TXRO Cycle Using VUc; VUc = VUc + VUp 


0010 
REGX 
TXRO Cycle Using Vc; Vc = Vc + Vp 


0011 
WROIGX 
TXWR Cycle Using Yc; Yc = Yc + Yp 


0100 
YNPBMX 
TXRO Cycle Using Yc; Yc = Yc 


0101 
Reserved 
Reserved 


0110 
Reserved 
Reserved 


0111 
WROIGNPX 
TXWR Cycle Using Yc; Yc = Yc 


1000 
OFL 
OFL Int; Shadow Copy" 


1001 
827500BSO 
827500B 
Shutdown 
Interrupt 


1010 
REFRESH 
Schedule 
N Refresh Cycles 


1011 
Reserved 
Reserved 


1100 
VOOO 
VBI Int; OF Int; Shadow 
Copy Odd; Hline = 0··' 


1101 
VEVEN 
VBI Int; EF Inf; Shadow 
Copy Even; Hline = 0'" 


1110 
HLlNE 
Icnt+ 
+ (Increment 
Line Counter) 


1111 
NULL 
No Action 


NOTES: 
'Yc-Y 
bitmap pointer, current; Yp-Y 
bitmap pitch; VU-VU 
bitmap; V-82750DB 
register load. 
"Shadow 
Copy with Yc = Y-start-odd in odd field; Yc = Y-start-even in even field. 


'·'Hline-Horizontal 
line Counter. 


PRIORITY 


Each 
time 
the 
VRAM 
state 
machine 
completes 
a 
VRAM operation 
and returns to the Ti state, it exam- 
ines all pending VRAM access 
requests 
and selects 
the highest 
priority request 
for the next VRAM oper- 
ation. The priority ordering 
of these requests 
are list- 
ed in Table 3-4. 


Table 
3-4. Priority 
of VRAM Operations 


Request 
Type 
Priority 


Transfer 
Cycle 
Highest 


Shadow 
Copy 
• 


Host Access 
• 


VRAM Refresh 
• 


FIFO Read/Write 
Lowest 


NOTE: 
The shadow copy is treated as a VRAM operation even 
though it does not result in an access to VRAM. 


The VRAM 
refresh 
operation 
is placed 
low on the 
priority list to reduce the latency 
in servicing 
transfer 
requests 
and external 
VRAM 
requests. 
Since a sin- 
I 


gle REFRESH 
code from the 827500B 
schedules 
a 
number of refresh cycles, a higher priority for refresh 
would cause all the refresh cycles to occur in a burst 
that would lock out all lower priority requests 
until all 
refresh 
cycles 
completed. 
Instead, 
the 
following 
restriction 
applies 
to all request 
types 
with 
higher 
priority 
than refresh: 
high priority 
requests, 
such as 
transfer 
cycles, 
shadow 
copies, 
and external 
VRAM 
access 
must 
occur 
infrequently 
enough 
to 
allow 
proper 
refresh 
of the VRAM 
chips. 
Transfer 
cycles 
and 
shadow 
copies, 
by. their 
nature, 
occur 
infre- 
quently 
so they are not generally 
a problem. 


There is a separate 
priority scheme 
for the five FIFO 
channels. 
The scheme 
used is rotating 
priority 
with 
automatic 
override 
and single 
cycle arbitration. 
Ro- 
tating priority means that the priority is assigned 
in a 
fixed cyclic order with the lowest priority given to the 
FIFO 
channel 
that 
"won" 
the 
last 
FIFO 
access. 


There is only one level of memory, 
so the order that 
requests 
arrive is not a factor 
in the arbitration. 
The 
cyclic order is given in Figure 3-2. 


As an example, 
if input FIFO q (abbreviated 
i1O)was 
the last channel 
to perform 
a cycle, the priority order 
for the next 
FIFO access 
(from 
highest 
to lowest) 
would 
be: if1, sd, 010, of1, and ifO. 


• 
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Automatic override is available so that the rotating 
cyclic priority can be bypassed if there is an. UR- 
GENT condition for one of the channels. A channel 
is urgent if the microcode processor is frozen be- 
cause the processor is waiting for that channel to be 
ready. The channel can be either an input channel 
that is empty or an output channel that is full. In this 
case, the urgent channel gets the next available cy- 
cle. However, the priority will still be lower than non- 
FIFO requests, such as refresh cycles. 


Single clock cycle arbitration means that the selec- 
tion of the next channel that will get an access oc- 
curs in a single T-cycle or T-state, either in a Ti state 
or during the last T2 state of the previous VRAM 
cycle. 


VRAM 
POINTERS 


The VRAM interface maintains VRAM pointers for 
the FIFOs, as well as display-related pointers for the 
82750DB. Internally each pointer or address is 
stored as a 30-bit value addressing a double word in 
VRAM. The pointer values are read and written as 
two 16-bit words representing a 32-bit byte address 
(refer to the Figure 3-3). With a 30-bit double word 
address, the 82750PB can decode a VRAM address 
space of 1G double words or 4GBytes. 


Input and output FIFOs can address down to a sin- 
gle word or byte in VRAM. A FIFO's pointer is post- 
incremented or post-decremented in parallel with its 
VRAM read or write cycle. 


The statistical decoder can only start decoding bit- 
streams on double word boundaries in VRAM and 
can only increment through VRAM. The decoder's 
pointer is post-incremented in parallel with each of 
its VRAM read cycles. 


Display-related pointers are updated by adding a 
pitch value to the current value during the corre- 
sponding transfer cycle. 


If a VRAM pointer appears on the B-Bus as source 
or as a destination then the following rules apply: 


Rule 1 


If a B-Bus destination refers to an address that is 
both Even and > Ox1f,then the source is restricted 
to "-10" pointers if the source refers to a pointer. 


Rule 2 


If a B-Bus destination refers to an address that is 
both Odd and >Ox1f, then the source is restricted to 
"-hi" pointers if the source refers to a pointer. 


SHADOW 
COPY 


When a VODD, VEVEN, or DFL code is received 
from the 82750DB over the VBUS, a shadow copy is 
scheduled. The actual shadow copy will occur as 
soon as the priority logic allows. Any VRAM access 
in progress must complete and a pending transfer 
cycle, if any, must be performed before the shadow 
copy can start. During the operation, shadow regis- 
ters for the Y-START, Y-PITCH, VU-START, VU- 
PITCH, 82750DB-START, and 82750DB-PITCH are 
copied into the corresponding working registers. 
During display refresh, the address arithmetic is per- 
formed on the working registers. The shadow regis- 
ters can be loaded by the host CPU or by a micro- 
code routine with less critical timing constraints, and 
then copied instantly by a shadow copy when it is 
time to update the registers, either prior to the next 
field or during the active display for split screen ef- 
fects. 
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--. 
InFIFO 
1 --. 
InFIFO 0 
--. 
outFIFO 
1 --. 
outFIFO 
0 
--. 
Statistical 
Decoder 
1 
1 


Figure 3·2. Cyclic Ordering 
of FIFOs 


31 30 29 
24 23 
16 15 
3 
2 
1 
0 


<- - - -' - - - - - - - - - - - - - - VRAM 
Address 
- - - 
- - - - - 
- - - 
- - - 
- - 30 bits 
- - - - 
- - 
- - - - 
- - 
> 


Byte Address 
within 
Double-Word 
. . . . . . . . . .. 
< - - -> 


<- Most 5ig. Word of VRAM 
Address. 
-> 
<- Least Sig. Wd. of VRAM 
Addr.-> 


Figure 3·3. VRAM Addressing 


1-30 
I 


intel~ 
82750PB 


There 
are 
actually 
two 
shadow 
registers 
for 
Y- 
START. 
One for start of odd fields and one for start 
of even fields. A VODD code causes Y-START-ODD 
to be copied 
into the working 
register 
Y-CURRENT. 
Similarly, 
a 
VEVEN 
code 
causes 
the 
Y-START- 
EVEN to be copied 
into Y-CURRENT. 
A DFL code 
causes 
the Y-START-ODD 
value to be copied 
if the 
most recent 
start of field code received 
is a VODD, 
or a Y-START-EVEN 
value if the most recent start of 
field code was a VEVEN. This allows a simple inter- 
laced or non-interlaced 
display to be refreshed 
with 
no host 
CPU 
intervention. 
For more 
complex 
dis- 
plays, such as split screens, 
the host CPU must up- 
date 
the 
shadow 
registers 
prior 
to 
each 
shadow 
copy. A shadow 
copy operation 
requires 
2 T-cycles. 


Table 
3-5. Host 
Interface 
Signals 


Host Interface 


The Host Interface 
provides 
the following 
functions: 


• 
Arbitrates 
host 
CPU 
and 
82750PB 
access 
to 
VRAM. 


• 
Provides 
the host access 
to external 
devices. 


• 
Provides 
the 
host 
access 
to 
82750PB 
internal 
registers 
and memories. 


Signals 
specific 
to the 
Host 
Interface 
are listed 
in 
Table 3-5. 


Signal 
Description 


HREQ# 
HOST REQUEST: 
Asynchronous 
request from the host for all types of 
host access. 
Used both to request and release system buses. 


HREG# 
HOST REGISTER: 
Single-ranked 
control to request host access to 
82750PB 
internal registers 
in concert 
with HRAM #. 


HRAM# 
HOST VRAM: Single-ranked 
control to request host access to VRAM in 
concert 
with HREG #. 


HALEN# 
HOST ADDRESS 
LATCH 
ENABLE: 
Asynchronous 
status from the host 
indicating 
the presence 
of valid address, write enable (transaction 
direction 
control), 
and the byte enables 
at the interface 
of the 82750PB. 


HBUSEN# 
HOST BUS ENABLE: 
82750PB 
synchronous 
status granting the host 
access to the address, write. enable, data bus, and byte enables at the 
interface 
of the 82750PB. 


HRDY# 
HOST READY: 
82750PB 
synchronous 
status to the host indicating 
the 
presence 
of valid data appearing 
at the 82750PB's 
databus for VRAM 
and register accesses 
and optionally 
for external accesses. 


HINT# 
HOST INTERRUPT: 
82750PB 
synchronous 
interrupt 
to tbe host, set 
under direct or indirect microprogram 
control. 


Signals 
common 
to the host, VRAM, 
and external 
device 
interfaces 
are listed in Table 3-6. 


Table 
3-6. Host, VRAM, and External 
Device 
Interfaces 


Signal 
Description 


A[31:2] 
ADDRESS 
BUS: System address bus used to select unique VRAM, the 
82750PB 
register, and external device locations 
that will be accessed 
under host control. 
The lower seven bits A[8:2] 
are bidirectional 
and are 
used during register accesses 


D[31:0] 
OAT A BUS: Bidirectional 
system data bus used to transfer 
data to and 
from all sources and destinations. 
When transferring 
16-bit host register 
values, the data bus MSH and LSH will both carry identical values. 


WE# 
WRITE 
ENABLE: 
Bidirectional, 
single-ranked 
signal used to determine 
the data transfer 
direction. 
When active during host register cycles, data 
flows from the host to an 82750PB 
destination. 
During host VRAM cycles, 
WE # active will define the data direction 
to be from the host to VRAM. 


BE[3:0] # 
BYTE ENABLE: 
Bidirectional 
signals used to select the bytes that will be 
modified 
during data transactions. 
All host register transactions 
are 
performed 
16 bits at a time, while VRAM may be modified 
8 bits at a time. 
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As with VRAM 
operations, 
host operations 
are described 
through 
a sequence 
of T-states. 
Table 3-7 defines 
the T-states 
used to implement 
all host transactions 
with VRAM, 
external 
devices, 
and the 82750PB. 


The master 
execution 
state diagram 
that defines 
the VRAM/Host 
transactions 
is provided 
in Figure 3-1. 


Table 3-7. 82750PB Host Transaction 
States 


State 
Description 


TA 
First state of any host transaction. 
Entry into TA will be granted after 
HREQ# 
has been asserted. 
During this state, the 82750PB 
will tri-state 


its address, data bus, write enable, and byte enable signals to provide a 
full cycle of "dead-band" 
before the assertion 
of HBUSEN #. In the state 
immediately 
following 
TA HBUSEN # will assert, allowing 
the host to drive 


the host buses. 


TB 
First cycle in which the host is granted bus access for register or VRAM 
transactions. 
The sequencer 
will remain in TB until HALEN # is received, 


indicating 
that the address write enable and byte enable signals are 
stable at the 82750PB 
pins. 


TC1 
First cycle that output data is valid. 


TCn 
This state is entered to wait for the completion 
of the current host cycle. 


The cycle is defined as complete 
when HREQ# 
deasserts. 
HRDY # is 
asserted 
along with valid data until the transition 
to state TO occurs. 


TO 
The last cycle of a host transaction. 
HBUSEN # is deasserted 
allowing 
one dead-band 
cycle to allow control 
of the address, 
data, write enable, 


and byte enable signals to be returned to the 82750PB. 


TV1 
First cycle of a Host VRAM transaction. 
Memory is requested 
and is 


followed 
by a transition 
to TV2. 


TV2 
Last cycle of a Host VRAM transaction. 
The sequencer 
will remain in TV2 


until MRDY # is received. 


A single stage of input synchronization 
is employed 


for 
HREG#, 
HRAM#, 
WE#, 
and 
BE[Oj#, 
while 


HREQ# 
and 
HALEN# 
are programmable 
to have 


one or two stages 
by bit 12 of the Microcode 
Proc- 


essor Control 
Register. See Table 3-10. T-state tran- 


sitions 
are caused 
by the synchronized 
versions 
of 


these signals. 


The synchronized 
versions 
of HREG # and HRAM # 
must 
be stable 
before 
entry 
into 
T-state 
TA. The 


synchronized 
versions 
of 
WE #, 
BE[Oj #, 
and 


HALEN# 
should 
be stable 
before 
exiting 
T-State 


TB. Once 
asserted, 
all of the above 
signals 
should 


remain stable until the deassertion 
of HBUSEN #. 


The type of host cycle to perform 
is determined 
by 


the states 
of HREG # and HRAM # as indicated 
in 


Table 3-8. 


Table 3-8. Host Cycle Types 


HREG# 
HRAM# 
Host Cycle 
Type 


1 
1 
External 


0 
1 
Register 


1 
0 
VRAM 


0 
0 
Reserved 
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HOST REGISTER 
ACCESS 


The host has access 
to the 82750PB's 
internal 
reg- 
isters and memories 
to monitor and control 
the oper- 


ation of the microcode 
processor, 
provide 
a means 


of debugging 
microprogram 
routines, 
and to function 


as the primary test port for production 
testing. 


Register 
access 
is initiated 
by the 
host 
asserting 


HREQ#, 
HREG#, 
and HRAM# 
as shown 
in Table 


3-8 and in the timing diagrams 
on pages 42 through 


45. After the host has been granted 
bus access 
by 


an active 
HBUSEN# 
in state TB, the address, 
write 


enable, and byte enables 
may be driven. After these 


signals 
have 
stabilized 
HALEN # 
is asserted, 
en- 


abling a read or a write operation 
to occur. 


NOTE:. 


Once 
HREQ# 
has 
been 
recognized 
by 
the 


82750PB, 
a HBUSEN # 
will 
always 
be generated. 


HREQ# 
is recognized 
on the rising edge of TA, but 


note 
that 
it is only 
possible 
to 
know 
this 
AFTER 


state TA has been entered. 
Designs 
which 
need to 
"abort" 
requests 
must 
be prepared 
to 
ignore 
the 


possible 
HBUSEN #. 
Also, 
it is not possible 
in the 


general 
case 
to change 
the 
type 
of host 
request 


because 
HREG # and HRAM # are also recognized 


on the rising edge of TA. 
I 


In the case of a register read, state TC1 is entered 
and the data bus is driven with the internal value. 
One cycle later, a transition to state TC occurs, and 
HRDY# activates, signaling the presence of stabi- 
lized data at the 82750PB data pins. This state (TC) 
will be maintained until the host deasserts HREQ#, 
signaling the completion of the cycle that caused a 
transition to state TD. 


In the case of a register write, TC1 is again entered 
(from TB), but the data bus may now be driven by 
the host. (During host cycles, data bus drive activity 
is indirectly controlled by WE# and an additional 
dead-band is provided by entry into state TC1 to al- 
low for internal WE# stabilization.) Stable data at 
the 82750PB interface, as well as the completion of 
the write cycle, is signaled by the deassertion of 
HREQ#. As with reads, the deactivation of HRDY# 
signals the transition to state TD. 


As state TD is entered, HRDY# and HBUSEN# 
deassert, the address data, write enable, and byte 
enables tri-state, and bus control is returned to the 
82750PB in the following cycle. 


HOST VRAM 
ACCESS 


Because the 82750PB is so closely coupled with 
VRAM, host accesses to VRAM are arbitrated and 
controlled by the 82750PB. VRAM access is initiated 
by 
the 
host 
asserting 
HREQ#, 
HREG#, 
and 
HRAM# as shown in the Host Cycle Table above 
and in the Timing Diagrams on pages 38 through 45. 
After the host has been granted bus access by an 
active HBUSEN#, the address, write enable, and 
byte enables may then be driven. After these signals 
have stabilized at the memory devices (or longest 
relevant propagation path), HALEN# is asserted, 
enabling a read or write operation to occur. 


Because VRAM will not drive the data bus until after 
a memory request, a transition into state TC1 to al- 
low for data bus direction stabilization is not re- 
quired. Instead, a transition to state TV1 occurs, 
which asserts MREQ# for a single cycle and is fol- 
lowed by a transition to TV2. TV2 will remain the 
current 
state 
until 
the 
reception 
of 
an 
active 
MRDY#. 


In the case of a VRAM read, the memory data bus 
will be driven during TV1, and valid data will appear 
in state TV2. Data will be guaranteed valid coinci- 
dent with the deassertion of MRDY# from memory. 


In the case of a VRAM write, the memory data bus is 
driven with valid data during TV1. Again the recep- 
tion of MRDY# will serve to indicate the completion 
of the memory operation. 
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NOTE: 
The host device must be able to transmit or receive 
memory data in order to be valid at the trailing 
edge of MRDY# at the data's destination (memory 
or host). 


After MRDY# becomes active, a transition from TV2 
into TC1 is accomplished to allow time to propagate 
data to the host. TC is then entered to await the 
deassertion of HREQ# 
(if it has not already oc- 
curred). TD is then entered, duplicating the dead- 
banding previously described. 


HOST EXTERNAL 
ACCESS 


In addition to VRAM and register host access, an 
external device access mechanism is provided. Dur- 
ing this access, upon the receipt of HREQ# with 
HREQ# and HRAM# inactive, the 82750PB releas- 
es the address, data, write enable, and byte enables' 
in state TA. 


The difference here is that state TC1 is directly en- 
tered from TA, thereby ignoring any transitions of 
HALEN#. Since the 82750PB also ignores the data 
bus direction control (write enable) the host and an 
external device may communicate unencumbered 
by the 82750PB. 


Entry into state TC directly follows TC1 in the ex- 
pected sequence and remains there until HREQ# is 
released. 
This 
is 
followed 
by 
entry 
into 
TD. 


HBUSEN# is asserted during the timing that TC1 
and TCN are active. 


During an external access, HRDY# is not asserted 
unless the external logic asserts MRDY# as shown 
in Figure 3-7. 


HOST REGISTER 
ADDRESS 
MAPPING 


Table 3-9 shows the host address mapping of the 
on-chip registers and memories, in terms of the off- 
set in bytes, from the base address for 82750PB 
accesses. Note that the 82750PB only supports 
word accesses to these registers. Therefore, the 
least significant bit of the byte offset should be set to 
zero. The 82750PB forms the register address from 
inputs on the A[31 :2] pins and BE# [3:0] pins. The 
A[31 :2] specify the double word address of the reg- 
ister, and combinations of the BE# pins determine 
which of the two words with the double word is being 
addressed. BE# [3:0] = 11002 selects the least sig- 
nificant word within a double word, and BE# [3:0] = 
00112 selects the most significant word within a 
double word. These are the only two valid patterns 
for BE# inputs during a host register access cycle. 
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word reads or writes to on-chip 
registers. 


When the host CPU reads or writes to areas (a, b, or 
d) and the 82750PB 
is not already 
in a HALT state, 
the 
microcode 
processor 
is automatically 
HALTED 
for the one T-cycle actually 
required to complete 
the 
data 
transfer, 
and 
then 
the 
processor 
is restarted 
after the transfer 
is complete. 
If the 82750PB 
is in a 
HALT state when the host access 
is initiated, 
it will 
remain in the HALT state following 
the completion 
of 
the access. This is transparent 
to both the host CPU 
and the microcode 
processor. 
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During reset of the 82750PB, 
the HALT bit is set to a 


one, the six Interrupt 
Enable 
bits are reset to zero, 


the Disable SYNC bit is set to zero, the PMON/FRZ 
bit is set to zero (so that the FRZ signal 
is output), 


and the Enable 82750PB 
bit is reset to zero (so that 


on reset, the 82750PB 
starts 
in a 82750PA 
emula- 


tion mode). 
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Table 3·10. Bit Assignments 
for Mlcrocode 
Processor 
CONTROL 
Register 
(Wrlte-only, 
Byte Offset = Ox100I 


Bit 
Name 
Description 


BitO 
HALT 
1 = Microcode 
Processor 
Halt 
o = Microcode 
Pr.ocessor Run 


Bit 1 
SINGLE-STEP 
1 = Execute One Instruction 
and then Halt 
(Only when Already 
Halted, Bit 0 = 
1) 
0= 
No Action 


Bit 2 
Enable MCINT 
1 = Enable Microcode 
Interrupts 
to Host CPU 
o = Mask Microcode 
Interrupts 


Bit 3 
EnableVBI 
1 = Enable Vertical 
Blanking 
Interrupt to Host CPUt 
o = Mask Vertical 
Blanking 
Interrupt 


Bit 4 
Enable DFL 
1 = Enable DFL Interrupt to Host CPU 
o = Mask DFL Interrupt 


Bit 5 
Enable SO 
1 = Enable 82750DB 
Shutdown 
Interrupt to Host 
o = Mask SO Interrupt 


Bit 6 
EnableOFI 
1 = Enable Odd Field Interruptt 
o = Mask OF Interrupt 


Bit 7 
Enable EFl 
1 = Enable Even Field Interruptt 
o = Mask EF Interrupt 


Bits 8-11" 
1 = RESERVED; 
Write as Zeros 


Bit 12 
Disable SYNC 
1 = Disable Synchronizers 
for HREQ# 
IHALEN# 
o = Enable Synchronizers 
for HREQ# 
IHALEN 
# 


Bit 13 
PMON/FRZ 
1 = Output FRZ # Signal on PMFRZ # Pin 
o = Output PMON # Signal on PMFRZ # Pin 


Bit 14 
1 = RESERVED; 
Write as Zero 


Bit 15 
Enable 82750PB 
1 = Enable 82750PB 
Mode 
o = Enable 82750PA 
Emulation 
Mode 


•All other 
bits are reserved 
for future 
use, and should 
be written 
as zeros. 
tOnly 
one of these 
bits should 
be set. 


Bit 3, when 
set, enables 
an interrupt 
when either 
VODD 
or VEVEN 
VB US codes 
are received. 
Bit 6, when 
set, enables 
an interrupt 
when 
VODD 
VB us codes 
are received. 
Bit 7, when 
set, enables 
an interrupt 
when 
VEVEN 
VBUS 
codes 
are received. 
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The INTERRUPT FLAG register holds a flag for 
each of the six interrupt sources. A flag bit is set to a 
one when the interrupt condition is detected (inde- 
pendent of the state of the corresponding Interrupt 
Enable/Mask bit in the CONTROL register), and all 
flags are cleared to zero each time the INTERRUPT 
FLAG register is read. If this register is read during 
the same cycle that an interrupt condition is detect- 
ed, the flag bit corresponding to that interrupt condi- 
tion will remain at a one. This new interrupt condition 
will then be seen by the host processor when it next 
reads the INTERRUPT FLAG register. The flag in- 
sures that an interrupt is not lost if it occurs at the 
same cycle that the INTERRUPT FLAG register is 
read (and reset). In addition, the Microcode Interrupt 
source has an overflow flag that indicates if more 
than one Microcode Interrupt has occurred since the 
Interrupt Flag register was last read. The bit assign- 
ments for the INTERRUPT FLAG register are listed 
in Table 3-11. 
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The PROCESSOR STATUS register 
holds four 
status bits: HALT, FREEZE, PMON, and SYNC 
status. HALT indicates that the processor is HALT- 
EO due to a HALT bit in the CONTROL register be- 
ing set to a ONE or due to the HALT# pin being 
asserted. FREEZE indicates that the processor is 
waiting for one of the VRAM channels to become 
ready or IS waiting for an access to the VRAM point- 
er RAM. PMON is a signal that can be toggled by a 
special ALU opcode or a special B source code. 
This signal can be used for performance monitoring 
of microcode. SYNC status bit indicates the pres- 
ence or absence of the internal synchronizers for 
HREQ# and HALEN# inputs. In addition, the Inter- 
rupt Mask bits that are written into the PROCESSOR 
CONTROL register can be read from this register. 
These mask bits are read in the same polarity that 
they are written, but note that the bit positions and 
bit ordering are not consistent with the PROCES- 
SOR CONTROL register. The bit assignments for 
this register are given in Table 3-12. 


Address mapping for areas (a), (b), and (d) are given 
in Tables 3-13 to 3-15. 


Table 3-11. Bit Assignments for INTERRUPT FLAG Register 
(Read-Only, Byte Offset = Ox100) 


Bit 
Description 


Bit 8:0 
Not Used,the State of These Bits Are Not Specified 


Bit 9 
EF Interrupt Flag 


Bit 10 
OF Interrupt Flag 


Bit 11 
MCINT Overflow Flag 


Bit 12 
827500B Shutdown Interrupt 


Bit 13 
MCINT Microcode Interrupt 


Bit 14 
VBI Vertical Blanking Interrupt 


Bit.15 
OFL Display Format Load Interrupt 
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Table 3-12. Bit Assignments for PROCESSORSTATUS Register 
(Read-Only, Byte Offset = Ox102) 


Bit 
Description 


Bit 0 
HALT (1 = Halted, 0 = Running) 


Bit 1 
FREEZE (1 = Frozen, 0 = Running) 


Bit 2 
PMON (1 = Active, 0 = Inactive) 


Bit 3 
Synchronizers 
on HREQ# 
/HALEN# 
(0 = Enabled, 
1 = Disabled) 


Bit 9:4 
Not Used, the State of These Bits is Not Specified 


Bit 10 
MCINT Microcode 
Interrupt 
Mask 


Bit 11 
VBI Vertical 
Blanking 
Interrupt 
Mask 


Bit 12 
DFL Display Format Load Interrupt 
Mask 


Bit 13 
82750DB 
Shutdown 
Interrupt 
Mask 


Bit 14 
OF Interrupt 
Mask 


Bit 15 
EF Interrupt 
Mask 


I 
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Table 3-13. 82750PB 
A Bus Source/Destination 
Address 
Mapping 


Address 
(Hex) 
ADST 
ASRC 


OxOOO 
Null 
Null 


OxOO2 
hwid 


, 
- 


OxOO4 
cc 


OxOO6 
maddr 


OxOO8 
alu 


OxOOA 
cnt 
cnt 


OxOOC 
cnt2 
cnt2 


OxOOE 
Icnt 
Icnt 


Ox010 
rO 
rO 


Ox012 
r1 
r1 


Ox014 
r2 
r2 


Ox016 
r3 
r3 


Ox018 
r4 
r4 


Ox01A 
rS 
rs 


Ox01C 
r6 
r6 


Ox01E 
r7 
r7 


Ox020 
mcode3 
mcode3 


Ox022 
mcode2 
mcode2 


Ox024 
mcode1 
mcode1 


Ox026 
pc 
pc 


Ox028 
pixint-c 


Ox02A 
pixint 
pixint 


Ox02C 
"drarnt 
"drarnt 


Ox02E 
"dram2 
·dram2 


Ox03O 
"drarnt 
+ + 
"drarnt 
+ + 


Ox032 
"drarnz+ 
+ 
"drarnz+ 
+ 


Ox034 
"dram1- 
- 
"dram1- 
- 


Ox036 
·dram2- 
- 
"dram2- 
- 


Ox038 
dram1 
dram1 


Ox03A 
dram2 
dram2 


Ox03C 
dram3 
dram3 


Ox03E 
dram4 
dram4 


Ox040 
"out t 
·in1 
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Address 
(Hex) 
ADST 
ASRC 


Ox042 
out1 + + 
·in2 


Ox044 
shift-hi 
·stat 


Ox046 
out1-hi 
·stat# 


Ox048 
·out2 


Ox04A 
out2+ 
+ 


Ox04C 
shift-r 


Ox04E 
out2-hi 


OxOSO 
outt-c 


OxOS2 
in1-c 


OxOS4 
shift-I 


OxOS6 
in1-hi 


OxOS8 
out2-c 


OxOSA 
in2-c 


OxOSC 


OxOSE 
in2-hi 


Ox060 
r8 
r8 


Ox062 
r9 
r9 


Ox064 
r10 
r10 


Ox066 
r11 
r11 


Ox068 
r12 
r12 


Ox06A 
r13 
r13 


Ox06C 
r14 
r14 


Ox06E 
r1S 
r1S 


Ox070 
cc 
shift 


Ox072 
tcnt 
tcnt 


Ox074 
"drama 
"dram3 


Ox076 
·dram4 
·dram4 


Ox078 
·dram3+ 
+ 
·dram3+ 
+ 


Ox07A 
·dram4+ 
+ 
·dram4+ 
+ 


Ox07C 
·dram3- 
- 
"dram3- 
- 


Ox07E 
·dram4- 
- 
·dram4- 
- 
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Table 3-14. 82750PB 
B Bus Source/Destination 
Address 
Mapping 


Address 
(Hex) 
BDST 
BSRC 


Ox080 
Null 
Null 


Ox082 
alu 


Ox084 
°dram3 
°dram3 


Ox086 
°dram4 
°dram4 


Ox088 
°dram3+ 
+ 
°dram3+ 
+ 


Ox08A 
°dram4+ 
+ 
°dram4+ 
+ 


OX08C 
°dram3- 
- 
°dram3- 
- 


Ox08E 
°dram4- 
- 
°dram4- 
- 


Ox09O 
rO 
rO 


Ox092 
r1 
r1 


Ox094 
r2 
r2 


Ox096 
r3 
r3 


Ox098 
r4 
r4 


Ox09A 
r5 
r5 


Ox09C 
r6 
r6 


Ox09E 
r7 
r7 


OxOAO 
r8 
°in1 


OxOA2 
r9 
°in1 


OxOA4 
r10 
°stat 


OxOA6 
r11 
°stat # 


OxOA8 
r12 
circbuf 


OxOAA 
r13 


OxOAC 
r14 


OxOAE 
r15 


OxOBO 
circbuf 
literal 0 


OxOB2 
literal 1 


OxOB4 
°dram1 
literal 2 


OxOB6 
°dram2 
literal 3 


OxOB8 
°dram1 + + 
literal 4 


OxOBA 
°dram2+ 
+ 
literal 5 


OxOBC' 
°dram1- 
- 
literal 6 


OxOBE 
°dram2- 
- 
literal 7 


OxOCO 
°out1 
prof 


I 


Address 
(Hex) 
BDST 
BSRC 


OxOC2 
out1 + + 


OxOC4 
out1-lo 
outt-lo 


OxOC6 
out1-hi 
out1-hi 


OxOC8 
°out2 
stat-Io 


OxOCA 
out2+ 
+ 
stat-hi 


OxOCC 
out2-lo 
out2-lo 


OxOCE 
out2-hi 
out2-hi 


OxODO 
out1-c 
outt-c 


OxOD2 
in1-c 
in1-c 


OxOD4 
in1-lo 
in1-lo 


OxOD6 
in1-hi 
in1-hi 


OxOD8 
out2-c 
out2-c 


OxODA 
in2-c 
in2-c 


OxODC 
in2-lo 
in2-lo 


OxODE 
in2-hi 
in2-hi 


OxOEO 
stat-ram 
r8 


OxOE2 
stat-e 
r9 


OxOE4 
stat-Io 
r10 


OxOE6 
stat-hi 
r11 


OxOE8 
yeven-lo 
r12 


OxOEA 
yeven-hi 
r13 


OxOEC 
yodd-Io 
r14 


OxOEE 
yodd-hi 
r15 


OxOFO 
ypitch 
shift 


OxOF2 
stat-e 


OxOF4 
vu-Io 
"drarnt 


OxOF6 
vu-hi 
°dram2 


OxOF8 
vupitch 
°dram1 + + 


OxOFA 
vpitch 
°dram2+ 
+ 


OxOFC 
vptr-Io 
°dram1- 
- 


OxOFE 
vptr-hi 
°dram2- 
- 
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Table 3-15. VRAM Pointer 
RAM Mapping 


Byte Address 
Name 
Description 


Ox180 
Yw-Io 
Working Copy of Y Pointer 
Ox182 
Yw-hi 


Ox184 
out1-lo 
Output FIFO 1 Pointer 
Ox186 
out1-hi 


Ox188 
Yw-pitch 
Working Copy of Y Pitch 


Ox18A 
RESERVED 


Ox18C 
out2-lo 
Output FIFO 2 Pointer 
Ox18E 
out2-hi 


Ox190 
VUw-lo 
Working 
Copy of VU Pointer 
Ox192 
VUw-hi 


Ox194 
in1-lo 
Input FIFO 1 Pointer 


Ox196 
in1-hi 


Ox198 
VUpitchw 
Working Copy of VU Pitch 


Ox19A 
vpitchw 
Working Copy of 827500B 
Pitch 


Ox19C 
in2-lo 
Input FIFO 2 Pointer 


Ox19E 
in2-hi 


Ox1AO 
vptrw-Io 
Working Copy of 827500B 
Pointer 
Ox1A2 
vptrw-hi 


Ox1A4 
stat-Io 
Working Copy of Statistical 
Decoder 
Pointer 
Ox1A6 
• 
stat-hi 


Ox1A8 
Yeven-Io 
Shadow Copy of Y Start Even Pointer 
Ox1M 
Yeven-hi 


Ox1AC 
Yood-to 
Shadow Copy of Y Start Odd Pointer 


Ox1AE 
Yodd-hi 


Ox1BO 
Ypitch 
Shadow Copy of Y Pitch 


Ox1B2 
rfcnt 
RFSH Cycles per RFSH Code from 827500B 


Ox184 
VU-Io 
Shadow Copy of VU Start Pointer 


Ox1B6 
VU-hi 


Ox1B8 
VU pitch 
Shadow Copy of VU Pitch 


Ox1BA 
vpitch 
Shadow Copy of 827500B 
Pitch 


Ox1BC 
vptr-lo 
Shadow Copy of 827500B 
Pointer 


Ox1BE 
vptr-hi 


NOTE: 
Register 
rfent is a write-only 
register 
and should 
never be read. 


Initializing the 82750PB 


The 82750PB 
is placed 
in a RESET state by assert- 


ing RESET# 
for at least ten T-cycles. 
In the RESET 


state, which continues 
until RESET # is released, 
all 
of the 82750PB's 
outputs 
are tri-stated 
for compati- 
bility with board test requirements. 


leasing 
RESET #. This is referred 
to as the INITIAL 


state. 
In the INITIAL state: 


• 
The microcode 
processor 
is halted. 


• 
All six interrupts 
are masked, 
and the 
interrupt 
latches 
are cleared. 


• 
The 82750PAl82750PB 
instruction 
format 
select 


bit is set to the 82750PA. 


• 
The VRAM 
interface 
is ready 
to service 
VRAM 
requests; 
however, 
none 
of the VRAM 
pointers 
are valid. 


Proper initialization 
of the 82750PB 
requires that the 


82750PB 
is held in a RESET 
state 
by keeping 
RE- 


SET# 
active 
for at least 
10 T-cycles, 
and then re- 
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• The number of refresh cycles that will be generat- 
ed each time a RFSH code is received from the 
8275008 is set to 14 cycles. 


• All bidirectional I/O pins are tri-stated. 


After the 82750P8 has been initialized, i.e., placed in 
the 
INITIAL 
state, 
but 
prior 
to 
releaslnq 
the 
827500B's 
reset signal, the following operations 
must be performed: 


• Load the REFRESH-CYCLES-PER-LlNEregister 
with the appropriate value (the equation for the 
value is: VALUE = (2N - 
1),where N is the num- 
ber of cycles; for example, 5 refresh cycles would 
result in VALUE = 25 - 
1 =' 3110 = 001F16. 
The refresh register is 14 bits wide and the way it 
works is to generate one refresh every time a 
right shift results in a '1' bit. It continues the right 
sifting until it finds a '0' bit and halts, Hence from 
programming point of view: 001F16 = FFOF16= 
5 refresh cycles per line. 


• Load the shadow copies of Y, VU, and 827500B 
pointers and pitches. 
• Load the appropriate 827500B Register Load list 
into VRAM starting at the address pointed to by 
.the 8275008 pointer. 


Prior to releasing the microcode processor from its 
HALTed state to run a microcode program, the fol- 
lowing operations must be performed: 
• If 82750PB code is to be executed, bit 15 of the 
82750PB CONTROL register must be set to a 
one. 
• Load a microcode program into microcode RAM 
on the 82750PB by writing to the three instruction 
word registers (mcode 1 - 
the most significant 
word 
of 
the 
instruction, 
mcode2, 
and 
mcode3 
- 
the least significant word of the in- 
struction, the one containing the next address 
field) and then writing to maddr, 
the address in 
microcode RAM where the instruction will be 
loaded. 
• Load the PC with the address in microcode RAM 
of the first instruction to be executed. 
• Write to the 82750PB CONTROL register with the 
HALT bit (bit 0) set to zero, causing the processor 
to start executing an instruction sequence, or with 
the SINGLE-STEP bit (bit 1) set to a one (keeping 
HALT also set to one), causing the processor to 
execute a single instruction. 


Performance 
Monitoring 


Two signals, FRZ# and PMON#, which are useful 
for microcode performance monitoring, are available 
I 


82750PB 


both as external signals, multiplexed on a single out- 
put pin, and as bits in the Processor Status register. 
FRZ# is active for each T-cycle when the micro- 
code processor is frozen, waiting for access to 
VRAM or to the VRAM Pointer RAM. PMON# can 
be toggled by a special ALU opcode or a special B 
bus source code. This allows PMON# to be used to 
indicate what particular segment of microcode is be- 
ing executed. The PMON/FRZ bit in the Processor 
Control register selects the signal that is being out- 
put. 


Freezes may indicate that the microcode routine is • 
not making the most efficient use of the input and 
output FIFO buffering. This is particularly important 
for the inner loops of graphics and video routines 
that are memory-bandwidth limited. Ideally, inner 
loops should be balanced so that the rate pixels are 
processed is equal to the rate that they can be read 
from and written to VRAM with no freezes. The buff- 
ering in the input and output FIFOs serve to make 
sequential reads and writes to VRAM more efficient 
by performing full 64-bit reads and writes, instead of 
individual 8-bit or 16-bit accesses. This has the ef- 
fect of averaging the VRAM read/write rate over a 
number of instruction times. For example, if the 
82750PB is performing a 64-bit read or write every 8 
T-cycles, for an average of 8 bits per T-cycle, a two 
instruction inner loop could read one 8-bit pixel and 
write one 8-bit pixel without any freezes occurring 
(assuming the source pixels and the destination pix- 
els are each sequential). 


The PMON# provides a more standard perform- 
ance monitoring capability by indicating when a par- 
ticular segment of microcode, bracketed by special 
instructions that toggle the PMON# signal, is being 
executed. This allows either absolute execution-time 
measurement or measurement of the fraction of the 
total execution time that is required by the segment. 
Either the ALU opcode "prof" or the B bus source 
code "prof" will toggle the PMON signal. 


An external HALT pin is provided on the 82750PB to 
allow external debugging hardware to immediately 
halt the microcode processor. Activating this input 
causes the microcode processor to halt prior to exe- 
cuting the next instruction. When the processor is 
halted, the VRAM interface portion of the 82750PB 
continues to operate normally, performing transfer 
cycles, refresh cycles, and shadow copies as re- 
quested by the 827500B. 


Host/VRAM 
Timing Diagrams 


Figures 3-4 through 3-8 are Host/VRAM Timing Dia- 
grams. 
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A[31:3) 


A[l) 


BEI[3:0) 


NXTFST# 


TRNFR# 


MROY# 


CLK 


NOTES: 
1. Address pin A[2] is always ZERO for the first cycle of a cycle pair and ONE for the second cycle. 
2. The two cycles of a cycle pair are both writes or both reads. 
3. Beware that the address changes (A[2] and BE[3:0l) on state TF1. This is one cycle after the data changes on a 
write cycle. 
4. The minimum number of cycles for a VRAM read of two words is 4, the sequence illustrated shows one extra T2. 
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Figure 3-4 VRAM Read and Write Cycles 
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Figure 3-5. VRAM Transfer 
and Refresh 
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NOTES: 
1, MREQ#, RFSH#, TRNFR#, and NXTFST# remain inactive during Host Register Read and Write cycles. 
2, If HALEN#/HREQ# 
synchronizers are disabled then the second Ti and Tb states will be missing. 


3, Please see note on page 32. 
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Figure 3-6. Host Register 
Read and Write Cycles 
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NOTES: 
1. MREQ#, 
RFSH# 
TRNFR#, 
and NXTFST# 
remain 
inactive 
during 
Host External 
Read and Write cycles. 


2. If the Synchronizer 
on HREQ# 
is disabled, 
then the second 
Ti state will be missing. 


3. Please see note on page 32. 
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4.0 
MICROCODE 
INSTRUCTION 
FORMAT 


Overview 


The 82750PB executes two slightly different instruc- 
tion formats: one that is backward compatible with 
the 82750PA and another that allows full access to 
the 
microcode resources of the 82750PB. The 
82750PA/82750PB bit in the 82750PB processor 
control register determines which instruction format 
is in effect (see Chapter 3). On reset, the 82750PB is 
placed in 82750PA instruction format mode. In this 
mode the 82750PB will execute binary microcode 
originally assembled for the 82750PA in a manner 
that is functionally equivalent to the 82750PA. 


The following description applies to the 82750PB in- 
struction format. Exact definitions of 82750PB in- 
struction formats and field codings are shown in Fig- 
ure 4-2 and Table 4-5. 


Instruction 
Sequencing 


The instruction word for 82750PB's microcode proc- 
essor is 48 bits wide. The Microcode RAM holds 512 
instructions. Nine bits of each instruction specify the 
address of the next instruction to be executed. Each 
instruction fetch reads two instructions (an odd ad- 
dress and even address pair) using the upper eight 
bits of the 9-bit instruction address. Both the LSB of 
the instruction address and a Condition Flag bit, se- 
lected from eight possible branching conditions, are 
used to determine whether the next instruction to be 
executed is the even address instruction or odd ad- 
dress instruction, according to the logic table shown 
as Table 4-1. 


Table 4-1. Mlcrocode 
Next Instruction 
Selection 


LSBof 
Condition 
Next 
Address 
Flag State 
Instruction 


0 
o (FALSE) 
EVEN 


0 
1 (TRUE) 
EVEN 


1 
o (FALSE) 
ODD 


1 
1 (TRUE) 
EVEN 
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For an unconditional branch, the condition flag 
FALSE (which is always zero) is selected; this caus- 
es the LSB of the address to be passed through to 
select the next instruction: LSB = 0 selects EVEN 
and LSB = 1 selects ODD. This allows uncondition- 
al branching to any of the 512 instructions in the 
RAM. For a conditional branch, the LSB of the ad- 
dress is set to a one; this causes the state of the 
condition flag to select the next instruction: FALSE 
selects the ODD instruction and TRUE selects the 
EVEN instruction. Therefore, a conditional branch 
jumps to either the odd or even instruction of an 
oddi even pair depending on the state of the condi- 
tion. 


Instruction 
Word Field Descriptions 


Each field of the microcode instruction format is de- 
scribed in the following sections. 


NADDR-NEXT 
INSTRUCTION 
ADDRESS 
FIELD 


This field holds the address of the next instruction to 
be executed. Taking advantage of the fact that the 
microcode RAM is physically organized as 256 deep 
by 96 wide (two instructions are fetched per read 
cycle), 
a 
zero 
delay 
two-way 
branch 
can 
be 
achieved. The only case in which this field is not 
used to determine the address of the next instruc- 
tion to be executed is when an instruction writes to 
the PC. (The term PC refers to the register that holds 
the address of the next instruction to be executed.) 
When an instruction loads the PC a one instruction 
delay occurs before the load takes effect. Therefore, 
the instruction pointed to by the next instruction field 
of the instruction that loads the PC is executed be- 
fore the jump to the new address occurs. This is 
shown in Table 4-2. 


There are no restrictions on the instruction following 
a PC load; it will always be executed, even while 
single stepping the processor, or if the processor is 
frozen on that instruction. 


CFSEL-CONDITION 
FLAG SELECT 
FIELD 


This field selects which condition flag will be used 
with the LSB of NADDR to select the next instruction 
from the odd/even pair. The condition flag assign- 
ment is given in Table 4-3. 
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Table 
4-2. PC Load 
Example 


Addr 
Instruction 
NADDR 
Comments 


10 
pc = 0 
55 
Load PC with zero. 


55 
rO = 1 
X 
This instruction 
is executed 
but its next 
address field is ignored. 


0 
r1 = rO 
25 
PC load takes effect after a one instructon 
delay, 
the result is that r1 = rO = 1. 


Table 4-3. Condition 
Flag Select 
Field Assignments 


Value 
Flag 
Description 


000 
FALSE 
Select for Unconditional 
Branch 


001 
CARRY 
Carry Out from ALU Condition 
Flag Latch 


010 
OVF 
Overflow 
from ALU Condition 
Flag Latch 


011 
SIGN 
Sign from ALU Condition 
Flag Latch 


100 
ZERO 
Zero from ALU Condition 
Flag Latch 


101 
LCNTZ 
TRUE if Selected 
Loop Counter 
= 0 


110 
LSB 
LSB of Data Register 
rO 


111 
MSB 
MSB of Data Register 
rO 


NOTE: 
The ALU condition flags (CARRY, OVF, SIGN, and ZERO) are latched in the ALU Condition Flag register. This register is 
updated for most-but 
not all-ALU 
operations. The remaining flags (LCNTZ, LSB, and MSB) are updated and latched each 
cycle. 


ASRC-A 
BUS SOURCE 
SELECT 
FIELD 
CNT -DECREMENT 
LOOP COUNTER 
BIT 


This field 
selects 
the element 
that 
should 
drive 
its 
data onto the A bus during the execution 
of this in- 
struction. 
The 
mapping 
for 
this 
and 
the 
following 
three fields is provided 
in Chapter 6. 


A one in this bit position 
causes 
the selected 
Loop 
Counter 
(selected 
by LC, the loop counter 
select bit) 
to be decremented. 
The new value of the loop coun- 
ter and the updated 
LCNTZ 
condition 
flag are not 
ready until the next instruction 
cycle. Therefore, 
in a 
loop 
where 
the 
loop 
counter 
is decremented 
and 
tested 
for zero in the same instruction 
(typically 
in a 
one 
instruction 
loop), 
the 
start 
value 
for the 
loop 
counter 
should be one less than the number of times 
the loop should be executed. 


ADST-A 
BUS DESTINATION 
SELECT 
FIELD 


This field 
selects 
which 
element 
should 
latch 
data 
from the A bus during the execution 
of this instruc- 
tion. See ASRC above. 


BSRC-B 
BUS SOURCE 
SELECT 
FIELD 
LIT-LITERAL 
SELECT 
BIT 


Same as ASRC, but for B bus. See ASRC above. 
When this bit is a one, the ASRC and CFSEL fields 
are replaced 
with a s-blt literal value that is driven as 
a source in the least Significant 9 bits of the A bus. In 
thiscase, 
the upper 7 bits of the A bus are forced 
to 
zeros. The mapping of bits from the literal field to the 
A bus is shown 
in Figure 4-1. 


BDST-B 
BUS DESTINATION 
SELECT 
FIELD 


Same as ADST, but for B bus. See ADST above. 


NOTE: 


A conditional 
branch 
and a literal 
on the A bus 
are not allowed 
in the same 
instruction. 
A 3-bit 
literal can be placed 
on the B bus in any instruc- 
tion. 
I 
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Inst. Word Bits 
ASRC Field 
CFSEL Field 


+- ..... 
Forced to Zero- • •• -+ 


4 
A bus bits 
15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
3 
2 
o 


17 
16 
15 
14 
13 
12 
11 
10 
9 
+- 
-+ 
+- .. -+ 


SHFT -SHIFT 
CONTROL 
FIELD 


Figure 4·1. Literal Field Mapping onto a Bus 


This field controls the bit shifting and byte swapping 
logic associated with register rO. The encoding of 
this field is given in Table 4-4. 


Table 4·4. SHIFT Control 
Field Coding 


SHFT 
Operation 


00 
No Shift or Swap Operation 


01 
Shift rO Right One Bit 
Position, Sign Extend 


10 
Shift rO Left One Bit 
Position, Zero Fill 


11 
Byte Swap the Value 
Being Loaded into rO* 


'Byte swapping only works when rO IS the destination on 
the A bus or the B bus. It does not swap data held in rO, 
only data being loaded. In order to byte swap data in regis· 
ter rO, rO must be both a source and destination for either 
the A or B bus. 


ALUS5-ALU 
SOURCE 
SELECT 
BITS 


These two bits are used as enables for the two ALU 
input latches. Bit 39 enables the latch that connects 
to the A bus; bit 38 enables the latch connected to 
the B bus. A one in either bit position causes the 
corresponding input latch to latch the value on the 
bus to which it is connected (the A or B bus). A zero 
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on either bit causes the corresponding latch to hold 
its current content. This allows the ALU operands 
either to come from "eavesdropping" on the A or B 
bus transfers occurring in the current instruction cy- 
cle or to be held for multiple instruction cycles in 
either the A or B input latch. 


ALUOP-ALU 
OPERATION 
CODE 
FIELD 


This field specifies the ALU instruction to be per- 
formed during the current instruction cycle. The en- 
coding of this field is given in Figure 4-2. Normally, at 
the end of the instruction execution, the result of the 
ALU operation is latched in the ALU output latch that 
can be a source on either the A or B buses. Howev- 
er, if a NOP is selected for the ALU operation, the 
ALU output latch is not latched. The data is held 
from the previous instruction. In addition to NOP, 
certain other ALU opcodes do not actually perform 
ALU operations and therefore, do not latch the ALU 
results. They are INT (microcode interrupt) and the 
PROF instruction. 


Le-LOOP 
COUNTER 
SELECT 
BIT 


This bit selects which of the two loop counters is to 
be used for decrementing or Loop-Counter-Zero 
conditional branching in the current instruction. A 
zero selects loop counter zero and a one selects 
loop counter one. 


Refer to the Intel 82750PB 
Microcode 
Programming 
Guide for more information on microcode proqrarn- 
ming, order #466718-001. 
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Table 4·5. 82750PB 
Source/Destination 
Coding 


Address 
(Hex) 
BDST 
BSRC 
ADST 
ASRC 


OxO 
Null 
Null 
Null 
Null 


Ox1 
alu 
hwid 


Ox2 
°dram3 
°dram3 
cc 


Ox3 
°dram4 
°dram4 
maddr 


Ox4 
°dram3+ 
+ 
°dram3+ 
+ 
alu 


Ox5 
°dram4+ 
+ 
°dram4+ 
+ 
cnt 
cnt 


Ox6 
°dram3- 
- 
°dram3- 
- 
cnt2 
cnt2 


Ox7 
°dram4- 
- 
°dram4- 
- 
Icnt 
Icnt 


OxB 
rO 
rO 
rO 
rO 


Ox9 
r1 
r1 
r1 
r1 


OxA 
r2 
r2 
r2 
r2 


OxB 
r3 
r3 
r3 
r3 


OxC 
r4 
. 
r4 
r4 
r4 


OxD 
r5 
rs 
rs 
rs 


OxE 
r6 
r6 
r6 
r6 


OxF 
r7 
r7 
r7 
r7 


Ox10 
rB 
°in1 
mcode3 
mcode3 


Ox11 
r9 
°in2 
mcode2 
mcode2 


Ox12 
r10 
°stat 
mcode1 
mcode1 


Ox13 
r11 
°stat# 
pc 
pc 


Ox14 
r12 
circbuf 
pixint-c 


Ox1S 
r13 
pixint 
pixint 


Ox16 
r14 
"dramt 
"drarnt 


Ox17 
r1S 
°dram2 
°dram2 


Ox1B 
circbuf 
literal 0 
°dram1 + + 
°dram1 + + 


Ox19 
literal 1 
°dram2+ 
+ 
°dram2+ 
+ 


Ox1A 
°dram1 
literal 2 
°dram1- 
- 
°dram1- 
- 


Ox1B 
°dram2 
literal 3 
°dram2- 
- 
°dram2- 
- 


Ox1C 
°dram1 + + 
literal 4 
dram1 
dram1 


Ox1D 
°dram2+ 
+ 
literalS 
dram2 
dram2 


Ox1E 
°dram1- 
- 
literal 6 
dram3 
dram3 


Ox1F 
°dram2- 
- 
literal 7 
dram4 
dram4 


Ox20 
°out1 
prof 
°out1 
°in1 
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Table 
4·5. 82750PB 
Source/Destination 
Coding 
(Continued) 


Address 
(Hex) 
BDST 
BSRC 
ADST 
ASRC 


Ox21 
out1 + + 
out1 + + 
"in2 


Ox22 
outt-lo 
out1-lo 
shift-rl 
"stat 


Ox23 
out1-hi 
out1-hi 
out1-hi 
"stat# 


Ox24 
"out2 
stat-lo 
"out2 


Ox25 
out2+ 
+ 
stat-hi 
out2+ 
+ 


Ox26 
out2-lo 
out2-lo 
shift-r 


Ox27 
out2-hi 
out2-hi 
out2-hi 


Ox28 
outt-c 
out1-e 
out1-c 


Ox29 
in1-e 
in1-e 
in1-c 


Ox2A 
in1-lo 
int-lo 
shift-I 


Ox2B 
in1-hi 
irit-hi 
in1-hi 


Ox2C 
out2-e 
out2-e 
out2-e 


Ox2D 
in2-e 
in2-e 
in2-e 


Ox2E 
in2-lo 
in2-lo 


Ox2F 
in2-hi 
in2·hi 
in2-hi 


Ox30 
stat-ram 
r8 
r8 
r8 


Ox31 
stat-e 
r9 
r9 
r9 


Ox32 
stat-Io 
r10 
r10 
r10 


Ox33 
stat-hi 
r11 
r11 
r11 


Ox34 
yeven-Io 
r12 
r12 
r12 


Ox35 
yeven-hi 
r13 
r13 
r13 


Ox36 
yodd-lo 
r14 
r14 
r14 


Ox37 
yodd-hi 
r15 
r15 
r15 


Ox38 
ypiteh 
shift 
cc 
shift 


Ox39 
stat-e 
tent 
tent 


Ox3A 
vu-le 
"dram1 
"dram3 
"dram3 


Ox3B 
vu-hi 
"dram2 
"dram4 
"dram4 


Ox3C 
vupiteh 
"dram1 + + 
"dram3+ 
+ 
"dram3+ 
+ 


Ox3D 
vpiteh 
"dram2+ 
+ 
*dram4+ 
+ 
'dram4+ 
+ 


Ox3E 
vptr-Io 
'dram1- 
- 
"dram3- 
- 
"dram3- 
- 


Ox3F 
vptr-hi 
"dram2- 
- 
"dram4- 
- 
"dram4- 
- 
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47 
46 
45 
44 
43 
42 
41 
40 
39 
38 
37 
36 
35 
34 
33 
32 
31 
30 
29 
28 
27 
26 
25 
24 


mcodel 
mcode2 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
o 
1 
15 
14 
13 
12 
11 
10 
9 
8 


LC 
SHFT 
ALU 
ALU 
LIT 
CNT 
B Bus 
B Bus 


bit 
SEL 
CNTL 
OPCOOE 
SS 
Destination 
Source 


coding 
1 
2 
5 
2 
1 
1 
6 
6 


0x0 
enl 
nop 
NOP 
hold 
nop 
nop 
null 
null 


Oxl 
ent2 
shftr 
ZERO 
latb 
I~ 
dec 
alu 
I: 
shftl 
a 
lata 
'dram3 
'dram3 


swap 
b 
both 
'dram4 
'dram4 
~ 
-a 
-orama + + 
'dram3 + + 
~ 
-b 
'dram4 
+ + 
'dram4 + + 
~ 
& 
'dram3 
'dram3 
~ 
-& 
'dram4 
'dram4 
"oxs- 
&- 
rO 
rO 
"oxs- 
++ 
rl 
rl 
~ 
r2 
r2 
~ 
- 
r3 
r3 
"oxc 
I- 
. 
r4 
r4 
~ 
< 
r5 
r5 
~ 
r6 
r6 
~ 
- + < 
r7 
r7 
~ 
+ 
r8 
"lnt 
~ 
r9 
'jn2 
~ 
+ 
rID 
'stat 
f---o;13 
-a 
rtt 
'stat# 
~ 
b 
r12 
circbuf 
f-i;;s 
a + + 
r13 
~ 
b + + 
r14 
~ 
a 
r15 
I---'----- 


b -- 
circbuf 
literal 
0 
~~ 


int 
literal 
1 


OxlA 
pro! 
'dram1 
literal 
2 
~ 
a' 
'dram2 
literal 
3 
~ 
b' 
'dram1 + + 
literal 
4 
~ 
+ < 
'dram2 + + 
literalS 
~ 
+J 
'draml 
Iiterat6 
~ 
J 
'dram2 
literal 
7 
~ 
'outl 
pro! 
~ 
out1 + + 
~ 
outl 
10 
outl 
10 
~ 
outl 
hi 
outl 
hi 
~ 
'0012 
stet-le 
~ 
out2 + + 
stet-hi 
10x28 
0012 
10 
0012 
10 
~ 
0012 
hi 
0012 
hi 


10x28 
outl 
c 
outl 
c 
~ 
inl 
c 
inl 
c 


10x2A 
inl 
10 
inl 
10 
~ 
inl 
hi 
inl 
hi 
10x2C 
out2 
c 
out2 
c 


I 
Ox20 
in2 
c 
in2 
c 


1 
Ox2E 
in2 
10 
in2 
10 
~ 
1 
Ox2F 
in2 
hi 
in2 
hi 


1= 
stat 
ram 
r8 


stat 
c 
r9 
~ 
stat 
10 
rID 
~ 
stat 
hi 
rl1 
~ 
yeven 
10 
r12 


"""""OX35 
yeven 
hi 
r13 
~ 
yodd 
10 
r14 
--ox37 
yodd 
hi 
r15 
~ 
ypitch 
shift 
~ 
stat 
c 


0iC3A 
vu 
10 
'draml 
~ 
vu 
hi 
'dram2 
~ 
vupitch 
'draml 
+ + 
--o;;3i) 
vpitch 
'dram2 + + 
~ 
vptr 
10 
'draml 
'ox3F 
vptr 
hi 
'dram2 
'-='- 


Figure 4-2, 82750PB 
Instruction 
Word Format 
I 
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0xA 
r2 
r2 


OxB 
r3 
r3 


axe 
,4 
'4 


OXO 
r5 
r5 


OxE 
r6 
r6 


OxF 
,7 
,7 


Oxl0 
mcode3 
mcode3 


Oxll 
mcode2 
mcode2 


Ox12 
mcodel 
mcodel 


Ox13 
pc 
pc 
Ox14 
pbdnt - 
c 


Ox15 
pIxInt 
pOdnt 


Ox16 
-dram1 
'draml 


Ox17 
'dram2 
'dram2 


Ox16 
'draml 
+ + 
+dram1 
+ + 


Ox19 
-dram2 + + 
+dram2 
+ + 


OxlA 
'draml 
+draml 


OxlB 
'dram2 
+dram2 


OxlC 
dram1 
dram1 


Oxl0 
dram2 
dram2 


OxlE 
dram3 
dram3 


OxlF 
dram4 
dram4 


0x20 
'0011 
-in1 


0.21 
out1 
+ + 
'in2 


0x22 
shift-rl 
'Slal 


0x23 
0011 - hi 
·stat# 
0.24 
'0012 


0.25 
0012 + + 
0.26 
shift 
r 


0.27 
out2 - 
hi 


0.26 
out1 - 
c 


0x29 
In1 - 
c 


0x2A 
shift-I 


o.2B 
inl 
hi 
0x2C 
0012 
c 


0x20 
in2 
- 
c 


0x2E 
o.2F 
in2 - hi 


0x30 
r6 
r8 


0.31 
r9 
r9 


0.32 
,10 
,10 
0.33 
'11 
'11 
0x34 
,12 
,12 


0.35 
,13 
,13 


0x36 
'14 
rt4 


0.37 
,15 
rt5 


0x36 
CC 
shift 
0x39 
Icn1 
tent 


Ox3A 
'dram3 
'dram3 


Ox3B 
'dram4 
'dram4 


Ox3C 
·dram3 + + 
-dram3 + + 
0.30 
edram4 
+ + 
-dram4 + + 


Ox3E 
-dram3 - 
- 
-dram3 - 
- 
Ox3F 
-dram4 - 
- 
-dram4 - 
- 


Figure 4-2, 82750PB 
Instruction 
Word Format 
(Continued) 
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5.0 
ELECTRICAL 
DATA 
Exposure 
to Maximum 
Ratings may affect device re- 


liability. 
Furthermore, 
although 
the 
82750PB 
con- 
tains protective 
circuitry to resist damage from static 
electrical 
discharge, 
always 
take 
precautions 
to 
avoid tiigh static voltages 
or electric 
fields. 
Maximum Ratings 


Table 5-1 is a stress rating only, and functional 
oper- 
ation at the maximums 
is not guaranteed. 
Functional 
operating 
conditions 
are given 
in the 
DC and 
AC 
Characteristics 
(Tables 
5-2, 5-3, 5-4, and 5-5). 


DC Characteristics 


Table 
5-1. Absolute 
Maximum 
Requirements 


Condition 
Maximum 
Requirement 


Case Temperature 
under Bias 
-65°C 
to + 110°C 


Storage Temperature 
- 65°C to + 150°C 


Voltage on Any Pin with Respect to Ground 
-0.5V 
to vcc + 0.5V 


Supply Voltage with Respect to Vss 
- 0.5V to + 6.5V 


Table 
5-2. DC Characteristics 
vcc = 5V ± 10%; TCASE = O°C to + 90°C 


Symbol 
Parameter 
Mln 
Typ 
Max 
UnIt 
Notes 


Vll 
Input LOW Voltage 
-0.3 
0.8 
V 
(Note 1) 


VIH 
Input HIGH Voltage 
2.0 
vcc + 0.3 
V 
(Note 1) 


VOl 
Output LOW Voltage 
0.2 
0.4 
V 
IOl = 4.0 mA(1) 


VOH 
Output HIGH Voltage 
2.4 
3.0 
V 
IOH = -1.0mA(1) 


III 
Input Leakage Current 
-10 
+10 
/LA 
vss < VIN < vcc 


loz 
Output Leakage Current 
-10 
+10 
/LA 
Vss < VIN < vcc 


tee 
Power Supply Current 
150 
200 
mA 
25 MHz(2) 


CIN 
Input Capacitance 
10.0 
pF 
Fc = 1 MHz(3) 


COUT 
Output Capacitance 
12.0 
pF 
Fe = 1 MHz(3) 


CClKIN 
CLKIN Input Capacitance 
20.0 
pF 
Fe = 1 MHz(3) 


NOTES: 
1. Measured with ClKIN = e MHz. 
2. Typical current value measured under typical conditions. Maximum current value guaranteed with 50 pF maximum ou1put 
loading. 
3. Not 100% tested. 
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AC Characteristics 


Table 
5·3. AC Characteristics 
at 25 MHz vcc = 5V ± 10%; TCASE = O·C to + 90·C; CL = 50 pF 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
25 
MHz 
1xClock 


t1 
CLKIN Period 
40 
125 
ns 
5-1 


t2 
CLKIN High Time 
14 
26 
ns 
5-1 
(Note 1) 


t3 
CLKIN Low Time 
14 
26 
ns 
5-1 
(Note 1) 


4 
CLKIN Fall Time 
4 
ns 
5·1 


ts 
CLKIN Rise Time 
4 
ns 
5·1 


tsa 
A[31:2l. 
BE#[3:01, 
WEll. 
3 
25 
ns 
5·2 
D[31:0l. 
HINT#. 
PMFRZ# 
Valid Delay 


tsb 
MREQ#. 
TRNFR#. 
RFSH#. 
3 
18 
ns 
5-2 
. 
NXTFST #. HBUSEN # • 
HRDY # Valid Delay 


t7 
A[31:2l. 
BE#[3:ol. 
WE#. 
30 
ns 
5-2 
(Note 2) _ 


D[31 .ol Float Delay 


t8 
MRDY# 
Setup 
10 
ns 
5-3 


t9 
MRDY# 
Hold 
6 
ns 
5-3 


t10 
. HREQ#. 
VBUS[3:0l. 
RESET#. 
8 
ns 
5-3 
HALEN #. HALT # Setup 


t11 
HREQ#. 
VBUS[3:ol. 
RESET#. 
6 
ns 
5-3 
HALEN #. HALT # Hold 


t12 
A[8:2l. 
BE#[3:01, 
WE#. 
4 
ns 
5-3 
(Note 3) 


D[31:0l 
Setup 


t13 
A[8:2l. 
BE#[3:0l. 
WE#. 
6 
ns 
5-3 
(Note 3) 


D[31 :Ol Hold 


ti4 
HREG #. HRAM # Setup 
10 
ns 
5-3 


t1S 
HREG #. HRAM # Hold 
6 
ns 
5-3 


t16 
CLKOUT Valid Delay 
18 
ns 
5-4 


t17 
CLKOUT 
High Time 
%t1 
- 
6 
%t1 + 6 
ns 
5-4 


NOTES: 
1. This assumes 40 ns period. For other speeds these values should fall between 40% to 60% duty cycle. 
2. Not 100% tested. Guaranteed by design characterization. 
3. Inputs must remain valid throughout all cycles of host accesses. See Figures 3-6 through 3-8. 
4. All A.C. specifications are measured at the 1.5V crossing point with a SOpF load. 
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240854-19 


Figure 5-1. Clock Waveforms 


ClKIN 
1.5V 
I 
III 
:. 
U ~ 
~ 
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t-+- 
: 
_ 
____ 
--. 
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--.. 
I 
~ 
L 
______ 
J~~_l._5V 
J~ 
240854-20 


Figure 5-2. Output Waveforms 


NOTES: 
ty = t8, t10, t12, t14 (setup times) 
tz = t9, t11, t13, t15 (hold times) 


240854-21 


Figure 5-3. Input Waveforms 


CLKINiB= 
\,6 


CLKOUT 


\,7 
240854-14 


Figure ~4. CLKOUT 
Waveforms 
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Output Delay and Rise Time Versus Load Capacitance 


nom 


/ 
V 
./ 
1/ 
V 
V 


nom +8 


nom+4 


TypiooI 
Output 
DoIay 
(nl) 
nom +2 


norn-2 
110 
75 
100 
12S 
1110 


240654-22 


NOTE: 
This graph 
will not be linear outside 
of the CL range 
shown. 
nom 
= nominal 
value given 
in A.C. Characteristics 
table. 


Figure 5~5.Typical 
Output Valid Delay Versus Load Capacitance 
under Worst Case Conditions 


7 
v 
V 
V 
V 
V 


8 


5 


Ate 
4 
TIme (no) 
O.rN.2.(N 
3 


2 


o 
25 
110 
75 
100 
125 
1110 


c" (pIcoIOIadl) 
240654-23 


NOTE: 
This graph will not be linear outside 
of the CL range 
shown. 


Figure 5-6. Typical 
Output Rise Time Versus Load Capacitance 
under Worst Case Conditions 
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6.0 
MECHANICAL 
DATA 


Packaging Outlines and Dimensions 


Intel packages 
the 82750PB 
in a Plastic Quad Flat Pack (PQFP). Table 6-1 gives the symbol list for the PQFP. 


Table 6-1. PQFP Symbol 
List 


Letter or 
Description 
of Dimensions 
Symbol 


A 
Package 
Height: Distance 
from Seating Plane to Highest Point of Body 


A1 
Standoff: 
Distance 
from Seating Plane to Base Plane 


DIE 
Overall Package Dimension: 
Lead Tip to Lead Tip 


D1/E1 
Plastic Body Dimension 


D2/E2 
Bumper Distance 


Ds/Es 
Footprint 


L1 
Foot Length 


N 
Total Number of Leads 


The PQFP has the following 
specifications: 


1. All dimensions 
and tolerances 
conform 
to ANSI Y14.5M-1982. 


2. Datum plane -H- 
is located at the mold parting line and coincident 
with the bottom of the lead where lead 
exits plastic 
body. 


3. Datums A-B 
and -0- 
are to be determined 
where center 
leads exit plastic 
body at datum 
plane -H-. 


4. Controlling 
dimension 
is the inch. 


5. Dimensions 
D1. D2. E1. and E2 are measured 
at the mold parting 
line and do not include 
mold protrusion. 
Allowable 
mold protrusion 
is 0.18 mm (0.007 in.) per side. 


6. Pin 1 identifier 
is located 
within one of the two zones indicated. 


7. Measured 
at datum plane -H-. 


8. Measured 
at seating 
plane datum -C-. 
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Table 6-2 provides 
outline 
characteristics 
for 0.025 in. pitch. 


Table 6·2. Intel Case Outline Drawings for PQFPat 0.025 inch Pitch 


Symbol 
Description 
Mln 
Max 


N 
Leadcount 
132 
132 


A 
Package 
Height 
0.160 
0.170 


A1 
Standoff 
0.020 
0.030 


D.E 
Terminal 
Dimension 
1.075 
1.085 


01. E1 
Package 
Body 
0.0947 
0.953 


D2.E2 
Bumper Distance 
1.097 
1.103 


D3.E3 
Lead Dimension 
0.800 REF 
0.800 REF 


L1 
Foot Length 
0.020 
0.030 


1----02--~ 
~~~~~~~~.-.-~~~~~~ 
~ 
0 
~$~~0~.2~S~~~~~~~~~~ 
1----01----I 
PLANE 


Figure 6·1. Principal Dimensions of the 82750PB In the 132·Lead PQFP Package 
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Figure 
6-2. Detailed 
Dimensions 
of the 82750PB 
In the 132-Lead 
PQFP-Moldlng 
Details 


SEE DETAIL L 
'-+-++-- 
SEE DETAIL J 


1001---- 
o3/E3 -- 
...••-41 


•.•.•.---- 
o4/E4 ---- 
•.• 
•.•... 
----01£ 
------4 
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Figure 
6-3. Detailed 
Dimensions 
of the 82750PB 
in the 132-Lead 
PQFP-Termlnal 
Details 
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1.32 <'e52) 
1.22 (.'.8) 


1.98 (.135> MIN.l 


~ 
E2 
1 
1.32 <'B52> 
1.22 (.B.8> 


B.~B (.B35) 
MIN. j 


2.B3 
(.B8B) ~ 
1.~3 <'B76) 


(.B8B) 
( .B76) 


240854-27 
•• 
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Figure 6-4. 132-Lead PQFP Mechanical Package Detail-Protective 
Bumper 
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(.11118) 
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14 c , 111115 
) 


11.31 (.11112)-i I- 
1.28 
(.1111118) 


8 
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• 
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DETAIL 
J 
DETAIL L 
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Figure 6-5. 132-Lead PQFP Mechanical Package Detail-Typical 
Lead 
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NOTES. 
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9CAVersus Airflow-ft/mln 
(m/sec) 


Package 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06). 
(5.07) 


132-Lead 
26.0 
17.5 
14.0 
11.5 
9.5 
8.5 
PQFP 


• __ 
._ 
" 
U'_ 
I 11"'" IIIRI 
• lIGO.Ol.all""G 
\ 
"" 
•• 
J 


TA Versus Alrflow-ft/mln 
(m/sec) 


Package 
Frequency 
0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


132-Lead 
25 
70 
76 
80 
81 
83 
84 
PQFP 


Table 6-4. Maximum TA at Various Airflows ("C) 
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intel~ 
8275008 
OISPLA Y PROCESSOR 


• 
Programmable Video Timing 
- 
28 MHz and 45 MHz Operating 
Frequency 
- 
Plxel/Llne 
Address Range to 4096 
- 
Fully Programmable Sync, 
Equalization, and Serration 
Components 
- 
Fully Programmable Blanking and 
Active Display Start and Stop Times 
- 
Genlocklng Capability 


• 
Flexible Display Characteristics 
- 
8-, Pseudo 16-, 16-, and 32-BIt/Plxel 
Modes 
- 
Selectable Plxel Widths of 1.0, 1.5, 
2.0,2.5, through 14 Periods of the 
Input Frequency 
- 
Support Popular Display Resolutions: 
VGA, XGA, NTSC, PAL, and SECAM 
- 
On-Chip Triple DAC for Analog RGBI 
YUV Output 


- 
Mix Graphics and Video Images on a 
Plxel by Plxel Basis 
- 
Real Time Expansion of the Reduced 
Sample Density Video Color 
Components (U, V) to Full Resolution 
- 
Three Independently 
Addressable 
Color Palettes 
- 
Programmable 2X Horizontal 
Interpolation 
of Y Channel 
-16 
x 16 x 2-Blt Cursor Map with 
Independently 
Programmable 2X 
Expansion Factors In X and Y 
Dimensions 
- 
YUV to RGB Color Space Conversion 
- 
2X Vertical Replication of Y, U, and V 
Data for Displaying Full Motion Video 
on VGA Monitor 
- 
Register and Function Compatible 
with the 82750DA 


Intel's 827500B 
is a custom 
designed 
VLSI chip used for processing 
and displaying 
video graphic 
information. 


It is register 
and function 
compatible 
with the 827500A. 


Reset inputs allow the 827500B 
to be genlocked 
to an external 
sync source. 
By programming 
internal 
control 
registers, 
this sync can be modified 
to accommodate 
a wide variety of scanning 
frequencies. 
A large selection 
of bits/pixel, 
pixels/line, 
and pixel widths 
are programmable, 
allowing 
a wide 
latitude 
in trading-off 
image 
quality vs. update 
rate and VRAM 
requirements. 


The 827500B 
can operate 
in a digitizing 
mode, wherein 
it generates 
timing and control 
signals to the 82750PB 
and VRAM, 
but does not output display 
information. 
Besides 
digitizer 
support 
signals and video synchroniza- 
tion, the 827500B 
outputs 
digital and analog 
RGB or YUV information 
and an 8-bit digital word of alpha data. 
This alpha channel 
data may be used to obtain a fractional 
mix of 827500B 
outputs with another video source. 


YId ••• 
Mix•• ,=..... 
-"-,....,.l...1....J 
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Subsystem 
Diagram 
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1.0 
8275008 PIN DESCRIPTION 


Pinout 


132 
130 
128 
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124 
122 
120 
118 
118 
114 
112 
110 
108 108 
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102 
100 
131 
129 
127 
125 
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105 
103 
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25 
28 
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30 
31 
32 
33 


OO~OOOOOOOOO~I~~OOOOOOOOOO~OO 
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VCC VSS GV 
VCC I!<VSS VCC VGCs\ 
VSS 
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I 
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VSS VCC 
o VSS 
AVCC 
RV VSS 
BU 
VCC DRV[7J 
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o VCC • 
PlXClK 
0BU[3:D] 
0BU[1:4] AlPHA[O] VCC 0 
g } 
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vssO 
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vccO 
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AlPHA[8] 0 
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DGY[7:0] 
AlPHA[7J 0 
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ACTDIS 0 
o 
8275008Pinout 
BPP[O]0 
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BPP[1]0 
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TOPVIEW 
DISDIG 0 
o vss 
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o 
vccO 
g 
VBUS[3:D]{ g 
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0 
o 
SCLK[1]0 
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DATAlN[13:D] 
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o 
~o 
o 
~~#o 
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HSYNCO 
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DATAlN[31] TESTACT# 
VSYNCg 
g VSS DATAlN[18:14]DATAlN[21:17] 
DATAlN[25:22] 
DATAlN[30:28]~\\::a#~~~:eQN 
:s~ 


VCC 
~!ISS 
I 
/100 ---.L.vss . I 
1ICC 
I 
/!SS 
\ 
. 
\ . 
VCC 
VSSVCC,------, 
I- 
~ 
r+: 
1 
, \FCO 
\VCCDISDAC 
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78 
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78 
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74 
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72 
71 
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88 
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Figure 1-1.8275008 Plnout 
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Pin Name 
Location 


ACTDIS 
87 


ALPHA[7) 
88 


ALPHA[6) 
90 


ALPHA[5) 
92 


ALPHA [4) 
93 


ALPHA[3) 
95 


ALPHA[2) 
96 


ALPHA[1) 
97 


ALPHA[O) 
102 


AVCC 
128 


AVSS 
125 


BG 
69 


BPP[1) 
85 


BPP[O) 
86 


BU 
122 


CB 
83 


CSYNC 
72 


DATAIN[31) 
58 


DATAIN[30) 
56 


DATAIN(29) 
55 


DATAIN[28) 
54 


DATAIN[27) 
53 


DATAIN[26) 
52 


DATAIN[25) 
50 


DATAIN[24) 
49 


DATAIN[23) 
47 


DATAIN(22) 
46 


DATAIN(21) 
44 


DATAIN[20) 
43 


DATAIN[19) 
42 


DATAIN[18) 
41 


DATAIN(17) 
40 


DATAIN[16) 
38 


I 


Table 1-1. Pin Cross Reference 
by Pin Name 


Pin Name 
Location 


DATAIN(15) 
37 


DATAIN[14) 
36 


DATAIN[13) 
31 


DATAIN[12) 
30 


DATAIN[11) 
29 


DATAIN[10l 
28 


DATAIN[9) 
27 


DATAIN[8) 
26 


DATAIN[7) 
25 


DATAIN[6) 
24 


DATAIN(5) 
23 


DATAIN[4) 
22 


DATAIN[3) 
21 


DATAIN(2) 
20 


DATAIN(1) 
19 


DATAIN[O) 
18 


DBU[7) 
103 


DBU[6) 
105 


DBU[5) 
106 


DBU(4) 
107 


DBU[3) 
110 


DBU[2) 
111 


DBU(1) 
112 


DBU[O) 
113 


DGY(7) 
8 


DGY(6) 
9 


DGY[5) 
10 


DGY[4) 
11 


DGY(3) 
12 


DGY[2) 
13 


DGY(1) 
14 


DGY[O) 
15 


DISDAC 
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Pin Name 
Location 


DISDIG 
84 


DRV(7) 
114 


DRV(6) 
118 


DRV[5) 
119 


DRV(4) 
3 


DRV[3) 
4 


DRV[2) 
5 


DRV[1) 
6 


DRV[O) 
7 


FCO 
61 


FREQIN 
64 


GY 
129 


HRESETII 
60 


HYSNC 
71 


IREFIN 
130 


PIXCLK 
120 


RESETB# 
73 


RV 
126 


SCLK(1) 
77 


SCLK[O) 
74 


TEST# 
63 


TESTACTII 
62 


VBUS(3) 
81 


VBUS[2) 
80 


VBUS(1) 
79 


VBUS[O) 
78 


Vcc 
2 


Vcc 
33 


Vcc 
35 


Vcc 
45 


Vcc 
51 


Vcc 
65 


Vcc 
67 
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Pin Name 
Location 


Vcc 
75 


Vcc 
82 


Vcc 
91 


Vcc 
98 


Vcc 
100 


Vcc 
104 


Vcc 
109 


Vcc 
116 


Vcc 
123 


Vcc 
127 
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132 


VGCS 
121 


VRESET# 
59 


Vss 
1 


Vss 
16 


Vss 
17 


Vss 
32 


Vss 
34 


Vss 
39 


Vss 
48 


Vss 
57 


Vss 
68 


Vss 
76 


Vss 
89 


Vss 
94 


Vss 
99 


Vss 
101 


Vss 
108 


Vss 
115 


Vss 
117 


Vss 
124 


Vss 
131 


VSYNC 
70 
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25 
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26 
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27 
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28 
DATAIN[10] 


29 
DATAIN[11] 


30 
DATAIN[12] 


31 
DATAIN[13] 


32 
VSS 


33 
VCC 
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43 
DATAIN[20] 


44 
DATAIN[21] 


45 
VCC 


46 
DATAIN[22] 


47 
DATAIN[23] 


48 
VSS 


49 
DATAIN[24] 


50 
DATAIN[25] 
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VCC 
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DATAIN[26] 
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DATAIN[27] 
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VRESET# 
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FCO 
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TESTACT# 


63 
TEST# 


64 
FREQIN 


65 
VCC 


66 
DISDAC 
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VSS 


77 
SCLK[1] 


78 
VBUS[O] 


79 
VBUS[1] 


80 
VBUS[2] 


81 
VBUS[3] 


82 
VCC 


83 
CB 


84 
DISDIG 


85 
BPP[1] 


86 
BPP[O] 


87 
ACTOIS 


88 
ALPHA[7] 


89 
VSS 


90 
ALPHA[6] 


91 
VCC 
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ALPHA[5] 
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VSS 
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VCC 
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VSS 
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VCC 
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DBU[3] 
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DRV[7] 
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VSS 
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VCC 
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VSS 


118 
DRV[6] 


119 
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Quick Pin Reference 


Table 1-3. Pin Descriptions 


Symbol 
Type 
Name and Function 


FREQIN 
I 
FREQUENCY 
INPUT CLOCK: In normal use, the 827500B 
supplies refresh 
timing for an associated 
VRAM through the 82750PB. 
This places a lower limit 
on the line frequency, 
which is a programmed 
multiple of FREQIN. 
It must 
generate 
enough refresh cycles, so a minimum 
line rate of 4 kHz is required. 
Furthermore, 
the 82750PB 
may run no less than Ye the speed of the 827500B, 
since the 82750PB 
samples the timing and control 
signals generated 
by the 
827500B. 
The period of FREQIN is known as a "T" 
cycle. 


RESETB# 
I 
EXTERNAL 
RESET: Input signal which places all units in the 827500B 
into an 
initialized state, and sets the transfer 
rate to a default value of 1j.3X the 
operating 
frequency. 
It is an edge sensitive 
input which must be held low for a 
minimum of ten T-cycles. 
The slowest transfer 
rate is selected 
to ensure that 
the 827500B 
will read the register information 
correctly 
during the first register 
transfer, 
independent 
of the speed of the VRAMs. 
During the reset state, the 
analog video outputs and digital outputs are set to the black level. This will 
occur a maximum of four cycles after RESETB# 
is set to a zero. This signal is 
also used in conjunction 
with the TEST ACT # input to disable outputs. 


VRESET# 
I 
VERTICAL 
RESET: By programming 
a bit in an internal register, the 827500B 
may be placed in the Genlock 
mode. If this mode is selected, 
assertion 
of 
VRESET # resets all vertical timing to the first line of the next field. It does not 
affect the horizontal 
timing, but does generate 
the on-chip end of field signals. It 
is an edge sensitive 
input that is sampled 
in the 827500B 
at the internal time 
corresponding 
to the rising edge of FREQIN. If the Genlock 
mode has not been 
enabled, this signal will have no effect on the sync timing. The 827500B 
will 
then operate in a free-running 
mode. Refer to Chapter 3 for a detailed 
description 
of genlocking 
the 827500B. 


HRESET# 
I 
HORIZONTAL 
RESET: When in the Genlock 
mode, this input will reset all of 
the horizontal 
timing to the start of the line (beginning 
of horizontal 
sync). 


HRESET # does not affect vertical timing (except for an up-to one-line delay) or 
any other 827500B 
registers. 
This signal is an edge sensitive 
input that is 
sampled 
in the 827500B 
at that internal time corresponding 
to the rising edge 
of FREQIN. As was the case with the VRESET # signal, this input will be 
ignored when not in the Genlock 
mode. 


VBUS[3:0] 
0 
VDP COMMUNICATION 
BUS: The 827500B 
outputs status and VRAM transfer 
requests over these lines to the 82750PB, 
for 2 to 16 T-cycles 
(as programmed 
by the user). Transfer 
requests can tie up the 827500BIVRAM, 
82750PBI 
VRAM, or 82750PB/827500B 
(VB US) interfaces 
for a longer period due to 
VRAM arbitration. 
When signals are not being sent out, the VBUS has value 
1111, the "null command." 


SCLK[1:0] 
0 
VRAM SHIFT CLOCKS: Transfer 
requests to the 82750PB 
cause a VRAM 
address to be set up, and the VRAM serial registers 
loaded (in the case of 
displaying) 
or unloaded 
(in the case of digitizing). 
These signals are used to shift 
data out of and into the VRAMs. 
Both signals are identical, 
and run at a 
maximum 
rate of 1X of the pixel frequency, 
except during transfer 
requests, 
at 
which time they run at 1X, 1/2X, 
or 1/3X 
of the operating 
frequency 
of the 
827500B, 
as programmed 
by the user. 


OATAIN[31:0] 
I 
DATA INPUT BUS: This is the input data clocked 
in from VRAM by the 
SCLK [1 :0] signals. The format of the input data is a function 
of the programmed 
number of bits/pixel 
and of the type of transfer 
cycle being executed. 
Data will 
be sampled 
internally 
on the rising edge of FREQIN. 
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Table 
1-3. Pin Descriptions 
(Continued) 


Symbol 
Type 
Name and Function 


FCO 
0 
FRAME CAPTURE 
ON: This is the output signal which indicates 
to the digitizer 
that the VRAM serial port has been turned from read mode to write mode. The 
digitizer may then drive the (common) 
VRAM serial register data 1/0 pins. FCO 
specifies 
digitization, 
five lines after the start of the active vertical 
display, at the 
time of HSYNC. This gives the external 
logic time to switch directions 
of the 
VRAM serial data bus. This signal will end four lines after vertical 
active stops, at 


the next HSYNC, to make sure the qigitizer is off before the next, beginning-of-field 
register transfer. 


HSYNC 
0 
HORIZONTAL 
SYNCHRONIZATION: 
Video synchronization 
signal which is 
asserted 
at the beginning 
of every line and ends a programmed 
time later. (The 
durationof 
this signal is specified 
in T-cycles.) 


VSYNC 
0 
VERTICAL 
SYNCHRONIZATION: 
Video synchronization 
signal which can be 
- 
programmed 
to start (once) and end (once) in every field. (The start and stop 
position 
may be specified 
in half-line units.) 


CSYNC 
0 
COMPOSITE 
SYNCHRONIZATION 
PULSE: This contains 
the programmed 
vertical 
serration 
and equalization 
information, 
as well as horizontal 
synchronization 
pulses. 


CB 
0 
COMPOSITE 
BLANKING: 
This signal can be programmed 
to end once and start 
once in each line, and end once and start once every field. 


BG 
0 
BURST GATE: This signal starts and stops at user-programmable 
horizontal 
positions 
in each line, in a programmable 
vertical group of lines. The primary use 
of this signal is to provide a "window" 
during which the BURST output should be 
inserted to generate 
a base band NTSC signal. The output frequency 
is set by an 
integer divisor (0-31) 
and the rate of the FREQIN clock input. To use this 
effectively, 
the 82750DB 
must operate 
at an integer multiple of the NTSC 3.58 
MHz color subcarrier. 
The number is programmed 
in two's complement 
form in 
the General Control register. 


PIXCLK 
0 
PIXEL CLOCK: 
This output signals valid data on the DGY, DRV, DBU, GY, RV, 


and BU lines. PIXCLK becomes 
active one-half 
of a T-cycle after valid data 
appears on DGY, DRV, or DBU, and coincident 
with GY, RV, and BU. During 
active display time it is issued at a steady rate of 1I(T-cycles/pixel) 
times per 
T-cycle, and otherwise 
at a steady rate of once per T-cycle. 
Its duration 
is one-half 
of a T-cycle, and its rising edge may synchronize 
with either rising or falling edges 
of FREQIN depending 
on the pixel frequency. 
This signal may be used to 
synchronize 
off-chip 
processing 
of the pixel data outputs. 


GY,RV,BU 
0 
ANALOG 
PIXEL OUTPUTS: 
These signalS are the processed 
pixel data from the 
82750DB 
in analog form. During the display, these signals may be programmed 
to 
output pixel data in either YUV or RGB format. 


Output 
DGY 
DRV 
DBU 
Format 


YUV 
Y 
V 
U 
RGB 
G 
R 
B 


DGY[7:0), 
0 
DIGITAL 
VIDEO OUTPUTS: 
These are the digital outputs of the GY, RV, and BU 
DRV[7:0), 
channels, 
respectively. 
They are valid with respect to the rising edge of PIXCLK. 


DBU[7:0] 


ALPHA[7:0] 
0 
ALPHA 
CHANNEL: 
These 8 bits are used to output a digital value for mixing the 
82750DB 
output with another video signal off-chip. 
The alpha channel 
information 
may be included 
in the pixel data, or may be output based on a comparison 
of the 
pixel data with user-programmed 
values. 


ACTOIS 
0 
ACTIVE 
DISPLAY: 
This is the active portion of the display as programmed 
by the 
user. It is delayed by the pipeline through the 82750DB, 
which is 5 lines vertically 
and a variable 
number horizontally, 
depending 
on the display mode. 


I 


• 
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Table 1-3. Pin Descriptions 
(Continued) 


Symbol 
Type 
Name and Function 


BPP[1:0] 
0 
BITS PER PIXEL: During the nonactive 
display, the user programmed 
bits/pixel 
is 


encoded 
on these lines. During active display, the BPP[O] signal is multiplexed 


with a signal, Cursor Active, which indicates 
if the cursor data is currently 
active 


(non-transparent). 
When the Cursor Active output signal is asserted, 
this indicates 


that cursor overlay data is currently 
being output. Also during the active display, 


the BPP [1] signal is multiplexed 
with a signal, VUGR, which indicates 
whether 
the 


827500B 
is operating 
in a graphics 
or video mode. When the VUGR output signal 
is asserted, 
this indicates 
the G, R, and 8 outputs are derived from the 
subsampled 
VU data. Thesepins 
allow users to latch the 8PP[1:0] 
signals during 


nonactive 
display time (as indicated 
by ACTOIS being zero) for post-processing 
of 
the 8275008 
output. The active cursor window on 8PP[0] 
can be used during 


active display, to multiplex 
in other video streams into the output display. The 


following 
table illustrates 
the encoding 
on the 8PP signals. 


Blts/Plxel 
ACTDIS 
BPP[O] 
BPP[1] 


8 
0 
0 
0 
16 
0 
0 
1 
32 
0 
1 
0 
pseudo 
16 
0 
1 
1 
8 
1 
Cursor Active 
VUGR 
16 
1 
Cursor Active 
VUGR 
32 
1 
Cursor Active 
VUGR 
pseudo 
16 
1 
Cursor Active 
VUGR 


OISOAC 
I 
DISABLE 
ANALOG 
OUTPUTS: 
When this input is active, the Analog Pixel 


Outputs are set to a high-impedance 
state. 


OISOIG 
I 
DISABLE 
DIGITAL 
OUTPUTS: 
When this input is active, the digital outputs of the 


827500B 
will be set to zero. In applications 
that use only the analog outputs of the 


827500B, 
the digital outputs must be disabled. 


TESTACT# 
I 
TEST ACTIVE: 
Active low signal that is used in conjunction 
with the RESET8 # 


signal to allow the chip to perform one of the following 
functions: 


RESETB# 
TESTACT# 
82750DB 
State 


0 
1 
Enter Reset State 
0 
0 
Enter Reset State 
Tri-State 
All Outputs 
Analog Outputs are Zero 
1 
1 
Normal Operation 
1 
0 
Reserved 


TEST# 
I 
TEST INPUT: This signal must be set to VCC to guarantee 
correct 
chip operation. 


VGCS 
0 
INTERNAL 
VOLTAGE 
REFERENCE: 
This signal must be decoupled 
to AVCC. 


IREFIN 
I 
ANALOG 
CURRENT 
REFERENCE: 
Under normal operation, 
this signal should be 
tied to a temperature 
compensated 
current reference 
to AVSS. This signal must 


be decoupled 
to AVCC. 


AVcc 
I 
ANALOG 
POWER pin provides + 5 Voc supply to the Digital to Analog Converter. 


AVss 
I 
ANALOG 
GROUND 
pin provides 
the OV connection 
to which the analog outputs 


are referenced. 
This must be connected 
to VSS. 


Vcc 
I 
POWER pins provide + 5 Voc supply input. 


Vss 
I 
GROUND 
pins provide the OV connection 
to which all inputs and outputs are 
referenced. 
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Table 
1·4 Input Pins 


Name 
Active 
Synchronousl 
Level 
Asynchronous 


FREQIN 
HIGH 
Synchronous 


RESETB# 
LOW 
Asynchronous 


VRESET# 
LOW 
Asynchronous 


HRESET# 
LOW 
Asynchronous 


OISOIG 
HIGH 
Asynchronous 


TESTACT# 
LOW 
Asynchronous 


TEST# 
LOW 
ASynchronous 


All output pins have an active level of HIGH, and are 
floated when RESETB# and TESTACT# are set to 
a zero. The exceptions are GY, RV, and BU which 
will be forced to a zero level. 


2.0 ARCHITECTURE 


Overview 


There are 10 units in the 827500B. Each of the units 
operates independently at the maximum clock rate 
input to the chip. The control information for each 
block 
is 
distributed 
in 
programmable 
registers 
throughout the chip. These registers are loaded on 
user-specified lines during the horizontal and vertical 
blanking intervals of the field. The register data that 
was read in from VRAM is passed from block to 
block during the blanking intervals of the display, on 
the same lines that the pixel information is passed 
during the active display. The Functional Block Oia- 
gram is shown in Figure 2-1. 


In order to maximize speed and compensate for pro- 
cessing delays, the chip is heavily pipelined. All in- 
ter-block information is delay-equalized to accom- 
modate the different pipeline lengths in each mod- 
ule. As a result, the total pipeline delay is dependent 
on the number of processing units that are used to 
generate the display. Chapter 4 describes how the 
user programming is affected by these pipeline de- 
lays. 


Each of the units are described in more detail in the 
following sections of this chapter. 


Sync Generation 
and Timing 


The sync generation and timing block generates all 
of the internal timing and control signals, as well 


I 


8275008 


as the video synchronization signals. Sync and tim- 
ing information may be derived from two sources: 
from the master clock, in which case the control reg- 
isters on the 827500B are programmed to provide 
the desired display frequency in terms of periods of 
the master clock (T-cycles), or from the horizon- 
tal and vertical external reset signals. (The latter 
is known as the genlock mode.) Characteristics 
such as line rate, blanking and border intervals, and 
composite synchronization parameters can be in- 
dependently 
set. 
Since 
the 
827500B 
can 
be 
reprogrammed once each line, horizontal strips of 
different resolutions can be supported on the same • 
display. However, the horizontal strips that can be 
supported are limited by the host processor's re- 
sponse to redefining the bitmap pointers resident on 
the 82750PB. 


The horizontal and vertical display parameters are 
fully programmable. Figure 2-2 illustrates the hori- 
zontal programming parameters. The line starts at 
the programmed start position, with the length of 
half of a line programmed in T-cycles. The length of 
the total line is twice the half-line length. Parameters 
such as horizontal sync start, horizontal sync width, 
horizontal blanking start and stop, and horizontal ac- 
tive start and stop are all specified by the user. Note 
that the border time is not explicitly programmed, but 
is defined as the region of the display line where 
neither active display nor blanking is programmed to 
occur. In order for the 827500B to function correctly, 
the width of the horizontal active display should be 
programmed such that the end of the horizontal ac- 
tive display coincides with the end of the last dis- 
played pixel. 


Figure 2-3 shows the vertical programming parame- 
ters. The basic unit for vertical programming is in 
units of half lines, with the half-line count for each 
field starting at zero. Where appropriate for a param- 
eter, the count is programmed in units of full lines. 
The length of the complete field is programmed in 
half lines, which makes it convenient for distinguish- 
ing between interlaced and non-interlaced displays. 
(For interlaced displays, the number of half lines is 
odd, for non-interlaced displays, it is even.) The ver- 
tical active and blanking regions may be indepen- 
dently programmed, with the border time defined as 
the region where blanking and active display is not 
on. 


NOTE: 


Sync parameters 
are completely 
independent 
of 
the display parameters. 
This allows 
the sync sig- 
nals to be positioned 
anywhere 
in the field (even 
during active display). 
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Unit Level Diagram 
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Figure 2·2. Horizontal Programming Parameters 
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Figure 2·3. Vertical Programming Parameters 
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VBUS Control 


The VBUS controller sends all 827500B requests 
for display bitmaps, VRAM refresh, and synchroniza- 
tion information to the 82750PB, at programmable 
times during a field. Transfer requests are scheduled 
to occur on a line basis, so only their vertical position 
(or line) is specified by the user. Other commands, 
like refresh requests, occur every line, and their hori- 
zontal position (or dot position) in the line must be 
specified by the user. Transfer requests are given 
the highest priority by the VBUS control circuit and 
are performed first during a blanking interval. The 
programmer has the responsibility of scheduling the 
line oriented codes, like refresh, so that they do not 
collide with the transfer requests. 


Besides arbitrating the scheduled transfer requests, 
the VBUS controller also reads the data from the 
VRAM shift registers using the two shift clock out- 
puts (SCLK[1:0)). The code corresponding to the 
type of data to be read is asserted for a programma- 
ble number of cycles on the 4-bit VBUS. The 
827500B then waits a programmable delay before' 
reading the data from the VRAM. This delay should 
be long enough to guarantee that the 82750PB has 
completed loading the information into the serial 
shift register of the VRAM. Both signals are off while 
the code causing the transfer cycle is active on the 
VBUS, as well as during the read delay time. Figure 
2-4 
illustrates this 
communication 
between 
the 
82750PB and the 8275008. 


When the delay wait is over, the shift clock outputs 
are activated. The SCLK[1:0] signal's behavior is 
dependent on the transfer rate that the user has se- 
lected=elther 
1X, 1/2X, 
or 1/3X 
the operating fre- 


quency. Note that if the RESETB*" signal is applied, 
the transfer rate is automatically set to 113X during 
the first automatic register transfer, regardless of the 
state of the transfer rate selection. The transfer rate 
may be changed in the first register transfer after 
RESETB*" is set to a logic one value. 


Figure 2-5 illustrates how the SCLKs operate in the 
1X mode in a system. SCLK[1:0] signals will toggle 
between zero and one on the 
rising edge of 
FREQIN, after an internal logic delay. The data is 
read into the 827500B on the rising edge of the in- 
ternal clock, one 827500B clock cycle after the 
SCLK outputs are asserted. Since there are 32 data 
input pins, each SCLK can read in the serial data 
from eight 256 x 4 VRAM memory devices. Adding 
external buffering to the SCLKs (to drive more mem- 
ory) will also add delay to the memory access. The 
delay increase may require more than one T-cycle 
before the VRAM data is valid. In this case, the time 
between the rising edge of the internal 827500B 
clock that generates the SCLKs and the edge that 
latches the data must be increased. 


There are two solutions, the operating frequency of 
827500B can be lowered to accommodate a longer 
T-cycle, or the 1/2X 
SCLK mode may be selected 
(as shown in Figure 2-6). When using the 
1/2X 
transfer rate, the data is read into the 827500B on 
the rising edge of the internal clock, two 827500B 
clock cycles after the SCLK outputs are asserted. 
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Figure 2-4. 82750PB/82750DB Communication 
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8275008 


Figure 2-7 illustrates 1/3X 
(default) shift clock oper- 
ation that is used during the RESET mode or may be 
programmed by the user. The first word of data is 
latched by the 8275008 on the rising ede of the 
FREQIN that is three T-cycles after the SCLK out- 
puts were asserted. This allows three full 8275008 


cycles for the VRAMs to output valid data. which 
gives extra margin for applications that need longer 
shift read cycles (due to slower memories or exter- 
nal logic delays) and do not wish to operate the 
8275008 at a slower speed. 
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Figure 2-5. 8275008 1X Shift Clock Operation 
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Figure 2-6. 8275008 1/2X Shift Clock Operation 


;.,.---- 
3 T-CYCLES 
----., 


8275008 
FREQIN 


;..- 
TOSCLK 


SCLK[1 :01 ---.f: 
\ 
V 
\ 
,, 


TACCESS -; 
r--- 
, 
! 


VRAIo4 
X 
data 
, 


.: TSETUP 


240855-10 


Figure 2-7. 8275008 ~X 
Shift Clock Operation 
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input clock when using the 1/3X 
mode. In the 1/3X 
mode the SCLK[1:0] outputs will be high for one 
T-cycle, and low for 2 T-cycles. 


VBUS CODE DESCRIPTION 


When the 827500B is actively fetching and display- 
ing pixels, VUXFER, BMX/YBMNPX, and REGX are 
typically sent over the VBUS. Of the three codes, 
REGX has top priority, followed by VUXFER, and 
last by BMX/YBMNPX. These commands may be 
programmed to occur each active line during the 
blanking interval for the line just completed. If a reg- 
ister transfer has been programmed for an active 
line, it takes priority and is executed first. Otherwise, 
immediately after the register transfer, any sched- 
uled VUXFER and BMXIYBMNPX commands are 
executed. The programmer has the responsibility for 
verifying that the sum of times required by these 
commands does not exceed horizontal non-active 
display time. The 827500B will commence fetching 
pixels at the subsequent start of active display. A 
detailed explanation of the different types of VBUS 
commands and their corresponding codes follows. 


Transfer Requests 


The following commands request the 82750PB to 
transfer information from the VRAM array into the 
VRAM shift register. When multiple requests are pro- 
grammed for a given line, they are listed in the priori- 
ty they are sent. When asserting a transfer request, 
the programmer must be aware of two other pro- 
grammed parameters, VBLEN and SCLK delay. 


The VBLEN parameter is a user programmed value 
whose bits lie in the General Control Register. It is 
the length of time, in 827500B T-cycles, that a par- 
ticular VBUS code will be held at the outputs. It is 
used to ensure that the asynchronously operating 
82750PB chip will have enough time to recognize 
and begin operating on an 827500B transfer re- 
quest. 
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transrer, me tI"u:,ut-'tj 
uses me contents or Its 
827500Bc register as the starting address of the 
827500B register data. On each subsequent ac- 
cess, the programmed pitch value in 82750PB's 
827500Bc-PITCH register is added to the accumu- 
lated start address. The programmer must ensure 
that the data is stored in VRAM at the correct ad- 
dress. Since the pitch remains constant, the longest 
register load will determine the pitch value. 


The VBUS unit performs a vertical checksum on all 
the register information. Each bit in the register word 
undergoes an exclusive-OR with the corresponding 
bit in the previous data word. The 827500B com- 
pares this information with the user generated 
checksum, which is the last 32-bit data word read 
into the 827500B during a register transfer. If the 
values do not match, the 827500B will disable all of 
its digital sync and data outputs, enter the reset 
state, and send a SHUTOOWN code (827500BSO) 
to the 82750PB over the VBUS[3:0] outputs. If the 
new checksum is correct, the new register values 
will take effect immediately. 


VUXFER (0001) 
This code is used to request VU 
data, providing new VU data is required by the 
827500B. This command is issued only on vertically 
active lines (as programmed in the register, not as 
seen on the screen) and possibly the four lines after. 
On each line, a row of V andIor U samples are load- 
ed into the VU interpolator line stores. The pattern of 
requests depends upon the mode in which the VU 
interpolator is operating. In the interlaced VU mode, 
one line of samples for both the V and U compo- 
nents are fetched during each transfer; in the non-in- 
terlaced VU mode, only one line of samples for ei- 
ther the V or U components is fetched. Table 2-1 
illustrates the pattern of requests. M is the pro- 
grammed first vertical active line, and N the last ac- 
tive line. The modes listed have VU transfer re- 
quests following the end of horizontal active of the 
lines specified, stopping with the last line, N + 4. 
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Table 
2-1. VU Transfer 
Request 
Patterns 


Mode 
Active 
Request 
VU Data 
Line 


2x Non-Interlaced 
M 
Fetch 1st Line of V 
M + 1 
Fetch 1st Line of U 


M+4 
Fetch 2nd Line of V 


M+5 
Fetch 2nd Line of U 
N+4 
Fetch Last Line of V 


2x Interlaced 
M 
Fetch 1st Line of V and U 
(Odd and Even 
M+4 
Fetch 2nd Line of V and U 
Fields) 
M+5 
Fetch 3rd Line of V and U 
N+4 
Fetch Last Line of V and U 


4x Non-Interlaced 
M 
Fetch 1st Line of V 
M + 1 
Fetch 1st Line of U 
M+4 
Fetch 2nd Line of V 
M+5 
Fetch 2nd Line of U 
M+8 
Fetch 3rd Line of V 
N+4 
Fetch Last Line of V 


4x Interlaced 
M 
Fetch 1st Line of V and U 
(Odd and Even 
M+4 
Fetch 2nd Line of V and U 
Fields) 
M+6 
Fetch 3rd Line of V and U 
N+4 
Fetch Last Line of V and U 


The 82750PB 
uses another 
internal pointer to cause 
the VRAM 
to load the desired 
VU data into its shift 
registers 
(incrementing 
the pointer by a pitch value). 
This command 
is asserted 
for a programmable 
num- 
ber of T-cycles 
(m), as specified 
in the Miscellane· 
ous 
Control 
register. 
Then, 
the 
827500B 
fetches 
them, 
tying 
up the 
827500BIVRAM 
interface 
for 
(n + 2) cycles, where n is 1f.a the programmable 
total 
number 
of 8-bit samples 
of V and U fetched. 
Note 
that 
one 
extra 
word, 
which 
may overlap 
the 
next 
VBUS command, 
is fetched. 


~~ settin~ a bit in the Miscellaneous 
Control register, 
It IS possible 
to replicate 
lines of V and U generated 
by the 
interpolator 
for the entire 
field. 
Since 
each 
line of VU data is displayed 
twice, the rate that the 
VU sample 
map has to be fetched 
from 
VRAM 
is 
reduced 
by %. Table 
2-2 lists the sequence 
of VU 
loads. 


In S?me cases, ~he VU interpolator 
may cover only a 
portion 
of the dIsplay. 
In those 
instances, 
M in the 
above examples 
would be the first line that VU inter- 
polation 
is enabled. 
N would be the last line that VU 
interpolation 
is enabled. 
Regardless 
of the state 
of 
the 
Line 
Replicate 
bit, there 
would 
be no vertical 
pipeline 
delay between 
the loading of the first line of 
samples 
and the second 
line of samples. 
The first 
line of samples 
would 
be loaded 
at M-I, 
and 
the 
second 
line at M. This reduces 
the delay 
between 
switching 
interpolation 
modes 
during 
a single 
dis- 
play. 
I 
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Table 
2-2. VU Transfer 
Request 
Patterns 
with 
Line Replicate 


Mode 
Active 
Request 
Line 


2x Non-Interlaced 
M 
Fetch 1st Line of V 


M + 1 
Fetch 1st Line of U 
M+4 
Fetch 2nd Line of V 
M+5 
Fetch 2nd Line of U 


M+8 
Fetch 3rd Line of V 
M + 9 
Fetch 3rd Line of U 
N+4 
Fetch Last Line of V 


2x Interlaced 
M 
Fetch 1st Line of V and U 
(Odd and Even 
M+4 
Fetch 2nd Line of V and U 
Fields) 
M+6 
F~tch 3rd Line of V and U 
N+4 
Fetch Last Line of V and U 


4x Non-Interlaced 
M 
Fetch 1st Line of V 
M + 1 
Fetch 1st Line of U 
M+4 
Fetch 2nd Line of V 
M+5 
Fetch 2nd Line of U 
M + 12 Fetch 3nd Line of V 
M + 13 Fetch 3nd Line of U 
N+4 
Fetch Last Line of V 


4x Interlaced 
M 
Fetch 1st Line of V and U 
(Odd and Even 
M+4 
Fetch 2nd Line of V and U 
Fields) 
M+8 
Fetch 3rd Line of V and U 
N+4 
Fetch Last Line of V and U 


BMX 
(0000) 
This 
command 
requests 
a bitmap. 


B.M~ (0000) is sent after horizontal 
active stops, be- 
ginning 
on the fifth 
line after 
vertical 
active 
starts 
and continuing 
until the fifth line after vertical 
activ~ 
stops. (There is a vertical 
pipeline 
delay of five lines 
through 
the.827500B, 
due to internal 
timing require- 
ments.) 
A hne programmed 
to start 
at line M, will 
have its first active line displayed 
at line M + 5. The 
82750PB 
uses 
an 
internal 
pointer 
to 
cause 
the 
VRAM shift registers 
to be loaded with pixel values. 


The 
827500B 
subsequently 
fetches 
them 
as 
re- 
quired for display. This command 
is asserted 
on the 
VB US for the user-programmed 
number 
of T-cycles 
and must be completed 
before active display begins. 


YBMNPX 
(0100) 
This command 
performs 
a Y bit- 


map transfer 
without 
performing 
a pitch calculation. 
When.the 
line replicate 
mode is selected 
by Bit 22 in 
the Miscellaneous 
Control 
register, 
this code 
is as- 
serted every other display line so that the same line 
of information 
can be used twice. 


•• 


1-79 


8275008 


Olgitlzer Commands 


When in the line replicate mode, and digitizing an 
NTSC source (for example, when genlocking an 
NTSC source to a system that uses only a VGA 
monitor), each line of captured data is effectively 
output at twice the rate. Since each line need only 
be stored once in memory (it is duplicated automati- 
cally in the display mode) only one WROIGI code, 
followed by a WROIGINP, is sent every other line. 
On alternate lines, two WROIGINP are sent and will 
select the last address that was written, without in- 
crementing the 82750PB bitmap address pointer. 
This is described in detail in Chapter 3. 


WROIGI (0011) 
This command requests a write of 
digitized data. The operation of this command is de- 
pendent upon the external hardware and is dis- 
cussed in the section on genlocking (page 29). If 
digitizing is enabled, this command is asserted on 
the VBUS for a programmable number of T-cycles. 
The pointer is then incremented by a pitch value. 
Since each horizontal line is stored in a single row of 
memory, this pitch value is equal to the horizontal 
resolution, in bytes, for non-interlaced bitmaps. For 
interlaced bitmaps, the pitch value is equal to twice 
the horizontal resolution, in bytes. This allows alter- 
nate lines of data to be skipped over in successive 
fields. 


WROIGINP (0111) 
This command allows access 
to digitized data without performing a pitch calcula- 
tion. WROIGINP (0111) requests that the 82750PB 
perform a transfer request at the last calculated ad- 
dress. Note that only a memory transfer cycle is per- 
formed-the 
pitch value is not added to this ad- 
dress. This will always ensure that the digitized data 
is written into the last selected memory address, in 
case 
a 
physical 
memory 
boundary 
has 
been 
crossed. This command is asserted after the WROI- 
GI transfer has completed. 


Refresh and Control Commands 


The following signals are used to pass refresh re- 
quests and control information to the 82750PB. 


OFL (1000) 
The Oisplay Format Load command is 
a maskable host processor interrupt that can be pro- 
grammed to occur at any time during the display. 
This is used by the 82750PB to transfer the shadow 
register contents into the working register set in the 
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VRAM interface. This is useful in supporting split- 
screen-type applications, where it is desirable to 
change the bitmap pointers at some point before the 
end of the display. 


8275008S0 
(1001) 
This 
command 
is 
the 
827500B Shut Oown code. Ouring every register 
transfer, the 827500B keeps an internal vertical ex- 
clusive-or checksum of the register data as it is read 
onto the chip. The last word of data that is read 
during'the 
register transfer is' the user-generated 
checksum. If the two checksums match, operation 
proceeds as normal. If they do not match, the 
827500B enters the reset state and sends this code 
to the 82750PB. The 827500B will remain reset until 
the reset pin is asserted and negated by the host 
processor. 


REFRESH 
(1010) 
This 
command 
asks 
the 
82750PB to generate up to 15 refresh cycles every 
horizontal line. The 827500B transfer cycles have a 
higher priority than refresh requests in the 82750PB. 
REFRESHwill not be asserted if programmed to oc- 
cur at the same time as a transfer request code. 


Video Synchronization 
Information 


The following codes are used to pass the video line 
and field information from 827500B to the pixel 
processor. 


VEVEN (1101) 
This code indicates the start of an 
even (Le.,second) field of a frame. This command is 
sent coincident with line one of each even field. 
When genlocking to an external source (see pg. 29), 
the 
occurrence 
of 
a vreset 
signal during pro- 
grammed horizontal active 
time will cause the 
827500B to output a VEVEN code on the VBUS. 


VOOO (1100) 
This code indicates the start of an 
odd (Le.,first or only) field of a frame. This command 
is always sent immediately after RESETB# is negat- 
ed, and coincident with line one of the odd field. 
Similarly, when genlocking, the occurrence of a vre- 
set signal during any time other than horizontal ac- 
tive time will cause the 827500B to output a VOOO 
code on the VBUS. 


HUN (1110) 
This code marks every horizontal line 
at a programmable point in the line. HUN is used by 
the 82750PB to increment its horizontal line counter. 
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Pixel Processing 
Path 


This logic accepts the 32-bit word from the input 
latch and divides the word into the programmed pix- 
el format. This will result in either four 8-bit pixels, 
two 16-bit pixels, one 32-bit pixel, or an 8-bit pixel 
with an 8-bit alpha value (pseudo 16-bit mode). The 
pixels act as addresses to the color table, or may 
bypass the table completely as described below. 


Pixel information may be mixed with the output of 
the VU interpolator, which outputs interpolated sam- 
ples derived from a reduced sample bitmap. The 
least significant bit of Y or LSB of U can be pro- 
grammed to act as a switch between using the ex- 
plicit pixel value of YUV or using the luminance por- 
tion of the pixel with the VU portion obtained from 
the interpolator. If the value of the LSB of Y (or U, 
whichever is selected) is zero, the pixel data is used. 
If the LSB of Y (or U) is one, the output of the VU 
interpolator is used. Note that if the LSB of Y is used 
as the switch flag, the luminance portion of the word 
will be only 7 bits wide. 


The alpha information is also processed in this 
block. The alpha data may come from one of two 
sources: it may be explicitly coded in the pixel word, 
as is the case in the 32-bitlpixel and pseudo 16-bitl 
pixel mode, or it may be obtained by comparing the 
Y portion of the pixel with a preprogrammed value 
and outputting one preprogrammed value if they 
match and a different value if they do not match. 
This latter capability is known as Alpha Trap. 


VU Interpolation 


When VU interpolation is enabled by the program- 
mer, and when the display is in the active region, 
"VU data" will be fetched, as required by the inter- 
polator (by the mechanisms discussed previously in 
the section titled "VBUS Code Description"). This 
data has the format V, V, ... , V, U, U, ... , U where 
each V or U is 8 bits, and the bytes are grouped into 
32-bit double-words with the earliest in lowest order. 
The number, "N", of V bytes and U bytes is the 
same; N is programmed to be either 256 samples, or 
one of 32 to 192 samples in 32-byte increments. 


The first V data and the first U data fetched on the 
first line of VU interpolation supplies the VU value for 
the first active pixel on that line. All the other VU 
pairs that are fetched define values for the grid of 
pixels defined below and to the right of this one by 
the VU expansion factor every other or every fourth 
horizontally and vertically. Most other VU values are 
filled in recursively by interpolation. Wherever there 
is a pixel which lies between two pixels with known 
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values, it is given the value of the weighted average 
of the known values. Values are understood to be 
non-negative integers. When the final value is out- 
putted, any fractions are truncated or rounded to the 
closest odd integer according to the programmed 
value of the interpolation round flag. This process is 
iterated until all pixels have assigned color values. If 
the number of VU data samples loaded into the 
82750DB is not enough to cover the active display 
area, then the last data sample will be replicated 
horizontally across the active display window. 


As mentioned previously in the VBUS Control dis- 
cussion, each line of VU data can be used twice by 
setting the Line Replicate bit in the Miscellaneous 
Control register. Also, each horizontal VU sample 
can be replicated by setting the VU Replicate bit in 
the Pixel Control register. This will cause the V and 
U pixels generated by the VU interpolator every pixel 
time to be used twice. This can result in an effective 
8X horizontal expansion, which is useful when hori- 
zontal blanking time is at a premium. This bit affects 
the horizontal interpolation algorithm only, and will 
not affect the line loading sequence for VU during 
the active display. 


When interpolation is turned on by the programmer 
(by specifying a non-zero number of samples to be 
fetched), VU interpolation may nevertheless be dis- 
abled for each pixel if the following conditions are 
met: 


1. Conditional interpolation has been selected by 
the programmer, 
AND 


Either of the two user-programmed conditions: 
a. Switching on the LSB of the U bit has been 
selected, and the lowest-order bit of the U val- 
ue fetched for the upper left pixel in the block 
has value zero. This allows switching to occur 
on a 2 x 2-pixel or 4 x 4-pixel grid, depending 
on the expansion mode the user has selected. 
The full 8 bits of Y and V are used, but the 
usable space of U has been decreased to 7 
bits. 


b. Switching on the LSB of the Y bit has been 
selected, and the low order bit of the Y value 
for the current pixel has a value of zero. 
2. Display of fetched and interpolated VU values 
may also be suppressed by setting the Interpola- 
tion Output Enable bit (in the miscellaneous con- 
trol register) to zero. This will allow VU data to be 
loaded into the VU line stores without displaying 
VU data. This is useful when a mid-screen tran- 
sition is made between two interpolation modes, 
to compensate for the vertical latency of the in- 
terpolation process. 
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Each 
channel 
(Y, U, V) has a MASK 
SET register 
and a MASK 
DATA 
register 
that 
selects 
the color 
lookup address 
bit to be changed 
and the new value 
of the bit, respectively. 
A simple 
mask operation 
on 
one channel 
is illustrated 
in Figure 2-8. 


The CLUT address 
mask operation 
is determined 
by 
a logical 
equation 
given by: 


Result 
= (mask 
set and mask 
data) I (mask 
set and data 
byte) 


Each bit of the Result byte is determined 
individually 
by this equation. 
The Result byte is then further 
pro- 


cessed 
in order to produce 
the CLUT RAM address. 


sures 
the 
proper 
half of the 
CLUT 
is used 
for all 
three components. 


The color 
table .can be bypassed 
completely 
when 
displaying 
either 
graphics 
or video, 
independent 
of 
the programmed 
bits/pixel. 
This is programmed 
by 
the user via the VIDEO PASS and GRAPHICS 
PASS 
bits in the Miscellaneous 
Control 
register. 
Table 2-3 
summarizes 
the 
various 
modes 
when 
using 
the 
CLUT. 
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MASK DATA Register 
(0 x 42) 
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Figure 2-8. Mask Operation 
on CLUT Address 


Table 2-3 CLUT Modes 


Graphics 
Video 
LSBYorU 
SPLITCLUT 
Pass 
Pass 
Colormap 
Address 


0 
X 
0 
0 
Masked Graphics 
Data 


1 
X 
0 
X 
Graphics 
Pixels Bypass CLUT 


X 
0 
1 
0 
Masked Video Data 


X 
1 
1 
X 
Video Pixels Bypass CLUT 


0 
X 
0 
1 
Even Address 
Only (Graphics) 


X 
0 
1 
1 
Odd Address 
Only (Video) 


1 
1 
X 
X 
CLUT Not Used at All 
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When writing to the CLUT, the most significant byte 
of the data word corresponds to the address, and 
the least significant 24 bits are the YUV data (least 
significant to most significant, respectively). An in- 
dex register is used to allow the 6-bit address to be 
mapped to an 8-bit number. (Refer to Chapter 4 for 
more information.) By resetting the 82750DA Dis- 
able bit, it is possible to make the CLUT look like the 
reduced entry color lookup table on the 82750DA. 


The following paragraphs summarize the possible 
bitlpixel modes, using the LSB of Y or U switching 
ability and the various graphics and video bypass 
modes. Note that there are modes where the LSB of 
Y or U are not used to switch between graphics and 
video. 


8-BIT IPIXEL 
GRAPHICS 
MODE 


This is the graphics-only mode, in which the 8 bits 
are used as inputs to all three color tables. This 
makes the color maps look like a single, ?56 x 24-bit 
CLUT and allows 256 unique colors from a palette of 
16 million to be available at any given time. If the 
Graphics Pass bit is asserted, the CLUT will be by- 
passed and the 8-bit values of the Y, U, and V chan- 
nels will be input to each channel of the converter 
matrix. 


8-BIT IPIXEL 
VIDEO 
MODE 


When used with subsampled VU information from 
the interpolator, the 8 bits are actually a luminance 
value. The Y portion addresses the Y color table, V 
the V color table, and U the U color table. By using 
the color table, a one-to-one mapping exists, allow- 
ing non-linear transformations to be applied to the 
_ pixel data to enhance the quality of the reconstruct- 
ed image. By asserting the VIDEOPASS bit in the 
Miscellaneous Control register, the color table can 
be bypassed. 


8-BIT IPIXEL 
MIXED 
MODE 


In the 8-bitlpixel mixed mode the LSB of Y or U is 
used as a switch flag to change the index to the 
color tables. When the switch flag is set to a one, 
the Y value corresponds to a luminance value, and 
the VU values are the chrominance information ob- 
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tained from the VU interpolator. In this case each 
video component is used as an address to its corre- 
sponding CLUT as described above. When the 
switch flag is set to a zero, the VU values are not 
used and the Y value is used as the address to all 
color tables. These pixels are treated the same as in 
the 8-bitlpixel graphics mode. 


In this mode the applications programmer must en- 
sure that the proper information has been loaded 
into specific areas of the color maps. For example, 
all the video pixels will use the odd address values. 
By restricting the address used in the graphics and 
video mode, two unique maps may coexist in the 
tables. One map is used for non-linear transforma- 
tions on video data, and the other for graphics color 
lookup table applications. 


As illustrated above, the CLUT can be bypassed by 
asserting either or both of the bypass controls. 


PSEUDO 
16-BIT/PIXEL 
GRAPHICS 
MODE 


In the pseudo 16-bitlpixel 
graphics mode each 
32-bit data word is made up of two, 16-bit pixel 
words. The 82750DB processes each 16-bit pixel 
word, so that the least significant 8 bits correspond 
to pixel information, and the most significant 8 bits 
are used as alpha information. The 82750DB uses 
the lower 8 bits as inputs to all three color tables. 
This makes the color maps look like a single, 256 x 
24-bit color table. If the Graphics Pass bit is assert- 
ed, the CLUT will be bypassed and the 8-bit values 
of the Y, U, and V channels will be input to each 
channel of the converter matrix. 


PSEUDO 
16-BIT/PIXEL 
VIDEO 
MODE 


When used with subsampled VU information, the 
least significant 8 bits of the pixel word are actually a 
luminance value. The most significant 8 bits are 
used as alpha information. The VU information is 
generated by the 82750DB interpolator. Each of the 
color maps uses the corresponding 8-bit video com- 
ponent as an addess. By asserting the Video Pass 
bit in the Miscellaneous Control register, the color 
table can be bypassed. 
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PSEUDO 
16-BIT/PIXEL 
MIXED 
MODE 


In this mode the LSB of Y or U is used as the switch 
flag to change the index to the color tables. When 
the LSB of Y or U is set to a one, the lower 8-bit 
value corresponds to a luminance value, and the V 
and U values are the chrominance information. In 
this case, each video component of the 827500B is 
used as a colormap address as described above. 
When the LSB of Y or U is set to zero, the V and U 
values from the interpolator are not used, and the Y 
value is used as the address to all color tables. 


16-BIT IPIXEL 
GRAPHICS 
MODE 


The 16-bit pixel word is broken up on the 827500B 
to yield 6 bits of Y, and 5 bits each of V and U. The Y 
bits are the least significant, and the U bits are the 
most significant. These values are then padded with 
zeros in the lower order bits, to obtain an 8-bit word 
for each pixel component. Each component ad- 
dresses its respective CLUT. However, the Y chan- 
nel may access only 64 unique locations, and 5-bit 
resolution for VU restricts them to 32 unique loca- 
tions each. The address range may be extended by 
using the colormap mask registers to add 2 bits of 
precision in the least significant bits for Y and 3 least 
significant bits each for VU channels. This allows the 
programmer to access all the entries in the color 
table by reprogramming the MASK OATA and MASK 
SET registers during the blanking interval. 


16-BIT/PIXEL 
VIDEO 
MODE 


This mode works like the 8-bitlpixel video mode de- 
scribed above, except that the 827500B has pro- 
cessed the information so that the Y channel con- 
tains the least significant 8 bits of the 16-bit data 
word. The V and U information is generated by the 
VU interpolator. If the SPLlTCLUT mode is selected, 
the LSB of the address is forced to an odd entry in 
the three color tables. 


16-BIT IPIXEL 
MIXED 
MODE 


When the switch flag is zero, the graphics mode is 
selected and the inputs to the CLUT are the respec- 
tive YUV data in the 6-5-5 format. These pixel values 
are extended by using the colormap masking regis- 
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ters. When the switch flag indicates the video mode, 
the lower 8 bits of the 16-bit pixel word and the VU 
values obtained from the interpolar are input to their 
respective CLUTs. If the SPLlTCLUT mode is select- 
ed, the LSB of the address is forced to either an odd 
or even entry in the three color tables, depending on 
whether the data is video or graphics information. 


32-BIT IPIXEL 
GRAPHICS 
MODE 


Eight bits each of Y, U, and V are used as addresses 
to each segment of the color table. Since the size of 
the addressable color space is not increased, the 
advantage of using the color map is for special ef- 
fects or gamma correction. The most significant 8 
bits of the 32-bit data word are used for the alpha 
channel data. If the Graphics Pass bit is asserted, 
the CLUT will be bypassed and the 8-bit values of 
the Y, V, and U will be input to each channel of the 
converter matrix. 


32-BIT IPIXEL 
VIDEO 
MODE 


The Y channel contains the least significant 8 bits of 
the 32-bit data word. The U and V information is 
generated by the VU interpolator. The YUV channels 
are input to their respective color tables. The size of 
the addressable color space is not increased, but 
this can be used to take advantage of a non-linear 
transformation, which may aid in the decompression 
process. The most Significant8 bits of the data word 
are used for the alpha channel data. 


32-BIT IPIXEL 
MIXED 
MODE 


When the switch flag is zero, the graphics mode is 
selected, and the inputs to the CLUT are the respec- 
tive 8 bits each of YUV data. These pixel values may 
be masked by using the colormap mask data and 
mask set registers. When the switch flag indicates 
the video mode, the lower 8 bits of the pixel word 
and the VU values obtained from the interpolator are 
input to their respective CLUTs. If the SPLlTCLUT 
mode is selected, the LSB of the address is set to 
either an odd or even entry in the three color tables, 
depending on whether the data is video or graphics 
information. The most significant 8 bits of the data 
word are used for the alpha channel data. 
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Y Interpolator 


The Y Interpolator performs a 2X horizontal linear 
interpolation on each line of Y values. When Y inter- 
polation is enabled, the internal pixel clock is twice 
the frequency of PIXCLK output. 


NOTE: 


If Yinterpolation is enabled, then only the integer 
values of pixel times greater than 1X may be 
used 


The interpolation may be separately controlled for 
both video and graphics pixels, via the Viden and 
Gren bits (bits 12 and 11) of the General Control 
register. A video pixel is defined as one generated 
using VU interpolated values. A graphics pixel does 
not use the VU interpolator. The effects of setting 
the control bits, the 8275008 enable flag, and vid- 
eo/graphics pixel switch (V/G Switch) on the output 
of the interpolator are summarized in Table 2-4. 


8ecause of the asymmetric nature of the internal . 
pixel clock used on 8275008, the number of T-cy- 
cles between successive Y pixels varies depending 
on the programmed pixel width. When enabled, 
there is a pipeline delay through the Y Interpolator 
equal to the number of T-cycles between each inter- 
nal pixel clock. 


When the interpolator is bypassed as described 
above, there is a fixed delay through this block. The 
V and U data are delayed by one pixel clock to allow 
the chroma data to line up with the luminance data. 
Other control signals, such as the register address 
byte (most significant byte of the 32-bit data word 
read from VRAM), the pixel clock, horizontal and 
yertical active displ~ys, composite blanking, and reg- 
ister load enable signals are also delayed by one 
pixel clock in order to line up with the YUV data. The 
proqramrner must ensure that the active display tim- 
Ing IS programmed to take the appropriate delay 
through the Y Interpolator into account. 
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Table 2-4. Control Bit Settings and 
Resulting Interpolator 
Output 


8275008 
Vlden 
Gren 
V/G 
Enable 
Switch 
Result 


0 
X 
X 
X 
Interpolator 
Bypassed 


1 
0 
0 
X 
Interpolator 
Bypassed 


1 
0 
1 
0 
Interpolate 
GraphicsPixel 


1 
0 
1 
1 
DoNot 
Interpolate 
VideoPixel 


1 
1 
0 
1 
Interpolate 
VideoPixel 


1 
1 
0 
0 
Do Not 
Interpolate 
GraphicsPixel 


1 
1 
1 
X 
Interpolate 
BothVideo 
and Graphics 
Pixels 


Cursor 


Hardware support for a 16 x 16-pixel cursor has 
been included on the 8275008. The cursor is capa- 


. ble of providing sharp color transitions, when using 
subsampled VU bitmaps. Software intervention is 
minimized, leaving the host with more processing cy- 
cles to perform other operations. 


Under normal operation, the XY starting display po- 
sition of the cursor is loaded into the Cursor Control 
register during a 827500B register load. On the dis- 
play line corresponding to the Y start position, the 
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terpret the next sixteen 32-bit words of reqister data 
as the cursor bitmap, and will disable the other regis- 
ters on the 82750D8 from decoding the address 
field of the 32-bit data word. (The checksum of the 
82750D8 register data is not performed during the 
loading of the cursor bitmap data.) The cursor bit- 
map will be loaded a line at a time, starting at line 
zero and continuing in sequential order to line 15. 
Each line in the cursor map actually contains sixteen 
2-bit cursor pixels, with the two least significant bits 
corresponding to the first cursor pixel in that line, 
and the two most significant bits corresponding to 
the 16th cursor pixel on that line. Each 2-bit pixel 
may select one of the three Cursor Color registers or 
transparency, according to the format indicated in 
Table 2-5. 


Table 2-5. Cursor Color Registers 


Cursor 
Plxel 
Output 


00 
Transparency 
(Cursor Pixel Not Displayed) 


01 
Cursor Color Register 1 


10 
Cursor Color Register 2 


11 
Cursor Color Register 3 


Three 24-bit color registers that hold the color infor- 
mation for the cursor may be written to at any time 
during the register load. The cursor may be loaded 
any time during the blanking intervals of the display. 
For displays that do not program the cursor during 
the display, the cursor bitmap may be loaded during 
the vertical blanking interval. 


When the T-cycle count equals the value pro- 
grammed into the X start position of the Cursor Con- 
trol register, the first cursor pixel can be displayed. 
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16 x 32 pixels. For non-interlaced displays, the same 
line of cursor information will appear on the same 
line every field. The cursor in this case will be 16 x 
16 pixels. The size of the cursor may be doubled 
independently in the horizontal and/or vertical direc- 
tion by setting the 2X Horizontal Cursor or 2X Verti- 
cal Cursor bit in the General Control register. In this 
case, no new data is loaded into the cursor map; the 
data is just replicated in the corresponding dimen- 
sion. Table 2-6 summarizes some of the possible 
cursor sizes. Note that by loading the cursor bitmap 
with different data at the start of every field, cursor 
sizes not listed below may be achieved. 


Table 2-6. Cursor Sizes 


2X Horz. 
2X Vert. 
Display 
Cursor Size 
Cursor 
Cursor 
(In Plxels) 


Off 
Off 
Interlaced 
16 x 32 


On 
Off 
Interlaced 
32x32 


Off 
On 
Interlaced 
16x64 


On 
On 
Interlaced 
32x64 


Off 
Off 
Non-Interlaced 
16 x 16 


On 
Off 
Non-Interlaced 
32x 16 


Off 
On 
Non-Interlaced 
16x32 


On 
On 
Non-Interlaced 
32x32 


There is a complex relationship between the cursor 
and the pixel data especially when using non-inte- 
gral divisors of the pixel clocks. Since the pixel data 
output from the 82750D8 pixel path always changes 
coincident with the rising edge of the clock, the cur- 
sor start position must be positioned on the rising 
edge of any period of the pixel clock. The program- 
mer must enforce the corresponding restrictions on 
the start and stop position of the cursor. 
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YUV to RGB Converter 


The following 
equations 
give the theoretical 
relation- 
ship between 
analog RGB components, 
R, G, B, and 
analog 
YUV components, 
Y, U, V. 


Y = 
0.298822 R + 0.586816 G + 0.114363 8 


V = R - Y = 0.701178R - 0.586816G -0.1143638 


U = 8 - Y = -0.298822 R - 0.586816 G + 0.8856378 


where: 
0.0 < G, R, B < 1.0 


0.0 < Y < 1.0 


-0.701 
< V < +0.701 


-0.886 
< U < -0.886 


Solving 
for G, R, B, we can obtain the inverse 
rela- 
tionship: 


G = Y - 0.509228 V - 0.194888 U 


R=Y+V 


8=Y+U 


where: 
0.0 < G, R, B < 1.0 


0.0 < Y < 1.0 


-0.701 
< V < +0.701 


-0.886 
< U < +0.886 


The luminance 
channel 
for the YUV inputs 
is pre- 
sumed 
to swing between 
O.OV and 1.OV. However, 


the chroma 
components 
do not and need to be nor- 
malized 
to a OV to 1V range. The offset 
binary en- 
coding 
used to obtain 
unsigned 
numbers 
must also 
be accounted 
for. This encoding 
should center the V 
and U inputs 
at the midpoint 
of the voltage 
range. 
The 
equations 
for the 
normalized 
version 
of Y, V, 


and U (Y', V', and U' respectively) 
are: 


Y' = Y 


V' = 0.5V + 0.5 


0.701 


U' = 0.5U + 0.5 
0.886 


where: 
0.0 < Y', V' U' < 1.0 


0.0 < Y < 1.0 


-0.701 
< V < +0.701 


- 0.886 < U < + 0.886 
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When 
converting 
the normalized 
analog 
values 
Y', 


V', 
U' to digital 
y, v, u values, 
the 
D.C. offset 
and 
conversion 
ranges 
are 
compatible 
with 
the 
CCIR 
601 standard 
for digital 
video. 
The 
ranges 
for the 
components 
and the corresponding 
Digital to Ana- 


log equivalent 
equations 
are given below: 
(1a) 


(1b) 


(1c) 


y = (235 - 16)Y' + 16 


where: 
16 < Y < 235 


v = (240 -16)V' 
+ 16 


where: 
16 < v < 240 


u = (240 -16)U' 
+ 16 


where: 
16 < u < 240 


(2a) 


(2b) 


(2c) 


Substituting 
the 
normalized 
analog 
voltages 
of 


Equation 
3 into Equation 
4, we obtain the digital ver- 


sion of the input data, used in the DVI® Technology 
system: 


y = (219)Y + 16 


112V 
v=-+ 
128 
0.701 


112U 
u =--+ 
128 
0.886 


where: 
0.0 < Y < 1.0 


-0.886 
< U < 0.886 


-0.701 
< V < 0.701 


16 < Y < 235 


16 < v, u < 240 


By solving 
equations 
5 for Y, U, V, and substituting 
into Equation 
2, we get the relationship 
between 
an- 
alog R, G, B and the digital DVI y, u, v data: 


(3a) 
G = 0.004566y - 0.003187 v - 0.001541 u + 0.532242 
(6a) 


(3b) 
R = 0.004566 Y+ 0.006259 v - 0.874202 
(6b) 


(3c) 


8 = 0.004566 Y+ 0.007911 u - 1.085631 
(6c) 


where: 
0.0 < R, G, B < 1.0 


16 < Y < 235 


16 < v, u < 240 


(4a) 


(4b) 


(4c) 


(5a) 


(5b) 


(5c) 
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If the inputs 
of the Digital to Analog 
Converter 
are 
scaled to accommodate 
the nominal input range of 0 
to 219, we obtain the following 
relationship 
between 
the inputs 
to the DVI Technology 
system, 
(y, v, u) 
and inputs to the Digital to Analog 
Converters 
(r, g, 
b). Note that all out of range RG8 values (> 255 or 
< 0 due to excursions 
in the inputs) 
are clipped 
to 
255 or o. 


9 = y - 0.696001 v - 0.337633u + 116.56116 
(7a) 


,= 
y + 1.370705 v - 191.45029 
(7b) 


b = y + 1.732446 u - 237.75314 


where: 
16 < Y < 235 


16 < v, u < 240 
o < g, r, b < 255 


8y substitution 
of Equation 
5 into Equation 
1, and by 
converting 
G, R, and 8 to digital values, we can ob- 
tain the inverse 
relationship 
of Equation 
7: 


y = +0.296622, 
+ 0.5666169 + 0.114363b + 16 
(8a) 


u = -0.172466, 
- 0.3367219 + 0.511206b + 126 
(8b) 


v = +0.511545, 
- 0.4261129 - 0.063434b + 126 
(8c) 


where: 
16 < Y < 235 


16 < v, u < 240 
o < g, r, b < 255 


Output Equalization 


The units on the 8275008 
process the pixel informa- 
tion 
at the 
operating 
frequency 
of the 
chip. 
If the 
output 
pixel rate is not equal 
to the 
maximum 
fre- 
quency, 
the units have null states during which pro- 
cessing 
is suspended. 
This type of operation 
is nec- 
essary 
on 
the 
8275008 
because 
of 
the 
large 
amount 
of pipelining. 
Table 2-7 gives the pattern 
of 
T-cycles 
on the 8275008 
during which processing 
is 
active, 
according 
to the programming 
shown 
in Ta- 
ble 4-2. 


The pixel information 
must be output at a rate that is 
some 
sub-multiple 
of the operating 
frequency. 
The 
divisor is programmed 
by the user, and may be from 
1 to 12 times slower 
than the period of FREQIN, 
in 


increments 
of %. Divisors of 13 and 14 are also pro- 
grammable. 
8ecause 
non-integral 
divisors 
are used, 


it is necessary 
for the 8275008 
to output 
different 


information 
on both phases of FREQIN. This is illus- 


trated 
in Figure 2-9, which uses a 2.5 divisor for the 
clock. 
Notice 
that 
the 
pixel clock 
output 
(PIXCLK) 


1-88 


(7c) 


transitions 
fall alternately 
on the active 
and inactive 
phase of the input frequency, 
while the internal 
pixel 
clock 
transitions 
always 
occur 
on the active 
phase. 


Also note that 
PIXCLK 
does 
not have a 50% 
duty 
cycle. 


The equalizing 
logic derives 
a clock that has a peri- 
od equal to the programmed 
pixel rate, providing 
an 
edge to sample 
the output 
information. 
This allows 
the 
Digital 
to Analog 
Converter 
to directly 
sample 
the output 
of the pixel data path before 
performing 
the analog 
conversion. 


Table 2-7. 8275008 
Active T-Cycle 
Patterns 


PlxelTlme 
Pattern 
Of Internal 
(T-Cycles) 
Pixel Clock 


1 
Always On 


1.5 
1 On/1 
On/1 
Off 


2 
1 On/1 
Off 


2.5 
1 On/1 
Off/1 
On/2 
Off 


3 
1 On/2 
Off 


3.5 
1 On/2 
Off/1 
On/3 
Off 


4 
1 On/3 
Off 


4.5 
1 On/3 
Off/1 
On/4 
Off 


5 
1 On/4 
Off 


5.5 
1 On/4 
Off/1 
On/5 
Off 


6 
1 On/5 
Off 


6.5 
1 On/5 
Off/1 
On/6 
Off 


7 
1 On/6 
Off 


7.5 
1 On/60ff/1 
On/7 
Off 


8 
1 On/7 
Off 


8.5 
1 On/7 
Off/1 
On/8 
Off 


9 
1 On/8 
Off 


9.5 
1 On/8 
Off/1 
On/9 
Off 


10 
1 On/9 
Off 


10.5 
1 On/9 
Off/1 
On/10 
Off 


11 
1 On/10 
Off 


11.5 
1 On/10 
Off/1 
On/11 
Off 


12 
1 On/11 
Off 


13 
1 On/12 
Off 


14 
1 On/13 
Off 
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Figure 
2-9. Divide 
by 2.5 Plxel Clock 


Digital to Analog Converters 


The Digital to Analog 
Converters 
(DACs) take three 
channels 
of video 
information 
output 
from the pixel 
data 
path, converting 
it from 
8-bit digital 
values 
to 
analog 
voltage 
levels typically 
between 
OV and 1V. 


The 
conversion 
is monotonic, 
and a pixel clock 
is 
used 
to 
derive 
a two-phase 
clock 
internal 
to the 
DAC:The 
data is sampled 
from the output 
of either 


the 
pixel 
path, 
or the 
YUV to RGB 
matrix 
on the 


rising edge of the internal 
active phase of this clock. 


The DISDAC input pin can be asserted to disable the 
analog 
outputs 
and place 
them 
into a high-imped- 
ance state. 


The analog outputs 
of the triple DAC are referenced 


to an external 
current 
source, 
which 
must 
be con- 
nected to the IREFIN pin. All the analog outputs 
are 
scaled 
by this current 
reference. 
The value 
of the 
analog 
output full scale is as follows: 


lIs = Irel • 255 
18.5 


where: 
Iref is the magnitude 
of the reference 
current. 


The output voltage 
generated 
at full scale is: 


VIs = lIs • Rext 


Rext is the load resistance 
value. 


A typical output load for the analog outputs 
(RV, BU, 


GY) is 100n. 
The speed 
of the DAC analog 
output 


rise and fall times 
is determined 
by the time 
con- 


stant: 


Rext • (Cext + Cout) 


I 


where: 
Cext is the external 
capacitance 
applied and 


Cout is the intrinsic 
capacitance 
of an ana- 


log output. 


For high performance 
the objective 
would be to mini- 
mize Rext and Cext. The voltage 
Voutfs 
can be de- 


termined 
by any combination 
of Ifs and 
Rext, 
but 


must not exceed 
1.5V. In addition, 
Ifs must not ex- 


ceed 14.7 mA. The analog 
outputs 
must go through 


an external 
buffer 
to drive 
doubly-terminated 
75n 


coax line. 


Table 2-8 lists pins which are used to configure 
the 


triple DAC. 


Table 2-8. Digital 
To Analog 
Converter 
Pins 


Signal 
Description 


IREFIN 
Analog Current Reference. 
Must Be 


Decoupled 
to AVCC. 


VGCS 
Internal Voltage 
Reference. 
Must 


Be Decoupled 
to AVCC. 


AVcc 
Analog Power 


AVss 
Analog Ground 


GY, RV, BU 
Analog Pixel Outputs 


DISDIG 
Disable Digital Outputs 


DISDAC 
Disable Analog Outputs 


NOTE: 


The digital video outputs must be disabled by 
setting DISDIG high whenever the analog out- 
puts are used Otherwise the AC and DC charac- 
teristics of the DAC are not guaranteed 
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After power-up, the host processor must set the 
RESETB# input low for a minimum of ten T-cy- 
cles in order to reset the 827500B. The host 
must then set the RESETB# input high to start 
normal operation. 


When the RESETB# input is released, a Start of 
Vertical Field command (VOOO)is sent for 16 T-cy- 
cles to the 82750PB via the VBUS. This code is im- 
mediately followed by a Register Transfer Reque~t 
command (REGX) that is held for 256 T-cycles. This 
256 T-cycle wait assures that the 82750PB has am- 
ple time to honor the 82750.oB register tra~sfer re- 
quest. The register data IS then read Into the 
827500B from the serial port of the VRAMs at a rate 
that is equal to % of the operating frequency. If the 
register transfer does not terminate after 256 T-cy- 
cles, the 827500B will automatically stop the trans- 
fer, send an 827500BSO code to the 82750PB, and 
re-enter the reset state. 


Ouring this register transfer, and on all sub~equent 
register transfers (programmed or automatic), t~e 
827500B performs a vertical checksum on the regis- 
ter data. The last 32-bit word read in during a regis- 
ter transfer is the user-generated checksum of that 
register data. If the 827500B-generated checksum 
error does not match the user-generated checksum, 
the 827500B sends a SHUTOOWN code to the 
82750PB via the VBUS, and will automatically re-en- 
ter the reset state. The 827500B will remain in the 
reset state until the RESETB# input is toggled by 
the host processor. Any VRAM requests or control 
signals programmed to occur during this time will be 
ignored. 


Normal programmed operations start after the first 
successful register load. Frame timing will start at 
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After the first field automatic register transfers will 
occur on the second line of each subsequent field. 
Note that all register transfers will occur at % of the 
operating frequency of the 827500B, unless the 1X 
or 1/2X 
SCLK mode has been programmed by the 
user. 


Throughout the reset process,.the states .~f all out- 
puts become valid at various times. Specifically, af- 
ter 
being 
held 
low 
for 
at 
least 
10 
~-cycles, 
RESETB# must transition to a high state In order 
to initiate normal operation. By the time RESETB# 
reaches this low to high transition, the states of 
SCLK[1:01, VBUS[3:01, HSYNC, VSYNC, CSYNC, 
and 
FCO 
are 
valid. 
Ten 
T-cycles 
following 
RESETB#'s transition from low to high, the states of 
BG, CB, ACTOIS, PIXCLK, OGY[7:01,ORV[7:0], and 
OBU[7:0] become valid. ALPHA[7:0] and BPP[1:0] 
signals reach a valid state 10 T-cycles fol~owingthe 
completion of the first register load follOWingreset. 


Input/Output 
Transformation 


In general, the control outputs, including the syn~ 
signals, are delayed by pipelining effects ~romtheir 
corresponding inputs. If the output sync signals are 
taken as the time base, the first pixel in a line is 
actually fetched by an SCLK that is up to 19 T-c~cles 
before its corresponding PIXCLK. Some later pixels 
may be delayed by an additional number of T-cycles, 
depending upon bits/pixels, pixel timing, and wheth- 
er Y interpolation is enabled. 


Outside of the active display region and before the 
blanking output is asserted, border pixels are output. 
Where the blanking region has been entered and the 
display is not active, the output is the value con- 
tained in the Blanking Color register. 
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Pixel handling in the active region is defined by three 
parameters: 
1. The bits/pixel parameter. 
2. Whether VU interpolation is in effect or not. 
3. If the 827500B Enable bit has been selected. 


VU interpolation is in effect for a given pixel if: 
1. The VU interpolator is turned on (VU sample load 
set to non-zero load value), 


AND 
2. VU interpolation display is permitted (VU interpo- 
lation display operations bit equals 1), 


AND 
3. One of the two following conditions is met: 
a. Either the interpolation is unconditional, 


OR 
b. The controlling Y or the controlling U sample 
for this pixel has a least significant bit of 1. 


The value of the alpha output may come from one of 
the following three sources: 
1. It may be explicitly coded into the pixel data (32- 
bitlpixel 
and 
pseudo 
16-bitlpixel 
with 
Alpha 
modes only). 
2. It may be output from one of two programmable 
registers, AlphaOand Alpha1. 
3. During the portion of the display when the border 
is active, the 8 most significant bits of the Border 
Alpha register may be output. 


Table 3-1 illustrates how the Alpha outputs are se- 
lected. 


Table 3-1. Selecting 
Alpha Outputs 


Alpha 
Alpha 
Alpha Output 
Enable 
Trap Select 


0 
X 
AlphaORegister 


1 
0 
AlphaORegister 
(8,16 bpp) 


1 
0 
MS Byte of Pixel 
(32, Pseudo 16 bpp) 


1 
1 
Trap Match = 0, 
AlphaORegister 


1 
1 
Trap Match = 1, 
Alpha1 Register 
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Genlocking 
on the 8275008 


The genlocking algorithm on the 827500B uses hori- 
zontal 
and 
vertical 
resets, 
HRESET;; 
and 
VRESET;;, obtained from an external device. When 
the Genlock bit in the Miscellaneous Control register 
is off, the 827500B will ignore all signals present on 
its HRESET;; and VRESET;; inputs. The 827500B 
will resync itself when the programmed end of line 
count is received. This allows the user to turn off 
genlock without having to worry about the state of 
the input video. 


When the Genlock bit is set to one, the 827500B will 
use the external resets to reset its internal horizontal 
and vertical sync counters. In this case, the width of 
the active line is determined by the HRESET;; sig- 
nal, and the length of the field is governed by 
VRESET;;. The programmed values for these reg- 
isters will be ignored. As shown in Figure 3-1, 
when asserted VRESET;; and HRESET;; are ef- 
fected just after the third falling edge of FREQIN. 
VRESET;; has no effect on the 827500B if the first 
half of the first line of an odd field or the second (and 
only) half of the first line of an even field is already in 
progress. HRESET;; has no effect on the 827500B 
if it occurs during the programmed first half of the 
line. The user may decrease the effect of jitter by 
reducing the "window" during which the vertical re- 
set signal is supposed to occur. This can be done by 
scheduling a register load to occur after the vertical 
active display time has ended, thereby decreasing 
the programmable horizontal active window to a size 
acceptable for the video source. When VRESET;; is 
received during this reduced, programmed horizon- 
tal active window, the 827500B is reset to an even 
vertical field. When VRESET;; occurs at any other 
time in the horizontal scan line, the 827500B is set 
to an odd field. 
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Figure 3·1. Horizontal 
and Vertical 
Reset Timing 


Digitizing Images with the 82750DB 


Digitizing is enabled by setting the Digitize Enable bit 
in the Miscellaneous Control register. Note that en- 
abling the digitize mode does not automatically en- 
able genlocking. The Genlock bit must be set sepa- 
rately if it is required. When digitizing, the 82750DB 
is used to shift digitized data into the VRAM shift 
registers, and then transfer this data into the VRAM 
array. 


The 82750DB also provides an external "digitizer 
window" signal, FCO.This signal defines the vertical 
active region that the digitizer enabled. Typically, the 
user sets up the display parameters to reflect the 
"window" of the display to be digitized. The horizon- 
tal and vertical active window size can be selected 
by programming the Active Start and Stop registers. 
FCO is derived from the Vertical Start and Stop reg- 
isters, and is used to enable the digitizer to drive the 
VRAM bus. During the programmed vertical blanking 
interval the FCO signal will be negated, and there- 
fore, the digitizer is prohibited from driving the VRAM 
bus. This will allow data to be read from the VRAM 
serial data bus during the automatic register transfer 
that is performed at the start of the field. Note that it 
will still be possible to program the 82750DB to digi- 
tize during the vertical blanking interval, in order, for 
example, to capture time codes from a VCR. 
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When capturing and displaying NTSC data during 
the horizontal blanking interval of the first display 
line, a WRDIGINP command is sent on the VBUS to 
the 82750PB. (Refer to Figure 3-2.) Recall that there 
is a 
5-line vertical 
pipeline 
delay through 
the 
82750DB. If the first display line is programmed to 
be n, the first display line will occur at n + 5. Similar- 
ly, if the last line is programmed to be m, then the 
last display will be line m + 5. The WRDIGINP 
VBUS code causes a dummy write transfer cycle 
that places the VRAMs in the write mode. The 
82750PB then sets the bitmap pointers to the first 
line's address (LO). 
This code is immediately fol- 
lowed by another WRDIGINP command that causes 
the 82750PB to perform a write transfer cycle at the 
LO address. Since no digitized data has been read 
in, invalid data is loaded into row LO of the VRAM 
array. 


During the active display of the first display line, the 
82750DB provides shift clocks at the programmed 
pixel rate. The digitized data is shifted into the 
VRAMs while the user-programmed horizontal active 
window is active. During the horizontal blanking in- 
terval of the next line, the 82750DB sends a WRDIGI 
code to the 82750PB, thereby transferring the LO 
data from the shift register to the VRAM array at the 
LO address. The 82750PB performs a pitch calcula- 
tion, pointing it to the L1 row. After the WRDIGI 
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Figure 3-2. Digitizing 
Example 


transfer 
has 
finished, 
the 
82750DB 
issues 
a 
WDIGINP command to the 82750PB that performs a 
write transfer cycle at L1 address. This will write the 
LOdata into the L1 address. The next line the L1 row 
will be written over with L1 data. This same proce- 
dure continues for the entire active display, until the 
last active line is reached (m + 5). A final pair of 
WRDIGI and WRDIGINP codes are sent to the 
82750PB to load in the last line of data. At the start 
of horizontal sync of the next line, the FCD signal 
will be negated. 


The purpose of the WDIGINP may not be apparent 
at first glance. This signal ensures that the correct 
data is written into the last selected VRAM address. 
This is necessary when crossing the physical bound- 
aries of VRAM memory. 


When the 82750DB is genlocked, the digitizing 
device 
must 
also 
provide 
the 
HRESET# 
and 
VRESET # signals. The device must ensure that 
VRESET# is never asserted during the start of the 
line. This allows a register transfer (which shortens 
the active display and is required for digitizing) to 
complete before the start of a field register transfer. 
I 


The vertical sync pulses are buffered, so the start of 
the field transfer request can be honored immediate- 
ly after the previous transfer request is finished. 


Also, captured NTSC data may be displayed on a 
VGA-type monitor. This requires the 82750DB to op- 
erate at a VGA frequency (approximately 31.5 kHz), 
which is twice that of NTSC. Each line of captured 
NTSC data is read into the 82750DB twice. Setting 
the line replicate bit makes doubling of memory un- 
necessary. Figure 3-3 illustrates how the 82750DB 
operates in such a mode. The Line Replicate, Digitiz- 
er, and Genlock bits in the Miscellaneous Control 
register are assumed to be set to one. During the 
HBI of the first display line, a dummy write transfer 
cycle (WRDIGINP) places the VRAMs in the write 
mode. The 82750PB then sets the bitmap pointers 
to the first line's address (LO).This code is immedi- 
ately followed by a WDIGINP command, causing the 
82750PB to perform a write transfer cycle at the LO 
address. Since no digitized data has been read in, 
unknown values are loaded into row LOof the VRAM 
array. 
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Figure 3-3. Digitizing 
Example with Line Replicate 


At the end of the first line the 82750DB sends two 
WRDIGINP codes to the 82750PB, thereby transfer- 
ring the LOdata from the shift register to the VRAM 
array at the LOaddress. The 82750PB does not per- 
form a pitch calculation, so the pointer remains at 
the address for LO. After the second display line 
(which has the same data as the first line), a 
WRDIGI code is sent to the 82750PB that writes the 
LOdata to the LOaddress and updates the bitmap 
pointer to L1. The WRDIGINP signal immediately fol- 
lowing this selects the L1 address. After the third 
line of data, two WRDIGINP codes that select 
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the L1 address are sent. After the fourth line, (which 
has the same data as the third line) a write operation 
is performed to load L1 data into the L1 address, 
and the 82750PB pointer is updated to address L2. 
A WRDIGINP code is sent to select the L2 address. 
This same procedure continues for the entire active 
display, until the last active line is reached (m + 5). 
A final pair of WRDIGI and WRDIGINP or two 
WRDIGINP codes are set to the 82750PB to load in 
the last line of data. At the start of horizontal sync of 
the next line, the FeD signal will be negated. 


I 


intel~ 


4.0 
PROGRAMMING 
THE 8275008 


Overview 


All registers are loaded by the issuance of a REGX 
command from the 82750DB to the 82750PB over 
the VBUS. This causes the 82750PB to load a se- 
quence of register values into the VRAM serial out- 
put registers from an address designated by a 
82750DB register pointer. After the request is grant- 
ed, a new 82750DB register word is read in with 
each SCLK. Each 32-bit word consists of a register 
address in the high byte and register values in the 
rest of the word. The sequence is terminated by a 
stop code that corresponds to the address byte be- 
ing equal to Oxff.A variable number of 32-bit words 
can be loaded. During reset, if a stop bit is not found 
within 256 T-cycles, the register transfer is terminat- 
ed, a SHUTDOWN code is asserted on the VBUS, 
and the 82750DB returns to the reset state. All 
transfer requests are terminated at the start of a new 
field. This ensures that non-terminating register 
transfers caused by bad register data will be halted. 


During this register transfer, and on all subsequent 
register transfers (programmed or automatic), the 
82750DB performs a vertical checksum on the regis- 
ter data. The last 32-bit word read in during a regis- 
ter transfer is the user-generated checksum of that 
register data. If the 82750DB-generated checksum 
error does not match the user-generated checksum, 
the 82750DB sends out a SHUTDOWN code to the 
82750PB via the VBUS, and will automatically re-en- 
ter the reset state. 
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Pipeline Delay through the 8275008 


The actual horizontal pipeline delay thro.ugh the 
82750DB is dependent on processing. elements 
used to generate the output. If Y interpolation is not 
used, the pipeline delay is: 


Horiz.Active Pipeline Delay = 16 cycles + 


SCLK Transfer Timing Delay 


Here the SCLK Transfer Timing Delay is 1 for 1X, 2 
for 1/2X, 
and 3 for 1/3X. 


If Y interpolation is used, the pipeline delay is: 


Horiz. Pipeline Delay = 16 cycles + 
SCLK Transfer Timing Delay + Integer (PixelTime) 


The integer (Pixel Time) is simply the integer value 
of the programmed pixel time. The horizontal pipe- 
line delay for blanking differs from that of active. 
When y-interpolation is on or off, the pipeline delay 
for horizontal blanking is: 


Horiz. Blanking Pipeline Delay = 10 cycles + 


SCLK Transfer Timing Delay 


The horizontal sync pipeline delay is always equal to 
o cycles. 


Thus all horizontal parameters, (e.g., horizontal 
blanking start, active stop) must be programmed to 
account for the total horizontal pipeline delay. The 
vertical blanking and vertical sync pipeline delay are 
always equal to 0 lines. All vertical parameters must 
be programmed so that this delay is taken into ac- 
count. 


• 
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PROGRAMMING 
CONSIDERATIONS 


The user must ensure that the 827500B is pro- 
grammed correctly. Illegal or illogical combinations 
of display parameters are not corrected in hardware, 
and may cause the 827500B to output erroneous 
display or timing information. The following list high- 
lights some basic guidelines to follow when pro- 
gramming the 827500B. 
1. The maximum rate that data may be read into the 


827500B is determined by the type of memory 
used. This in turn effects the maximum rate and 
depth of data that can be displayed. If 32 bits of 
data can only be read into the 827500B every 
two clock cycles, only 16 bits of data may be dis- 
played 
every 
clock 
cycle. 
The 
programmer 


should match the transfer rate (1X, 1/2X, or 
1/3X) with the memory speed, and the display 
pixel rate with the pixel depth and memory band- 
width. 
2. Blanking intervals of the display are defined by 


the non-active programmed time. During this por- 
tion of the display, programmed transfers take 
place. If a transfer does not complete before the 
start of the active display, it is terminated, and 
active display data is shifted into the 827500B at 
the programmed rate. During horizontal blanking 
intervals, the user should allow enough time for 
all programmed register, colormap, and VU data 
transfers to complete. 


3. When digitizing (capturing) images, no other bit- 


map transfers (e.g., REGX,VU) should be sched- 
uled to occur during the active portion of the field. 
4. Active start and stop times should not be pro- 


grammed to overlap the blanking stop and start 
times, taking the 
pipeline delay through the 


827500B into account. 


5. Programming the Y interpolation to occur in a 


non-integral pixel width will cause the Y channel 
to output incorrect data. 


CURSOR 
REGISTERS 


The following registers are used to program the 
characteristics of the on-chip cursor. 


Cursor 
Position 
Update 
Register 
Ox5b 


31 
24 
23 
12 
11 


o 10 110 1 1 
VerticalPosi1ion 
HorizontalPosition 


- 
Horizontal 
Position 
in units of T-cycles 


- 
Vertical 
Position 
in units of full lines 


This register gives the horizontal and vertical posi- 
tion of the cursor. The cursor will extend 16-pixel 
periods, starting at the prescribed horizontal posi- 
tion, for the next 16 lines. (Or 32-pixel periods for 32 
lines if the 2X Cursor Mode bits in the General Con- 
trol register are set to one.) 
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Cursor 
Control 
Register 
Ox5a 


31 
o 
24 
23 
12 
11 


0101 1010 
VerticalPosition 
HorizontalPosition 


- 
Horizontal 
Position 
in units of T-cycles 
- 
Vertical 
Position 
in units of full lines 


This register also gives the horizontal and vertical 
position of the cursor. The cursor will extend 16-pixel 
periods, starting at the prescribed horizontal posi- 
tion, for the next 16 lines. (Or 32-pixel periods for 32 
lines if the 2X Cursor Mode bits in the General Con- 
trol register are set to one.) Receipt of this address 
also causes the 827500B to interpret the next six- 
teen 32-bit words of register data as the 16 x 16 x 
2-bit cursor map. This will cause the register address 
decoding logic internal to the 827500B to be dis- 
abled, and the next 16 words of information will be 
loaded into the Cursor table. Each 32-bit word will be 
interpreted as a line (16 pixels) of cursor data, with 
the two least significant bits corresponding to the 
first cursor pixel to be displayed. 


Cursor 
Color 3 
Ox59 


If the cursor is enabled and the 24 bits of data in this 
register are selected, the data will be sent directly to 
the YUV conversion matrix during active display. The 
bits should be programmed as RGB values when the 
YUV to RGB matrix is not being used. 


Cursor 
Color 2 
Ox58 


If the cursor is enabled and the 24 bits of data in this 
register are selected, the data will be sent directly to 
the YUV conversion matrix during active display. The 
bits should be programmed as RGB values when the 
YUV to RGB matrix is not being used. 


Cursor 
Color 
1 
Ox57 


o 


If the cursor is enabled and the 24 bits of data in this 
register are selected, the data will be sent directly to 
the YUV conversion matrix during active display. The 
bits should be programmed as RGB values when the 
YUV to RGB matrix is not being used. 
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DISPLAY 
TIMING 
REGISTERS 


Each register has two, 12-bit components, listed 
with least significant bits first, followed by the 12 
most significant bits. Horizontal timing is measured 
in units of T-cycles (periods of the master clock) 
from the start of horizontal sync. The register con- 
tent defines the number of T-cycles that elapse be- 
fore the event controlled by this register takes place. 
The exception to this rule is the base counter, which 
specifies the number of T-cycles/half line. Zero is 
not an allowable value; use the total number of T-cy- 
cles per half line or full line instead. Unused bits 
should be zero. Sync signals are RESET to initial 
values as specified for each; "start" means to set to 
1, and "stop" means to be reset to zero. 


Base Counter 
Ox56 


23 
12 
11 


01 0 101 1 0 
# 01Lines/Field 
#01 T-Cycles/Hall Lines 


- 
T-cycles/Hal Line in units of T-cycles (Periods of the 
master Clock) 
- 
Half Lines/Field in units of half lines 


As defined by NTSC standards, vertical timing can 
be measured from the start of a field in one of two 
ways: either in units of half lines, or in units of full 
lines. When programmed for an interlaced display, 
(i.e., an odd number of half lines per field) the start 
of a field coincides with the start of a line on odd 
fields and with the midpoint of a line on even fields. 
In the latter case, for an event that is programmed in 
full lines, the first half line is ignored, and counting 
begins with the first full line. With this interpretation, 
the register content defines the number of half or full 
lines that elapse before the event controlled by this 
register takes place. The same may be said for the 
horizontal component, which is defined by the num- 
ber of T-cycles/half 
line. The hardware does not 
look for, nor correct illogical combinations of register 
settings. The monitor should be protected from dam- 
age with ex1ernal circuitry when debugging is in 
progress. 


All of the internal timing is derived from comparing 
the programmed values with the values of this regis- 
ter. The horizontal base counter is programmed us- 
ing the least significant 12 bits. In this case the val- 
ues loaded into this register should be one less than 
the desired value. Bits 23 through 12 are used to 
specify the number of half lines per field. 
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Sync Stops 
Ox55 


31 
24 
23 
12 
11 
o 


01010101 
VSYNCStop 
HSYNCStop 


- 
HSYNC Stop in units of T-cycles 
- 
VSYNC Stop in units of half lines 


Sync St~rts 
Ox54 


31 
24 
23 
o 
12 
11 


01010100 
VSYNCStart 
HSYNCStart 


- 
HSYNC Start in units of T-cycles 
- 
VSYNC Start in units of half lines 


o 


The Sync Stops and Sync Starts registers are used 
in conjunction with one another to specify the start 
and stop locations of the horizontal sync, HSYNC, 
and vertical sync, VSYNC, output signals. VSYNC 
may be programmed to start and stop at any time 
during a given field as defined on a half-line interval. 
Bits 23 through 12 in the Sync Starts and Sync 
Stops registers are used to define the start and stop 
times for VSYNC, respectively. Similarly, HSYNC 
may be programmed to start and stop at any line 
position as defined in units of T-cycles. Bits 11 
through 0 in the Sync Starts and Sync Stops regis- 
ters are used to define the start and stop positions 
for HSYNC, respectively. 


The horizontal component of the Sync Stops regis- 
ter also affects the composite sync, of CSYNC out- 
put. In this case, the CSYNC output will be the same 
as the HSYNC output, except during the vertical 
sync and equalization interval. In the latter case, the 
CSYNC output is determined by the Serration and 
Equalization registers. 


Blanking 
Stops 
Ox53 


23 
11 
o 
12 
o 1 0 1 00 1 1 
Vertical Blank Stop 
Horizontal Blank Stop 


- 
HB Stop in units of T-cycles 
- 
VB Stop in units of half lines 


The Blanking Start and Stop registers control the 
composite 
blanking output 
(CB). The 
horizontal 
blanking start and stop position, in units of T-cycles, 
can be specified to occur at any time during the line. 
By the same token, the vertical blanking start and 
stop positions can be programmed to occur at any 
half-line interval. 
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half-line count specified in the Base Counter register 
has been reached. This pulse will be repeated for 
every half line that the VSYNC output is pro- 
grammed to be active, regardless of the position in 
the field. In Figure 4-1, this continues until half line 
12, or line 6. 


o 10 100 10 
VerticalBlankStart 
HorizontalBlankStart 


- 
HB Start in units of T-cycles 
Resetsto 1 
- 
VB Start in units of half lines 
Resetsto 1 


Program values one less than desired for VB Start 
and Stop. For horizontal blanking start, load num- 
bers less than the total horizontal pipeline delay. 


Pro·Equalization 
I... 
Pulse. 
1 
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Post Equatization 
1 
•••• 
••••• 
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Figure 4-1. Programming 
the Video Sync Outputs 
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Equalization 
Parameters 
Ox50 


1211 
o 10 10000 
VerticalEqualizationStopHorizontalEqualizationSt 


- 
EQH Stop in units of T-cycles 
Resets 
to 1 
- 
EQV Stop in units of half lines 
Resets 
to 1 


Ouring the vertical equalizing period, which starts at 
field-beginning, an equalization pulse is output on 
the CSYNC signal at the beginning of each half line, 
as shown in Figure 4-1. The width of this equaliza- 
tion pulse is determined by the value in bits 11 to 0 
of this register. The half line on which these pulses 
are to stop is programmed in bits 23 through 12 of 
this register. If VSYNC is programmed to occur dur- 
ing the equalization interval (as it is for NTSC type 
displays), the serration pulses are output on the 
CSYNC signal. 


Active 
Region 
Stops 
Ox4f 


31 
2423 
1211 


o 100 1 1 1 1 
VerticalActiveStop 
HorizontalActiveStop 


- 
Actdis 
Stop in units of T-cycles 
- 
Vertical 
Stop in units of full lines 


The active region window, during which pixels to be 
displayed are fetched from VRAM, is defined by the 
Active Region Start and Stop registers. The first dis- 
play line is actually five lines after the line indicated 
in the vertical region of the Active Region Start regis- 
ter. The position of the active region on a horizontal 
line is determined by the horizontal component of 
the Active Region Start register. Pixels will be 
fetched from VRAM at a rate determined by the 
number of bits/pixel and pixel widths. In order for the 
827500B to operate properly, the horizontal width of 
the active region window must be an integral number 
of display pixel widths, taking into account the hori- 
zontal pipeline delay. Also, the Active Region Start 
and Stop must fall within a single line boundary, as 
dictated by the Base Counter register. When the first 
pixel actually appears at the output of the 827500B, 
the output is a function ot.the processing elements 
used as discussed above. 


When the active region is over, the border color is 
output 
until 
the 
programmed 
blanking 
time 
is 
reached. Both the border and blanking information is 
output at the transfer rate programmed by the user. 
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Active 
Region 
Starts 


o 
31 
1211 
2423 


o 100 1 1 10 
VerticalActiveStart 
HorizontalActiveStart 


- 
Actdis 
Start in units of T-cycles 
- 
Vertical 
Start in units of full lines 


Burst Gate Stop 


31 
24 
23 
12 
11 
0 
o 100 1 10 1 
VerticalBGStop 
HorizontalBGStop 


- 
Horizontal 
Stop Position 
in units of T-cycles 
- 
Vertical 
Stop Position 
in units of full lines 


The Burst Gate Horizontal and Vertical Start and 
Stop registers allow the user to program a window 
into which burst can be added. This is useful when 
modulating the outputs of the 827500B. 


Burst Gate Start 
o 
31 
24 
23 
12 
11 
0 
o 100 1 10 0 
VerticalBGStart 
HorizontalBGStart 


- 
Horizontal 
Start 
Position 
in units of T-cycles 
- 
Vertical 
Start 
Position 
in units of full lines 


VB US CODE 
REGISTERS 


The following group of registers are used by the pro- 
grammer to schedule when VBUS transfer or control 
codes are to be sent to the 82750PB by the 
827500B. 


Display Format 
Load Interrupt 


31 
2 23 
1 11 


o 10 0 10 1 1 
VerticalDFLPosition HorizontalDFLPositio 


- 
Horizontal 
Position 
in units of T-cycles 
- 
Vertical 
Position 
in units of full lines 


This is the programmable XY interrupt, used by the 
82750PB to perform a load of the Shadow Copy reg- 
isters. This interrupt is sent on the VBUS when the 
bits 23 to 12 match the current display line position, 
and bits 11 to 0 match the T-cycle count. 


Ox4e 


o 


Ox4d• 


Ox4c 


Ox4b 


o 
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Line Notification 
Timing 
Ox4a 


11 


Horizontal HLlN Position 


This indicates the position on each line to send a 
HLlNE code on the VBUS. The 82750PB requires 
this information to keep track of the current display 
line when drawing graphics. 


Refresh 
and Register 
Transfer 
Ox49 


12 
11 


o 1 0 0 1 0 0 1 
REGX Une Number 
Refresh Horizontal Position 


- 
REFRESH 
horizontal 
timing 
in units of T-cycles 
- 
Register 
Transfer 
Line number 
in units of full lines 


When the T-cycle count matches the value pro- 
grammed into bit 11 to 0 of this register, a refresh 
code is sent to the 82750PB. Since these codes tie 
up the 82750PB for at least eight 82750PB cycles, 
the programmer must ensure that no transfer re- 
quests are scheduled to occur during this time. 


The line number for the next register transfer is 
specified in bits 23 to 12 of this register. If pro- 
grammed to occur, REGX will always be the first 
transfer request sent to the 82750PB, immediately 
after the end of active display. 


COlOR 
REGISTERS 


The following registers specify the state of OBU, 
ORV, OGY, and ALPHA signals during the field. 


Border 
Color 
Ox48 


The 24 bits of data in this register are sent directly to 
the YUV conversion matrix during border time. Bor- 
der time is defined as the region in which neither 
active display nor blanking is programmed to occur. 
The bits should be programmed as RGB values 
when the YUV to RGB matrix is not being used. 
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Alpha Register 
Ox47 


o 


The least significant 8 bits are for the ALPHAOregis- 
ter and are used during blanking and if the alpha trap 
value is not matched. The next 8 bits are for the 
ALPHA1 register when the alpha trap value is 
matched. The most significant 8 bits provide the al- 
pha channel value during the border time. 


Blanking 
Color 
Ox46 


o 


The 24 bits of data in this register are sent directly 
through the YUV conversion matrix during the pro- 
grammed blanking time. 


CONTROL 
REGISTERS 


The following registers are used to define the oper- 
ating modes of the 827500B. 


Plxel Control 
Ox45 


23 22 21 
19 18 
14 13 
11 10 9 
8 
7 6 
0 
[[CJ 
CIIJJJ 
I 
lLL LjJ~~~ 


Pseudo 
16-81: Mode 


240855-17 


Bits 6:Q-SClK 
Delay 


The number "m" 
of T-cycles from initiation of a 
transfer request on the VBUS until the first SCLK is 
asserted by the 827500B. 
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Bit 7-VU 
Interpolation Round 


When 
equal to 0, this bit means 
truncate 
during in- 
terpolation. 
When set to one, this bit means round to 
odd during interpolation. 


Bit 8-Condltlonal 
Interpolation Enable 


When 
reset 
to zero, this bit means 
all values 
of Y 
and U are a full 8 bits of precision. 
When set to one, 
this bit means the least bit of the Y sample or the U 
sample controls 
the switching 
between 
VU interpola- 
tion and graphics 
mode. 


Bit 9-VU 
Interlace Enable 


Setting 
this bit to a one causes 
the interpolator 
to 
output 
different 
data 
on the 
odd 
and 
even 
fields. 
During the odd field, the odd lines of the interpola- 
tion sequence 
will be output. 
During the even field, 
the even lines of the interpolation 
sequence 
will be 
output. 
Full lines of the programmed 
number of sam- 
ples of both the V and U data will be read in during 
each VU transfer. 
Setting this bit to a zero will cause 
horizontally 
and 
vertically 
interpolated 
data 
to 
be 
output on both fields. Only a full line of either V or U 
samples 
will be read in during each transfer 
request 
in this mode. 


Bit 1G-4X VU Expand 


When this bit is set to a zero, a 2X expansion 
in both 
directions 
is performed. 
By setting this bit to a one, a 
4X expansion 
is performed. 


Bits 13:11-VU 
Sample Select 


Table 4-1 provides 
the code and number of V and U 
samples 
for bits 13: 11. 


Table 4·1. VU Sampling 


Code 
Number of V And U Samples 


000 
o Samples 
for Each V and U 


111 
32 Samples 
for Each V and U 


110 
64 Samples 
for Each of V and U 


101 
96 Samples 
for Each of V and U 


100 
128 Samples 
for Each of V and U 


011 
160 Samples 
for Each of V and U 


010 
192 Samples for Each of V and U 


001 
256 Samples for Each of V and U 
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Bits 18:14-Plxel Time 


Table 4-2 lists the codes 
and pixel duration 
for bits 
18:14. 


Table 4·2. Plxel Times 


Code 
Duration of Plxel 


00001 
1.0 T-cycle 


00010 
1.5 T-cycles 


00100 
2.0 T-cycles 


01000 
2.5 T-cycles 


10000 
3.0 T-cycles 


10001 
3.5 T-cycles 


10010 
4.0 T-cycles 


10100 
4.5 T-cycles 


11000 
5.0 T-cycles 


11001 
5.5 T-cycles 


11010 
6.0 T-cycles 


11100 
6.5 T-cycles 


11101 
7.0 T-cycles 


11110 
7.5 T-cycles 


00011 
8.0 T-cycles 


00101 
8.5 T-cycles 


00110 
9.0 T-cycles 


00111 
9.5 T-cycles 


01001 
10.0T-cycles 


01010 
10.5 T-cycles 


01011 
11.0 T-cycles 


01100 
11.5 T-cycles 


01101 
12.0 T-cycles 


01110 
13.0 T-cycles 


01111 
14.0 T-cycles 
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When set to one, each pixel generated by the VU 
Interpolator is held for 2-pixel times. This allows an 
effective 8X expansion of VU data. This is useful for 
high resolution applications where the blanking time 
is not sufficient to support higher VU sample loads. 


Bit 23-Pseudo 
16·Blt Mode 


When set to one and 16 bits per pixel is chosen (bits 
21:19), the 8275008 
is in the 16-bit with Alpha 
mode. Setting this signal to zero while in the 16-bitl 
pixel mode puts the 8275008 into the 16-bit (655) 
mode. This bit represents a "don't care" input for all 
other values of bitlpixel. 


General Control 
Ox44 


1_23 


'....J7IL 
'6 
__ 
'...J3DJIjjjj. 
1 
lllllU~~- 
! 
ResetV8d· 
Set To Zero 


Vd", 
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Bits 4:o-Burst 
Multiple 


These bits are used to program a divisor of the 
FREQIN 
clock 
input 
in 
order 
to 
recover 
the 
3.58 MHz NTSC color subcarrier. The programmed 
value is the two's complement of the desired divisor. 
The allowed range of values is 00000 through 11110 
which corresponds to divisions of 32 through 2. Note 
that the 8275008 must be operating at an integer 
multiple of 3.58 MHz for this to work effectively. 


Bit S-Cursor 
Enable 


When set to one, the hardware cursor will output the 
cursor data at prescribed intervals if programmed to 
do so. 
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These two bits control which output channel is 
muxed onto the alpha digital outputs. It allowsY, U, 
or V data to be available at the alpha channel. The 
coding is provided in Table 4-4. 


Table 4·4. Test Mode Select Cod,ing 


Code 
Alpha Channel Output 


00 
Alpha Channel 


01 
YChannel 


10 
V Channel 


11 
U Channel 


Bit 1O-Sync Test 


This bit must be set to zero for proper operation. 


Bit 11-Gren 


This is the Graphics Enable bit for the Y Interpolator. 
When this bit is set to one, the pixel is a graphics 
pixel, and the switch is zero, a 2X interpolation will 
be performed on the pixel. 


Bit 12-Vlden 


This is the Video Enable bit of the Y Interpolator. 
When this bit is set to one, the pixel is a video pixel, 
and jhe switch is one, a 2X interpolation will be per- 
formed on the pixel. 


Bit 16:13-Vblen 


These bits program the T-cycle length of each V8US 
code. The V8US code length will be one T-cycle 
longer than the programmed value. These bits must 
have a minimum value of 2, and a maximum value of 
15. 
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Miscellaneous Control 
Ox43 


III 
j jllt~~T~v- 


vu _",<>..qu E_ 
.....E_ 
swttchon lS8 or y 
GeriocI<E_ 


Bypass 
ConverIIon 
Ma1rbc 


TransfefT\mlng 
Spit CLUT 
Select 
Graphics Pass 
VIdeoP.,. 


827S008 
Modo 
EnobI. 
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Bits 7:Q-Alpha Trap 


Bits 7:0 are 8-bit values used for comparison with 
the current pixel's Y value, to select one of two pro- 
grammable alpha values. 


Bit 8-Alpha 
Trap Select 


A value of one enables the Y value of the current 
pixel to be compared with the value in the Alpha 
Trap register. If the two values match and Alpha has 
been enabled via the Alpha Enable bit, the contents 
of the ALPHA1 register are output on ALPHA[7:0]. If 
the two values don't match and Alpha Enable has 
been set to one, the content o( the ALPHAOregister 
is output. When Alpha Trap Select is set to a zero in 
the pseudo 16- or 32-bit mode, the most significant 
byte of the pixel word is output. When Alpha Trap 
Select is set to zero in all other modes, the value of 
the ALPHAOregister is output. 


Bit 9-Border 
Alpha Enable 


A value of one enables the eight most significant bits 
in the ALPHA register to be output. When set to a 
zero, the ALPHAO register is output during border 
time. 


Bit 1o-DIgltlze 
Enable 


When this bit is set to a one, the FCD signal will be 
set to a one, and the transfer codes for bitmaps will 
indicate that write operations should occur. 


Bit 11-VU 
Interpolator Output Enable 


This bit enables VU interpolation data to be dis- 
played. When set to a zero, all pixels are treated as 
graphic pixels. 
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Bit 12-Alpha 
Enable 


When set to one, the alpha output is governed by 
the alpha trap value, as described above. When re- 
set to zero, the contents of the ALPHAOregister is 
the alpha output in the 8- and 16-bit modes, and the 
explicit ALPHA data encoded in the pseudo 16- and 
32-bit modes. 


Bit 13-Swltch 
on LS Bit of Y 


When set to one, the least significant bit of Y is used 
as a Video/Graphics switch in all modes. When re- 
set to zero, the least significant bit of U from the 
interpolator acts as a switch. 


Bit 14-Genlock 
Enable 


\ 


This bit enables the genlock mode of the 82750DB. 
In this mode, receipt of the external HRESET11 sig- 
nal during the second half of a scan line will cause 
the termination of that scan line. Similarly, receipt of 
the externally produced VRESET11 signal will termi- 
nate the field. In both cases, terminate denotes that 
the proper on-chip signals are produced to signify 
end of the line and end of the field. 


Bit 15-Bypass 
Conversion Matrix 


When this bit is set to a one the YUV to RGB matrix 
will be bypassed, and the Y, U, and V data will feed 
directly into the Digital to Analog Converters. 


Bit 16-Spllt 
CLUT 


This bit divides the CLUT into an odd and an even 
half, depending on the polarity of the Video/Graph- 
ics switch. This switch is selectable and may be ei- 
ther the LSB of U from the interpolator or Y from the 
pixel word. The LSB of the CLUT address is set to 
one (odd address) if the Video/Graphics switch is 
one; the LSB of the CLUT address is' set to zero 
(even address) if the Video/Graphics switch is zero. 


Bit 17-Graphlcs 
Pass 


Setting this bit to a one bypasses the CLUT for 
graphics pixels, even in non-mixed modes. 


Bit 18-Vldeo 
Pass 


When set to a one, all video pixels (luminance val- 
ues associated with sub-sampled UV values) will by- 
pass the color table. For mixed modes, this corre- 
sponds to the switch flag having a value of one. 
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Bit 20:19-Transfer 
Timing 
Select 


These bits are two-bit codes that select one of three 
possible transfer shift clock rates. This allows the 
operating speed of the 8275008 to be tailored to the 
external memory access time. After RESET, the 
transfer rate is set to the slowest possible clock rate 
(1/3X). 
The programmed rate is used during all non- 


active 
display times for 
transferring 
data 
from 


VRAMs. It also defines the rate that the border and 
blanking data is output. Ouring active display, the 
data is read as needed from VRAM using the pro- 
grammed timing. The coding of these bits is listed in 
Table 4-5. 


Table 4-5. Coding of Transfer 
Timing Select Bits 


Bit 20 
Bit 19 
Result 


0 
0 
113X Transfer (Oefault) 


0 
1 
112X Transfer 


1 
0 
1XTransfer 


Bit 21-827500B 
Enable 


When set to zero, the 8275008 will be the register 
equivalent of a 827500A. When set to a one, all the 
features of the 8275008 will be enabled. 


Bit 22-L1ne 
Replicate 
Enable 


When this bit is set to one, every line in the active 
display is generated twice. Each new bitmap transfer 
occurs at half the line rate, with a new V8US code 
being used to indicate that a transfer is to take place 
without the pitch calculation. The VU Interpolator will 
also duplicate the lines it generates, yielding more 
time between transfer cycles. This mode is useful for 
obtaining a 2X increase in vertical resolution without 
the need for increasing the VRAM transfer band- 
width. 


COLOR 
MAP REGISTERS 


The following registers are used to access and con- 
trol the three 2~6 x 8-bit Golor Lookup Tables. 


Mask Data Registers 
Ox42 
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Each of the three 8-bit registers contains the bit pat- 
tern used when the corresponding bit in the Mask 
Set register is asserted. 


Mask Set Registers 
OX41 


This is a 24-bit register that contains the mask bit 
pattern for the RG8/YUV 
color map addresses. 


When a bit in this register is asserted, the corre- 
sponding bit in the address is set to the value de- 
fined in the Mask Oata registers. 


CLUT Index Register 
Ox40 


The GLUT Index register is an 8-bit register used for 
loading the color tables. This register maps the user- 
specified 6-bit color map address into an 8-bit ad- 
dress. A logical OR operation is performed between 
the 6-bit address and the 8-bit index word to obtain 
the new GLUT address. 


Color Lookup 
Table" Addresses 
OxOO-Ox3f 


If the 8275008 Enable mode bit in the Miscellane- 
ous Control register is set to zero, the GLUT ad- 
dresses are decoded to appear as addresses to the 
reduced-size 827500A color table. The least signifi- 
cant four bits of the address are used for the Y color 
table address, and the upper nibble is used to ad- 
dress the V and U color table simultaneously. This is 
a compatibility mode for the 827500A, which has a 
reduced-size color table. 


If the 8275008 Enable mode bit is set to one, the full 
color table is used. In this case, the most significant 
byte of the 32-bit data word is used as an address to 
the color table. The address is ORed with the most 
recently loaded GLUT Index register. 
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82750D8 
Register Summary 


The following 
table illustrates 
the register 
space of the 827500B. 


Table 4-6. 8275008 
Register 
Space 


Address 
8275008 
Register 


OxOO-OxOf 
CLUT Locations 
0-15 


Ox10-0x30 
CLUT Locations 
16-48 


Ox31 
CLUT Location 
49 


Ox32 
CLUT Location 
50 


Ox33 
CLUT Location 
51 


Ox34 
CLUT Location 
52 


Ox35-0x37 
CLUT Location 
53-55 


Ox38 
CLUT Location 
56 


Ox39-0x3f 
CLUT Location 
57 -63 


Ox40 
CLUT Index Register 


Ox41 
CLUT Mask Set Register 


Ox42 
CLUT Mask Data Register 


Ox43 
Miscellaneous 
Control 


Ox44 
General Control 


Ox45 
Pixel Control 


Ox46 
Blanking Color 


Ox47 
Alpha Register 


Ox48 
Border Color 


Ox49 
Register Transfer 


Ox4a 
Line Notification 
and Timing 


Ox4b 
OFLLoad 


Ox4c 
Burst Gate Start 


Ox4d 
Burst Gate Stop 


Ox4e 
Active Region Start 


Ox4f 
Active Region Stop 


Ox50 
Equalization 
Parameters 


Ox51 
Serration 
Start 


Ox52 
Blanking Start 
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Address 
8275008 
Register 


Ox53 
Blanking Stop 


Ox54 
Sync Start 


Ox55 
Sync Stop 


Ox56 
Base Counters 


Ox57 
Cursor Color 1 


Ox58 
Cursor Color 2 


Ox59 
Cursor Color 3 


Ox5a 
Cursor Control 


Ox5b 
Not Used 


Ox5c 
Not Used 


Ox5d 
Not Used 


Ox5e 
Not Used 


Ox5f 
Not Used 


Ox60 
Not Used 


Ox61 
Not Used 


Ox62 
Not Used 


Ox63 
Not Used 


Ox64 
Not Used 


Ox65 
Not Used 


Ox66 
Not Used 


Ox67 
Not Used 


Ox68 
Not Used 


Ox69-0x6e 
Not Used 


Ox6f 
Not Used 


Ox70 
Not Used 


Ox71-0x7f 
Not Used 


Ox80-0xfe 
Not Used 


Oxff 
Stop Code 


1-105 


• 


u\" 
\"naracterlstlCs 


Table 
5-2. DC Characteristics 
VCC = 5V ± 10%, TCASE = O°C to + 95°C 


Symbol 
Parameter 
Mln 
Typ 
Max 
Unit 
Notes 


Vll 
Input LOW Voltage 
-0.3 
0.8 
V 


VIH 
Input HIGH Voltage 
2.0 
VCC +0.3 
V 


VOl 
Output LOW Voltage 
0.2 
0.4 
V 
10l = 4.0 mA(1) 


VOH 
Output HIGH Voltage 
2.4 
3.0 
V 
10H = -1.0 
mA(1) 


III 
Input Leakage Current 
-10 
+10 
/l-A 
VSS < VIN < vcc 
'oz 
Output Leakage Current 
-10 
+10 
/l-A 
Vss < VIN < VCC 


ICCT 
Power Supply Current 
185 
250 
mA 
28 MHz(2) 


ICCNT 
Power Supply Current 
140 
190 
mA 
28 MHz(3) 


ICCT 
Power Supply Current 
280 
375 
mA 
45 MHz(2) 


ICCNT 
Power Supply Current 
215 
285 
mA 
45 MHz(3) 


CIN 
Input Capacitance 
10.0 
pF 
Fc = 1 MHz(4) 


COUT 
Output Capacitance 
12.0 
pF 
Fc = 1 MHz(4) 


CFREOIN 
FREQIN Input Capacitance 
20.0 
pF 
Fc = 1 MHz(4) 


NorES: 
1. Measured with FREOIN = 7 MHz. 
2. Typical current value measured under typical conditions with the Digital Outputs (DGY. DRV. and DSU) toggling. Maximum 
current value guaranteed with 50 pF maximum output loading. Analog Outputs disabled. 
3. Typical current value measured under typical conditions with the Digital Outputs (DGY, DRV. and DSU) not toggling. 
Maximum current .value guaranteed with 50 pF maximum output loading. Analog Supply Current IACC not included. 
4. Not 100% tested. 
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AC Characteristics 


Table 5-3. AC Characteristics 
at 28 MHz vcc = 5V ± 10%, TCASE = O°C to 95°C, CL = 50 pF 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
7 
28 
MHz 
1X Clock 


t1 
FREQIN Period 
35 
140 
ns 
5-1 


t2 
FREQIN 
High Time 
12 
23 
ns 
5-1 
(Note 1) 


t3 
FREQIN 
Low Time 
12 
23 
ns 
5-1 
(Note 1) 


~ 
FREQIN Fall Time 
4 
ns 
5-1 


ts 
FREQIN Rise Time 
4 
ns 
5-1 


tea 
HSYNC,VSYNC,CSYNC,BG,FCO 
24 
ns 
5-2 
Valid Delay 


teb 
VBUS[3:0j 
Valid Delay 
26 
ns 
5-2 


t7 
RESETB#,VRESET#,HRESET#, 
0 
ns 
5-3 
DISDIG, TESTACT 
Setup 


le 
RESETB# 
13 
ns 
5-3 


t9 
SCLK[1:0j 
Valid Delay High 
14 
ns 
5-4 
1X Mode 


t10 
SCLK[1:0j 
Valid Delay Low 
%t1 + 14 
ns 
5-4 
1X Mode 


t11 
SCLK[1:0j 
Valid Delay 
14 
ns 
5-5,5-6 
1/2X, 
113X Mode 


t12 
DATAIN[31:0j 
Setup 
5 
ns 
5-4, 5-5, 5-6 


t13 
DATAIN[31:0j 
Hold 
5 
ns 
5-4, 5-5, 5-6 


t14 
PIXCLK Valid Delay 
%t1 + 20 
ns 
5-7 
(Note 2) 


t1S 
PIXCLK Valid Delay 
20 
ns 
5-7 
(Note 3) 


t16 
DRV[7:01, 
DGY[7:01, 
DBU[7:01, 
3 
ns 
5-8 
ALPHA[7:01, 
ACTOIS, CB, BPP[Ol, 


BPP[1j 
Output Setup 


t17 
DRV[7:01, 
DGY[7:01, 
DBU[7:01, 
15 
ns 
5-8 
ALPHA[7:01, 
ACTOIS, CB, BPP[Ol, 


BPP[1j 
Output Hold 


t18 
VBUS[3:01, 
SCLK[1:0j, 
FCO, 
30 
ns 
5-9 
(Note 4) 
HSYNC,VSYNC,CSYNC,CB,BG, 
PIXCLK, DRV[7:0j, 
DGY[7:01, 


DBU[7:0j, 
ALPHA[7:01, 
ACTOIS, 
BPP[Ol, BPP[1j 
Float Delay 


t19 
DISDIG, DRV[7:01, 
DGY[7:0j, 
3t1 
ns 
5-10 
DBU[7:0j, 
Digital Output 
Disable Delay 


t20 
DISDIG, DRV[7:01, 
DGY[7:0j, 
3t1 
ns 
5-10 
DBU[7:0j, 
Digital Output Enable 
Delay 


t21 
DISDAC, RV, GY, BU Analog Output 
19 
ns 
5-11 
(Note 6) 
Disable Delay 


t22 
DISDAC, RV, GY, BU Analog 
19 
ns 
5-11 
(Note 6) 
Output 
Enable Delay 
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NOTES: 
1. This assumes a 35ns period. For other speeds, the FREQIN High and Low Times should fall within a 40% to 60% duty 
cycle. 
2. For integer pixel times t14 is the Valid Delay on all assertions of PIXCLK during active display time. 
3. For non-integer pixel times t15 is the Valid Delay on alternating assertions of PIXCLK during active display time. 
4. Not 100% tested. 
5. All AC specifications are measured at the 1.5V crossing point with a 50 pF load. 
6. Analog output delay is guaranteed at the 50% level of the full scale transition with RL = 100n and CL = 25 pF. 
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Table 
5-4. AC Characteristics 
at 45 MHz VCC = 5V ± 10%, TCASE = O·C to 95·C, CL = 50 pF 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
7 
45 
MHz 
1X Clock 


t1 
FREQIN Period 
22 
140 
ns 
5-1 


t2 
FREQIN High Time 
7 
15 
ns 
5-1 
(Note 1) 


t3 
FREQIN Low Time 
7 
15 
ns 
5-1 
(Note 1) 


~ 
FREQIN 
Fall Time 
4 
ns : 
5-1 


ts 
FREQIN 
Rise Time 
4 
ns 
5-1 


tsa 
HSYNC,VSYNC,CSYNC,BG,FCO 
20 
ns 
5-2 
Valid Delay 


tsb 
VBUS[3:0] 
Valid Delay 
26 
ns 
5-2 


t7 
RESETB#,VRESET#,HRESET#, 
0 
ns 
5-3 
DISDIG, TESTACT 
Setup 


t8 
RESETB# 
13 
ns 
5-3 


t9 
SCLK[ 1:0] Valid Delay High 
12 
ns 
5-4 
1X Mode 


t10 
SCLK[1:0] 
Valid Delay Low 
%t1 + 12 
ns 
5-4 
1X Mode 


t11 
SCLK[1:0] 
Valid Delay 
12 
ns 
5-5,5-6 
1/2X, 
1/3X 
Mode 


t12 
DATAIN[31:0] 
Setup 
3 
ns 
5-4, 5-5, 5-6 


t13 
DATAIN[31:0] 
Hold 
3 
ns 
5-4, 5-5, 5-6 


t14 
PIXCLK Valid Delay 
%t1 + 20 
ns 
5-7 
(Note 2) 


t1S 
PIXCLK Valid Delay 
20 
ns 
5-7 
(Note 3) 


t16 
DRV[7:01, DGY[7:0], 
DBU[7:01, 
0 
ns 
5-8 
ALPHA[7:01, 
ACTOIS, CB, BPP[Ol, 


BPP[1]IVUGR 
Output Setup 


t17 
DRV[7:01, DGY[7:0], 
DBU[7:01, 
10 
ns 
5-8 
ALPHA[7:01, 
ACTOIS, CB, BPP[Ol, 


BPP[1]IVUGR 
Output Hold 


t18 
VBUS[3:01, 
SCLK[1:0], 
FCO, 
30 
ns 
5-9 
(Note 4) 
HSYNC, VSYNC, DRV[7:01, 
DGY[7:01, ALPHA[7:01, 
ACTOIS, 
BPP[Ol, BPP[1]IVUGR 
Float Delay 


t19 
DISDIG, DRV[7:01, DGY[7:01, 
3t1 
ns 
5-10 
DBU[7:01, 
Digital Output Disable 
Delay 
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AC Characteristics 
(Continued) 


Table 5·4. AC Characteristics 
at 45 MHz VCC = 5V ± 10%, 
TCASE = O·C to 95·C, 
CL = 50 pF 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t20 
DISDIG, 
DRV[7:0], 
DGY[7:01, 
DBU[7:0], 
Digital 
Output 
3t1 
ns 
5·10 


Enable 
Delay 


t21 
DISDAC, 
RV, 
GY. 
BU Analog 
Output 
Disable 
Delay 
19 
ns 
5-11 
(Note 
6) 


t22 
DISDAC, 
RV, 
GY, 
BU Analog 
Output 
Enable 
Delay 
19 
ns 
5-11 
(Note 
6) 


NOTES: 
1. This assumes 
a 22ns period. 
For other 
speeds, 
the FREQIN 
High and Low Times 
should 
fall within 
a 40% 
to 60% 
duty 
cycle. 
2. For integer 
pixel times 
t14 is the Valid Delay on all assertions 
of PIXCLK 
during 
active 
display 
time. 
3. For non-integer 
pixel times t15 is the Valid Delay on alternating 
assertions 
of PIXCLK 
during 
active 
display 
time. 


4. Not 100% 
tested. 


5. All AC specifications 
are measured 
at the 1.5V crossing 
point with a 50 pF load. 


6. Analog 
output 
delay is guaranteed 
at the 50% 
level of the full scale transition 
with RL = lOOn 
and CL = 25 pF. 
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Figure 5-1. Clock Waveforms 
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Figure 5·2. Output Waveforms 
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Figure 5-3. Input Waveforms 


I 


1-109 


SCLK!1 :01 


DATAIN!31 :01 


240855-24 


fREOIN 


SCLK!1:01 


DATAIN!31 :01 


240855-25 


1-110 


1 


1 


1'-- 


1 
t12: tu 
~ 
------------------~~--------- 


Figure 5·5. 1/2X 
SCLK Mode 


fREOIN 
1 
1 


1 
1 
: '~----_'",' 
3',5V 
'\ 
t 


I 
I 
: 
:""---------------t- 
1 
J 


2 
" t13 
--', 
'--, tll 
1 
1 
--: 
~tll 
~ 


--------------------------------~ 


Figure 5-6. 1I3X SCLK Mode 
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Figure 5-7. PIXCLK Waveforms 
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Figure 5·8. Output Setup and Hold 
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Figure 5-9. TESTACT# Float Delay 
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Figure 
5-10. DISDIG to Digital 
Output 
Delay 
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Figure 
5-11. DISDAC 
to Analog 
Output 
Delay 


Digital to Analog Converter Electrical Characteristics 


Table 
5-5. DAC DC Characteristics 
AVcc = 5V ± 10%; TCASE = O·C to + 95·C 


Symbol 
Parameter 
Mln 
Typ 
Max 
Unit 
Notes 


Iref 
Reference 
Current 
1000 
p.A 


Ifs 
Output Current" 
0.93" 
(255/18.5) 
" Iref 
1.07 " (255/18,5) 
" Iref 
mA 
(Note 1) 
(Full Scale) 


Vfs 
Output Voltage 
1.0 
1.5 
V 
(Full Scale) 


1NL 
Integral Nonlinearity 
1.0 
±3 
LSB 


DNL 
Differential 
±1 
LSB 
Nonlinearity 


'IACC 
Analog Supply 
3" 
Ifs + 8 
mA 
(Note 2) 
Current 


DDTR 
DAC to DAC Tracking 
2.0 
5.0 
% 
(Note 3) 
at Full Scale 


Cout 
Output Capacitance 
12 
pF 
(Note 4) 


NOTES: 
1. Maximum lIs allowed = 14.7 mA, 
2. Maximum IACC allowed = 52,8 mAr Typical value 01 IACC = 3 • Ifs + 6. 
3. Maximum deviation between RV, GY and BU outputs at lullscale output voltage. 
4, Not 100% tested. 
5, All DAC testing done with Irel = 1000 microamps, 
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NOTES: 
1 
1. Maximum 
value 
is lor RL = 1000 
and CL = 25 pF. Defined 
as 10% to 90% 01 lull scale transition. 
2. Assumes 
an 80 MHz filter on output, 
3. Glitch 
energy 
generated 
Irom the influence 
that 2 active 
outputs 
have on an idle output. 


4. DISDIG 
must be tied high. 


5. Assumes 
the use 01 0.1 p.F capacitor 
between 
VGCS 
and AVcc 
and 0.1 p.F and 10 p.F capacitors 
between 
IREFIN 
and 


AVcc· 


8275008 


Table 5-6. DAC AC Characteristics 


Symbol 
Parameter 
Mln 
Typ 
Max 
Unit 
Notes 


tr, tf 
Rise/Fall 
Time 
10 
ns 
(Note 1) 


ClkF 
Clock Feedthrough 
-28 
dB 
(Note 2) 


GlEn 
Glitch Energy 
100 
pV-sec 
(Notes 2,3) 


Skew 
Output Skew 
3 
ns 


Xtlk 
Crosstalk 
200 
pV-sec 
(Note 2) 


"'EFIN 


IREFlN1------.--( 
Ground 
vGCs 
AySst--I---+----ir--------, 
Ground 


8275008 
AyCC 
I-......• 
----+---......• 
~-- ..• 
+5V (AyCC) 


To 
"'onitor 


240855-29 


Rl = 750 
RL = Load Resistance 
Cl = 0.1 p.F 
C2 = 10 p.F 
CL = Load Capacitance 


255 
lIs = -"Irel 
18.5 


VIs = lIs" 
RL 


where: 


o :S: lout 
:S: lIs 
o :S: Vout 
:S: VIs 


Tr = rr ::::3 " RL (CL + Cour) 


Figure 5-12. Typical Output Configuration 
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Output Delay and Rise Time versus Load Capacitance 


50 
75 
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C; (picofarads) 
240855-30 


nom +6 
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V 


.// 
1/v 
/ 


nom+4 


Typical 
Output 


Delay 
(ns) 


nom+2 


nom-2 


NOTE: 
This graph will not be linear outside 
of the CL range 
shown. 
nom = nominal 
value 
given in AC Characteristics 
table. 


Figure 5-13. Typical 
Output Valid Delay versus Load Capacitance 
under Worst Case Conditions 
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Figure 5-14. Typical 
Output Rise Time versus Load Capacitance 
under Worst Case Conditions 
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6.0 
MECHANICAL 
DATA 


Packaging Outlines and Dimensions 


Intel packages 
the 82750DB 
in a Plastic Quad Flat 
Pack (PQFP). Table 6-1 gives the symbol 
list for the 
PQFP. 


Table 6-1. PQFP Symbol 
List 


Letter or 
Description 
of Dimensions 
Symbol 


A 
Package 
Height: Distance 
from 
Seating Plane to Highest Point of 
Body 


A1 
Standoff: 
Distance from Seating 
Plane to Base Plane 


DIE 
Overall Package 
Dimension: 
Lead 
Tip to Lead Tip 


D1/E1 
Plastic Body Dimension 


D2/E2 
Bumper Distance 


D3/E3 
Footprint 


D4/E4 
Foot Radius Location 


L1 
Foot Length 


N 
Total Number of Leads 


The PQFP has the following 
specifications: 


1. All dimensions 
and tolerances 
conform 
to ANSI 
Y14.5M-1982. 


2. Datum plane-H-is 
located 
at the mold parting line 
and coincident 
with the bottom 
of the lead where 
lead exits plastic 
body. 


8275008 


3. Datums 
A-B and -D- are to be determined 
where 
center 
leads exit plastic 
body at datum plane -H-. 


4. Controlling 
dimension 
is the inch. 


5. Dimensions 
D1. D2. E1. and E2 are measured 
at 


the mold parting line and do not include mold pro- 
trusion. 
Allowable 
mold 
protrusion 
is 0.18 
mm 
(0.007 in.) per side. 


6. 
Pin 1 identifier 
is located 
within 
one of the two 
zones indicated. 


7. Measured 
at datum plane -H-. 


8. Measured 
at seating 
plane datum -C-. 


Table 
6-2 
provides 
outline 
characteristics 
for 
0.025-in. 
pitch. 


Table 6-2. Intel Case Outline 
Drawings 
for PQFP at 0.025 Inch Pitch 


Symbol 
Description 
Mln 
Max 


N 
Leadcount 
132 
132 


A 
Package Height 
0.160 
0.180 


A1 
Standoff 
0.020 
0.040 


D.E 
Terminal Dimension 
1.070 
1.090 


D1.E1 
Package Body 
0.947 
0.953 


D2. E2 
Bumper Distance 
1.097 
1.103 


D3. E3 
Lead Dimension 
0.800 REF 
0.800 REF 


D4. E4 
Foot Radius 
1.023 
1.037 
Location 


L1 
Foot Length 
0.020 
0.030 


mm (inch) 
240855-32 


Figure 6-1. Principal 
Dimensions 
of the 8275008 
in the 132-Lead 
PQFP Package 
I 
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Figure 6-3. 132-Lead PQFP Mechanical Package Detall-Protectlve 
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Figure 6-4. Detailed Dimensions of the 82750DB In the 132-Lead PQFP Package-Molded 
Details 
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Figure 6-5. Detailed Dimensions of the 82750D8 in the 132-Lead PQFP Package-Terminal 
Details 
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Package Thermal Specifications 


The 
8275008 
is specified 
for 
operation 
when 
TC 
(the case temperature) 
is within the range of ooe to 
95°. Tc may be measured 
in any environment 
to de- 
termine 
whether 
the 8275008 
is within specified 
op- 
erating range. The case temperature 
should be mea- 
sured at the center 
of the top surface. 


TA 
(the 
ambient 
temperature) 
can 
be 
calculated 
from (JCA(thermal 
resistance 
from case to ambient) 
with the following 
equation: 


TA = T C - 
P • DCA 


Typical 
values 
for (JCAat various 
airflows 
are given 
in Table 
6-3 for the 132-lead 
PQFP package 
when 
using the digital outputs. 
Table 6-4 shows the maxi- 
mum TA allowable 
(without 
exceeding 
Tc> at various 
airflows. 
The power 
dissipation 
(P) is calculated 
by 
using the typical 
supply currents 
at 5V as shown 
in 
Table 5-2. 


Similarly, 
when using the analog 
outputs, 
the maxi- 
mum TA allowed 
is a function 
of Ifs. The equation 
for 
calculating 
the power is given in the following 
equa- 
tion which can then be used in calculating 
the maxi- 
mum TA. 


. 
P = 5V • (lcCNT + (3 • Ifs + 6)) 


Table 
6-3. Thermal 
Reslstances 
("C/W) 


(JCAVersus 
Alrflow-ft/mln 
(m/sec) 


Package 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.02) 
(3.04) 
(4.06) 
(5.07) 


132-Lead 
PQFP 
26.0 
17.5 
14.0 
11.5 
9.5 
8.5 


Table 6-4. Maximum 
TA at Various 
Airflows 
("C) 


TA Versus 
Alrflow-ft/mln 
(m/sec) 


Package 
Frequency 
0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


132-Lead 
PQFP 
28 
71 
79 
82 
84 
86 
87 


45 
59 
71 
75 
79 
82 
83 
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82750LH 
HOST INTERFACE 
GATE 
ARRAY 
intel~ 


tions. It performs Power-On-Self-Test (pOSn ROM 
Decode, DVI Bus Request Arbitration, VGA DAC 
Support, Interrupt Sharing Hardware Support and 
provides General Purpose Inputs and Outputs. 


The 82750LH is fabricated on a 1.2IJ- double metal 
layer CMOS' technology and is packaged in a 160· 
lead PQFP. The 82750LH is designed to run at a 
maximum frequency of 25 MHz. 


The 82750LH or HIGA is the Host Interface Gate 
Array for use in DVI@Systems. Its primary function is 
to interface the host bus (MicroChannel for PS/2· 
based systems, ISA for Af-based systems) to the 
DVI bus. The HIGA also serves five secondary tunc- 
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with Texas 
Instruments 
internal 
standards. 
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1.0 
PIN DESCRIPTIONS 


1.1 Pinout 


82750LH 


Vee 
GNO 
104031 
104030 
104029 
104028 
104027 
104026 
104025 
MOH 
GNO 
104023 
104022 
104021 
104020 
104019 
104018 
104017 
104016 
GNO 
"cc 
104015 
10401. 
104013 
104012 
104011 
104010 
10409 
10408 
GNO 
MOO 
10401 
10402 
10403 
MD. 
10405 
10406 
10407 
Vee 
Ne 


TESTPIN 
VA2 
102 
101 
100 
Ne 
HOST# 
ROSW# 
GENOe 
GENOB 
GENOA 
Ne 
A23 
A22 
A21 
A20 
A19 
A1B 
A17 
Vee 
GNO 
A16 
A15 
Al. 
A13 
A12 
All 
Al0 
A9 
AB 
A7 
A6 
A5A. 
A3 
A2 
Al 
AO 
GNO 
"cc 


241344-2 


I 


82750LH - HIGA 
TOP VIEW 


Figure 1-1. 82750LH 
Plnout 
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"'V 
::1.3 
A11 
94 
A12 
95 
A13 
96 
A14 
97 
A15 
98 
A16 
99 
A17 
102 
A18 
103 
A19 
104 
A20 
105 
A21 
106 
A22 
107 
A23 
108 
AREQ# 
135 
ATMODE 
77 
AUDIO# 
153 
BDRESET# 
156 
BENO# 
149 
BEN1# 
150 
BEN2# 
151 
BEN3# 
152 
BHE# 
62 
BREQ# 
144 
BUSEN# 
139 
CAPT# 
158 
CDDS16# 
68 
CDSETUP# 
79 
CDSFDBK# 
71 
CHRDY 
69 
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\;11::1'1114 
1<:( 
GENOA 
110 
GENOB 
111 
GENOC 
112 
GND 
2 
GND 
11 
GND 
20 
GND 
30 
GND 
41 
GND 
51 
GND 
61 
GND 
70 
GND 
82 
GND 
100 
GND 
121 
GND 
141 
HOST# 
114 
HRDY# 
136 
IDO 
116 
ID1 
117 
ID2 
118 
INTA# 
133 
INTB# 
132 
INTC# 
131 
INTO 
66 
INT1 
65 
INT2 
64 
INT3 
63 
MADE24 
78 
MDO 
31 


MU11 
26 
MD12 
25 
MD13 
24 
MD14 
23 
MD15 
22 
MD16 
19 
MD17 
18 
MD18 
17 
MD19 
16 
MD20 
15 
MD21 
14 
MD22 
13 
MD23 
12 
MD24 
10 
MD25 
9 
MD26 
8 
MD27 
7 
MD28 
6 
MD29 
5 
MD30 
4 
MD31 
3 
MSTRB# 
138 
M_IO 
73 
NC 
40 
NC 
80 
NC 
109 
NC 
115 
NC 
143 
NC 
160 
POST# 
157 


SD7 
52 
SD8 
49 
SD9 
48 
SD10 
47 
SD11 
46 
SD12 
45 
SD13 
44 
SD14 
43 
SD15 
42 
SO# 
75 
sa 
74 
TESTPIN 
120 
VA2 
119 


VCC 
1 


VCC 
21 


VCC 
39 


VCC 
50 


VCC 
60 
VCC 
81 


VCC 
101 


VCC 
122 


VCC 
140 
VGARD# 
129 
VGAWR# 
130 
VRAM# 
146 
VREG# 
145 
VWE# 
147 
V1RST# 
155 
V2RST# 
154 
V1VALEN 
148 
I 


Location 
Pin 
Name 
1 
VCC 
2 
GNO 
3 
M031 
4 
M030 
5 
M029 
6 
M028 
7 
M027 
8 
M026 
9 
M025 
10 
M024 
11 
GNO 
12 
M023 
13 
M022 
14 
M021 
15 
M020 
16 
M019 
17 
M018 
18 
M017 
19 
M016 
20 
GNO 
21 
Vcc 
22 
M015 
23 
M014 
24 
M013 
25 
M012 
26 
M011 
27 
M010 
28 
M09 
29 
M08 
30 
GNO 
31 
MOO 
32 
M01 
33 
M02 
34 
M03 
35 
M04 
36 
M05 
37 
M06 
38 
M07 
39 
VCC 
40 
NC 


I 


Table 1-2. Pin Cross Reference 
by Pin Number 


Location 
Pin 
Name 
41 
GNO 
42 
S015 
43 
S014 
44 
S013 
45 
S012 
46 
S011 
47 
S010 
48 
S09 
49 
S08 
50 
VCC 
51 
GNO 
52 
S07 
53 
S06 
54 
S05 
55 
S04 
56 
S03 
57 
S02 
58 
S01 
59 
SOO 
60 
VCC 
61 
GNO 
62 
BHE# 
63 
INT3 
64 
INT2 
65 
INT1 
66 
INTO 
67 
RFSH# 
68 
COOS16# 
69 
CHROY 
70 
GNO 
71 
COSFOBK# 
72 
RESET 
73 
M 
10 
74 
S1# 
75 
SO# 
76 
CMO# 
77 
ATMOOE 
78 
MAOE24 
79 
COSETUP# 
80 
NC 


Location 
Pin 
Name 
81 
VCC 
82 
GNO 
83 
AO 


,84 
A1 
85 
A2 
86 
A3 
87 
A4 
88 
A5 
89 
A6 
90 
A7 
91 
A8 
92 
A9 
93 
A10 
94 
A11 
95 
A12 
96 
A13 
97 
A14 
98 
A15 
99 
A16 
100 
GNO 
101 
VCC 
102 
A17 
103 
A18 
104 
A19 
105 
A20 
106 
A21 
107 
A22 
108 
A23 
109 
NC 
110 
GENOA 
111 
GENOB 
112 
GENOC 
113 
ROSW# 
114 
HOST# 
115 
NC 
116 
100 
117 
101 
118 
102 
119 
VA2 
120 
TESTPIN 
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Location 
Pin 
Name 
121 
GNO 
122 
Vcc 
123 
GENIO 
124 
GENI1 
125 
GENI2 
126 
GEBI3 
127 
GENI4 
128 
GAVAlEN 
129 
VGARO# 
130 
VGAWR# 
131 
INTC# 
132 
INTB# 
133 
INTA# 
134 
CREQ# 
135 
AREQ# 
136 
HROY# 
137 
OSTRB# 
138 
MSTRB# 
139 
BUSEN# 
140 
VCC 
141 
GNO 
142 
ClK 
143 
NC 
144 
BREQ# 
145 
VREG# 
146 
VRAM# 
147 
VWE# 
148 
V1VAlEN 
149 
BENO# 
150 
BEN1# 
151 
BEN2# 
152 
BEN3# 
153 
AUOIO# 
154 
V2RST# 
155 
V1RST# 
156 
BORESET# 
157 
POST # 
158 
CAPT# 
159 
OROY 
160 
NC 


• 
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Figure 1-2. 82750LH 
Functional 
Signal Groupings 


I 


GENIO 
I 
GENIO is a generic input pin whose state can be read through 
General Status 
Register 
bit 4. GENIO has an internal pull up resistor of approximately 
70 KO. 


GENI1 
I 
GENI1 is a generic input pin whose state can be read through the General Status 
Register 
bit 5. GENI1 has an internal pull up resistor of approximately 
70 KO. 


GENI2 
I 
GENI2 is a generic input pin whose state can be read through 
General Status 
Register 
bit 6. GENI2 has an internal pull up resistor of approximately 
70 KO. 


GENI3 
I 
GENI3 is a generic 
input pin whose state can be read through the General Status 
Register bit 7. GENI3 has an internal pull up resistor of approximately 
70 KO. 


GENI4 
I 
GENI4 is a generic input pin whose state can be read through the General Status 
Register 
bit O. GENI4 has an internal pull up resistor of approximately 
70 KO. 


GENOA 
0 
GENOA is a generic output from the HIGA that is controlled 
by the General Control 
Register bit 4. 


GENOB 
0 
GENOB is a generic output from the HIGA that is controlled 
by the General Control 
Register 
bit 5. 


GENOe 
0 
GENOe 
is a generic output from the HIGA that is controlled 
by the General Control 
Register 
bit 2. 


Symbol 
Type 
Name and Function 


ID[2:0) 
I 
IDO, ID1 and ID2 are the DVI Device ID input pins used by the HIGA to decode 
the 
DVI Devices internal to the HIGA. The HIGA uses DVI Device 3 and DVI Device 2 
(POST ROM). IDO, ID1 and ID2 have internal pull up resistors 
of approximately 
70 KO. 


VA2 
I 
VA2 is used to decode 
DVI Device 3 into two 32-bit registers. 


MSTRB# 
I 
MSTRB# 
is the Memory Strobe signal used to indicate the end of a memory cycle. 


MSTRB # is also used to indicate when data can be read from the DVI Data Bus. 
The HIGA samples the DVI Data Bus on the rising edge of Cl.K during MSTRB #. 
This signal has an internal pull up resistor of approximately 
70 KO. 


DSTRB# 
I 
DSTRB # is the DVI Device Strobe signal used to indicate a DVI Device cycle on the 
DVI Bus. MSTRB#is 
still used to indicate when data can be read from the DVI Data 
Bus, even though the bus cycle is a DVI Device Bus cycle. The HIGA samples 
DVI 
Bus data on the rising edge of Cl.K during MSTRB#. 
This signal has an internal 
pull 
up resistor of approximately 
70 KO. 


BUSEN# 
I 
BUSEN # is the Bus Enable signal on the DVI Bus. The HIGA uses BUSEN # as an 
indicator 
to know when the DVI Bus has been given up by the 82750PB. 
This signal 
has an internal pull up resistor of approximately 
70 KO. 


Table 
1-4. DVI Bus Signal Pin Descriptions 


I 
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Symbol 
Type 
Name and Function 


BREQ# 
0 
BREQ# 
is a Bus Cycle Request 
signal to the 82750PB. 
The 82750PB 
responds 
with a BUSEN # to indicate that the request is being serviced. 


HRDY# 
I 
HRDY # is the Host Ready signal from the 82750PB. 
The HIGA uses HRDY # to 
drop the BREQ# 
signal to the 82750PB. 
This signal has an internal pull up resistor 
of approximatley 
70 KO. 


VRAM# 
0 
VRAM # is the Bus Cycle Type Indicator signal for the 82750PB. 
When VRAM # is 
active with a bus request (BREQ#) 
then the DVI Bus cycle will be a VRAM access 
or a DVI Device Access 
(as opposed 
to an 82750PB 
register access). 


VREG# 
0 
VREG # is a bus cycle type indicator 
for the 82750PB. 
When VREG # is active with 
a bus request (BREQ#) 
then the DVI Bus cycle will be an 82750PB 
register 
access. The VREG # signal is always the opposite 
of VRAM #. 


AREQ# 
I 
AREQ# 
is the Audio Bus Request 
signal on the DVI Bus. The HIGA arbitrates 
the 
external 
requests and the internal requests 
and presents 
the BREQ # signal to the 
82750PB. 
This signal has an internal pull up resistor of approximately 
70 KO. 


CREQ# 
I 
CREQ# 
is the Capture/SCSI 
Bus Request 
signal on the DVI Bus. The HIGA 
arbitrates 
the external 
requests and the internal requests 
and presents 
the 
BREQ# 
signal to the 82750PB. 
This signal has an intemal pull up resistor of 
approximately 
70 KO. 


AUDIO# 
0 
AUDIO# 
is the arbitrated 
output from the HIGA's internal arbitration 
logic. The 
HIGA arbitrates 
the Audio request, Capture/SCSI 
request and the HIGA internal 
requests 
and decides who should respond to the next DVI Bus cycle with 
BUSEN # active. AUDIO # active indicates 
that AREQ # is currently 
being serviced 
or will be serviced 
with the next BUSEN #. 


HOST # 
0 
HOST # active indicates 
that an internal request is currently 
being serviced 
or will 
be serviced 
with the next BUSEN #. 


CAPT# 
0 
CAPT# 
active indicates 
that CREQ# 
is currently 
being serviced 
or will be serviced 
with the next BUSEN #. 


ClK 
I 
ClK 
is a CM OS compatible 
clock signal for the HIGA. The HIGA uses ClK 
to 
synchronize 
events on the host bus to events on the DVI Bus. All synchronous 
events occur on the rising edge of ClK. 


MD[31:0] 
B 
MD[31 :0] are the DVI Bus Data Bus signals used to transfer 
data between 
the 
HIGA and VRAMor 
other DVI Devices. At the beginning 
of DVI Bus cycles, the 
MD[31 :0] lines carry the address information. 
These signals have internal pull up 
resistors 
of approximately 
70 KO. 


V1VAlEN 
0 
V1VAlEN 
is the VRAM latch 
Enable signal input to the 82750PB. 
While V1VAlEN 
is high and BUSEN# 
is low, address information 
is presented 
on the MD[31:0] 
data lines. 


GAVAlEN 
0 
GAVAlEN 
is the VRAM Address 
Latch Enable signal input to devices on the DVI 
Bus. GAVAlEN's 
high to low transition 
lags V1 VAlEN 
by one clock period. 


BEN [3:0] # 
B 
BENO # through 
BEN3 # are the Byte Enable signals on the DVI Bus. They define 
which byte or bytes are involved 
in the DVI Bus cycle. BENO# 
signals the use of 
the least significant 
byte, or data lines MD[7:0]. 
These signals have internal pull up 
resistors 
of approximately 
70 KO. 


VWE# 
B 
VWE # is the read/write 
direction 
control for the DVI Bus. When VWE # is low, the 
bus cycle is a write. This signal has an internal pull up resistor of approximately 
70 KO. 


Table 
1-4. DVI Bus Signal 
Pin Descriptions 
(Continued) 
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Symbol 
Type 
Name and Function 


RDSW# 
0 
RDSW # is used to gate the switch information 
onto the DVI Bus data bus lines 
MD[31:0j. 
The trailing edge of RDSW# 
(Iow to high) latches the information 
into 
the HIGA. MD[7:0jlines 
are latched 
into the 1/0 Port Switch Register while the 
MD[15:8jlines 
are latched into the POST Address 
Switch Register. The HIGA 
uses this information 
to configure 
the 1/0 Port Address 
and POST ROM Address 
while in the ATMODE 
of operation. 


POST # 
0 
POST # is the chip select output for the POST ROM on the DVI Bus. POST # is a 
decode 
of DVI Device 2, DSTRB # and not VWE #. Since POST is an eight bit 
device, the HIGA does not return the CDDS16# 
(or MEMCS16#) 
signal during 
POST accesses. 
Additionally, 
special steering 
logic inside the HIGA logically 
connects 
the MD[7:0jlines 
to the SD[7:0jlines 
regardless 
of the byte address. 
Therefore, 
consecutive 
byte accesses 
do not progress 
from MD[7:0j 
to MD[15:8j 
to MD[23:161, 
etc. when accessing 
the POST ROM. 


BDRESET# 
0 
BDRESET # is the Board Reset signal controlled 
by the General Control 
Register 
bit O. BDRESET # is also applied when the RESET input is active. Besides driving 
the output pin low, BDRESET # also resets the HIGA internal registers, 
state 
machine 
and FIFO logic. The POS and PAR registers 
are not affected 
by Board 
Reset. BDRESET # should be held low for at least 10 /Ls to guarantee 
a full board 
reset. 


V1RST# 
0 
V1 RST # is the 82750PB 
Reset signal controlled 
by the General Control 
Register 


bit 1. V1 RST is also applied when the RESET input is active. 


V2RST# 
0 
V2RST # is the 82750DB 
Reset signal controlled 
by the General Control 
Register 
bit 7. 


Table 
1-4. DVI Bus Signal 
Pin Descriptions 
(Continued) 


Symbol 
Type 
Name and Function 


ATMODE 
I 
ATMODE 
configures 
the HIGA for an AT or ISA type interface 
for the host bus 
when high. When low, ATMODE 
configures 
the host interface 
for the 
MicroChannel 
type interface. 
This signal has an internal pull up resistor of 
approximately 
70 KO. 


RESET 
I 
RESET is the main reset signal for the HIGA. This signal has an internal pull up 
resistor of approximately 
70 KO. 


SD[15:0j 
B 
SD[15:0j 
are the host data bus signals used to transfer 
data between 
the host and 
the HIGA. These signals have internal pull up resistors 
of approximately 
70 KO. 


A [23:0j 
I 
A[23:0j 
are the host address pins. The address signals are used to access the 
registers 
internal to the HIGA as well as the VRAM and DVI Devices on the DVI 
Bus. These signals have intemal pull up resistors 
of approximately 
70 KO. 


M_IO 
I 
M_IO 
is the memory or 1/0 cycle indicator 
signal on the MicroChannel 
Bus when 
(IOWRC#) 
the HIGA is in the non·ATMODE. 
M_IO 
is the ISA 10WRC# 
signal when the HIGA 
is in the ATMODE 
of operation. 


SO# 
I 
SO# 
is the Status 0 signal on the MicroChannel 
Bus when the HIGA is in the non- 
(MRDC#) 
ATMODE. 
SO# is the ISA MRDC# 
signal when the HIGA is in the ATMODE 
of 
operation. 


SU 
I 
S1 # is the Status 1 signal on the MicroChannel 
Bus when the HIGA is in the non- 
(MWRC#) 
ATMODE. 
SO# is the ISA MWRC# 
signal when the HIGA is in the ATMODE 
of 
operation. 


MADE24 
I 
MADE24 
is the below 16M memory cycle indicator 
signal on the MicroChannel 
Bus 
(IORDC#) 
when the HIGA is in the non·ATMODE. 
MADE24 
is the ISA 10RDC# 
signal when 
the HIGA is in the ATMODE 
of operation. 


Table 
1-5. Host Bus Signal 
Pin Descriptions 


I 
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VI 
the POS4 register in the HIGA. This signal has an internal pull up resistor 
of 
approximately 
70 KO. 


INTB# 
I 
INTB# 
is the Audiolnterrupt 
signal on the DVI Bus. The INTB# 
signal can be 
steered to one of the INT1 or INT2 outputs through 
programming 
of the POS4 
register in the HIGA. This signal has an internal pull up resistor of approximately 
70 KO. 


INTC# 
I 
INTC# 
is the Capture/SCSI 
Interrupt 
signal on the DVI Bus. INTC# 
can be 
steered to one of the INn 
or INT2 outputs through 
programming 
of the POS4 
register in the HIGA. This signal has an internal pull up resistor of approximately 
70 KO. 


CDDS16# 
0 
CDDS16# 
is the Card Data Size16 signal on the MicroChannel 
Bus when the 
(IOCS16#) 
HIGA is in the non-ATMODE 
of operation. 
CDDS16# 
is the ISA IOCS16# 
signal when the HIGA is in the ATMODE. 


CDSFDBK# 
0 
CDSFDBK# 
is the Card Select Feedback 
signal on the MicroChannel 
Bus when 
(MEMCS16#) 
the HIGA is in the non-ATMODE 
of operation. 
CDSFDBK# 
is the ISA 
MEMCS16# 
signal when the HIGA is in the ATMODE. 


CHRDY 
0 
CHRDY is the Channel 
Ready signal on the MicroChannel 
Bus when the HIGA 
(IOCHRDY) 
is in the non-ATMODE 
of operation. 
CHRDY is the ISA IOCHRDY 
signal when 
the HIGA is in the ATMODE. 


INT[3:0) 
0 
INTO through 
INT3 are the four interrupt 
output pins on the HIGA. In the non- 
ATMODE 
of operation 
the INT outputs are active low, open collector 
type 
outputs. 
In the ATMODE 
of operation 
the INT outputs are active high, totem 
pole type outputs. 


Table 
1-6. VGA Support Signal Pin Descriptions 


Symbol 
Type 
Name and Function 


VGARD# 
0 
VGARD# 
is the VGA DAC Read signal. VGARD# 
is active during host reads of the 
VGA DAC Test Registers. 
Note that the HIGA does not drive the SD[15:0) 
lines 
during VGARD# 
cycles, but does provide the other host timing necessary 
for the 
cycle. 


VGAWR# 
0 
VGAWR # is the VGA DAC Write signal. VGAWR # is active during host writes of 
the VGA DAC Registers. 
Note that the HIGA does not accept any data from the 
SD[15:0) 
lines during VGAWR# 
cycles. 


1-130 
I 


82750LH 


Table 
1-7. Output Pins" 
Table 
1-8. Input Pins 


Name 
Active 
Level 


INT[3:0] 
• 


CDDS16# 
Low 


CHRDY 
High 


CDSFDBK# 
Low 


GENO[C:A] 
High 


RDSW# 
Low 


HOST# 
Low 


GAVALEN 
High 


VGARD# 
Low 


VGAWR# 
Low 


BREQ# 
Low 


VREG# 
Low 


VRAM# 
Low 


V1VALEN 
High 


AUDIO# 
Low 


V2RST# 
Low 


V1RST# 
Low 


BDRESET# 
Low 


POST# 
Low 


CAPT# 
Low 


Name 
Active 
Sync/Async 
Level 


BHE# 
Low 
Async 


RFSH# 
Low 
Async 


RESET 
High 
Async 


M_IO 
High 
Async 
sa 
Low 
Async 


SO# 
Low 
Async 


CMD# 
Low 
Async 


ATMODE 
High 
Async 


MADE24 
High 
Async 


CDSETUP# 
Low 
Async 


A[23:0] 
High 
Async 


ID[2:0] 
High 
Async 


VA2 
High 
Sync 


GENI[4:0] 
High 
Async 


INT[C:A] 
Low 
Async 


CREQ# 
Low 
Async 


AREQ# 
Low 
Async 


HRDY# 
Low 
Sync 


DSTRB# 
Low 
Sync 


MSTRB# 
Low 
Sync 


BUSEN# 
Low 
Sync 


CLK 
High 
Sync 


DRDY# 
Low 
Async 


NOTE: 
•Active 
high in ATMODE. 


Active 
low in non-ATMODE. 


Table 1-9. Bidirectional 
Pins 


Name 
Active 
When Floated" 
Sync/Async 
Level 


MD[31:0] 
High 
VRAM Read, DVI Device Read 
Sync 


SD[15:0] 
High 
Host Reads 
Async 


VWE# 
Low 
Sync 


BEN[3:0] 
# 
Low 
Sync 


NOTE: 
•• All output 
and bidirectional 
pins are floated 
when TESTPIN 
and RESET 
are asserted 
together. 


I 


• 
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2.0 
INTERNAL ARCHITECTURE 


2.1 
DVIIHost Bus Interface 


2.1.1 
DVI BUS OVERVIEW 


The primary function of the HIGA is to interface the 
host bus (MicroChannel for PS/2-based systems, 
ISA for AT·based systems) to the DVI Bus. The DVI 
Bus connects VRAM, the 82750PB, Audio and host 
together through one common address and data 
path. The 82750PB normally owns the DVI Bus. The 
HIGA is designed to run at the same frequency as 
the 82750PB. 


The data bus from the host passes through the 
HIGA before reaching the DVI Bus and also creates 
an intermediate path for passing around the HIGA. 
This allows for two types of accesses to compo- 
nents in the DVI system: accesses that require the 
use of the DVI Bus and those that do not. The two 
types of accesses are classified into a Slow Access 
Group and a Fast Access Group respectively. When 
the HIGA is running at 25 MHz a Fast Access takes 
about 400 ns (10 cycles) to complete while a Slow 
Access can take up to 3000 ns (75 cycles) to com- 
plete. 


The DVI Bus supports up to 16 Mbytes of address 
space. The first 15 Mbytes of address space is re- 
served for RAM while the last 1 Mbyte of address 
space is reserved for communication between de- 
vices on the DVI Bus. The last 1 Mbyte of address 
space is divided into eight 128 Kbyte areas with 
each device assigned to one of the eight areas as a 
DVI Device ID. DVI Device ID 0 is the first 128K area 
above 15 Mbytes and DVI Device ID 7 is the last 
128 Kbyte area. The following DVI Device ID assign- 
ments have been made: 
Device ID 7 -82750PB 
Registers via the Host Bus 
Device ID 6 - 
Capture Subsystem Registers 
Device ID 5 - 
Audio Subsystem, Genlock and Key- 
ing Registers 
Device ID 4 - 
CDROM Subsystem Registers 
Device ID 3 - 
Reserved 
Device ID 2 - 
Power-On-Self-Test (POST) ROM 
Device ID 1 - 
Reserved 
Device ID 0 - 
Reserved 
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2.1.2 
EXPANDED 
MEMORY 
SPACE 
HOSTI 
VRAM 
CONNECTION 


VRAM is seen by the host in memory space by a 
method 
known as 
EMS or 
Expanded Memory 
Space. The EMS method of expanding memory 
uses a technique in which memory is paged mapped 
through a "window" of the host's address space. 


The HIGA supports 13 EMS window options ranging 
in size from 0 bytes to 16 Mbytes. The EMS window 
options that are supported are 0 bytes, 8 Kbytes, 
16 Kbytes, 32 Kbytes, 64 Kbytes, 128 Kbytes, 
256 Kbytes, 512 Kbytes, 1 Mbytes, 2 Mbytes, 
4 Mbytes, 8 Mbytes, and 16 Mbytes. The EMS win- 
dow size and starting window location are estab- 
lished by setting bits in the Configuration Registers. 


A window's starting address must be a multiple of its 
size. As an example, a 128 Kbyte window must start 
on a 128 Kbyte boundary in the host's memory ad- 
dress space. Each EMS window is divided into four 
equally sized pages which have corresponding Page 
Address Registers (PARs). The PARs are used to 
map the logical (window) address space of the host 
independently into four pages of physical addresses 
in VRAM (with page aligned starting addresses). 


2.1.3 
HOST/VRAM 
FIFO CONNECTION 


There are four FIFOs connecting the host to VRAM 
in addition to the EMS connection. The FIFOs are in 
the host's I/O address space whereas the EMS win- 
dow is in the host's memory address space. The 
FIFOs are divided into four types and there is one of 
each type available: a 16-Bit-Read FIFO, a 16-Bit- 
Write FIFO, a 32-Bit-Read FIFO and a 32-Bit-Write 
FIFO. A 16-bit FIFO will only access 16-bit words in 
the VRAM address space and is useful for access- 
ing 82750PB registers. A 32-bit FIFO can access 
32-bit words in the VRAM address space and is use- 
ful for accessing VRAM in high-performance modes 
or through Host DMA channels. All FIFOs can be 
operated in an Auto-increment Mode in which the 
VRAM address is incremented after each access 
through the FIFO. 


I 


2.1.4 
DVI DEVICE 
ACCESS 


In order to allow accesses to DVI Devices (which are 
memory mapped) or to setup a FIFa without having 
to change the Page Address Registers, a Quick Ac· 
cess method is provided. The Quick Access method 
is selected by writing the DVI Device ID to a control 
register via an I/O operation. In this mode, the next 
memory access through the PARs (not FIFOs) will 
have the physical address modified to point to the 
selected DVI Device. The DVI Device ID must be 
rewritten for the next access through this path, even 
if the previous access was to the same DVI Device. 
This method reduces the overhead for accessing a 
DVI Device to one Fast Access and one Slow Ac· 
cess operation, without disturbing the PARs. Note 
that the Quick Access is armed for the next memory 
access through any of the PARs, regardless of the 
source of the access (Host or DMA). 


2.2 
Power-On-Self- 
Test ROM 


The HIGA provides support for configuring and en- 
abling/disabling 
a Power-On-Sett-Test 
ROM. The 
POST ROM, when installed and enabled, will be ex- 
ecuted after the host's POST but before the system 
boots and may contain a self-test program and any 
special configuration programs. 


2.3 
DVI Bus Request Arbitration 


The HIGA arbitrates all of the requests for the DVI 
Bus and asserts the acknowledge for the highest 
priority request. The possible requests are: EMS 
from the HIGA's EMS logic, FIFa from the HIGA's 
internal FIFa logic, CAPT from the external CREQ# 
pin, AUDIO from the external AREQ# 
pin and 
V1REQ from DVI Device 3. The FIFa request is an 
arbitrated request from the FIFa logic which ranks 
the requests as follows: 32·Bit·Write FIFa, 16·Bit· 
Write FIFa, 32·Bit·Read FIFa, 16-Bit-Read FIFa. 
Table 2-1 summarizes the request priorities. 


Table 2-1. Bus Request 
Priority 


Priority 
Request 


1 
EMS 


2 
32-Bit-Write FIFa 


3 
16-Bit-Write FIFa 


4 
32-Bit-Read FIFa 


5 
16-Bit-Read FIFa 


6 
CAPT 


7 
AUDIO 


8 
V1REQ 
I 
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The arbitration algorithm is such that once requests 
have been stacked up, the arbitration logic services 
the stacked up requests before any other requests 
will be considered. This behavior keeps a high-priori- 
ty request from monopolizing the DVI Bus. For ex- 
ample, consider a situation where HREQ is assert- 
ed. While HREQ is being serviced AREQ is assert- 
ed, followed by CREQ. Even though AREQ was as- 
serted first, CREQ will be the next to get attention. 
This is because when one request is already in the 
process of being serviced no other requests are 
considered and they stack up. Then when arbitration 
continues 
(after 
the 
service 
is 
completed) 
all 
stacked up requests enter at the same time and the 
one with the highest priority will be the next to be 
serviced. Only stacked up requests and requests 
that become active on the same ClK edge are arbi- 
trated against each other. Otherwise, requests are 
honored in the order in which they are received. 


Another example of the arbitration algorithm is one 
in which all of the possible requests become active 
in the opposite order of priority, all separated by one 
clock period. The first request to be serviced is the 
lowest priority one since it came in first, followed by 
the rest of the requests in order of priority or the 
opposite order in which they arrived. This is be- 
cause they all stacked up behind the lowest priority 
request. 


2.4 
VGA DAC Support 


The DVI Board may have a VGA type DAC onboard 
which can be used along with the Video Feature 
Connector to "copy" the video stream from the sys- 
tem's VGA. The onboard VGA video can then be 
keyed with 82750DB's output. The final RGB video 
information is output to a standard VGA monitor and 
connector. The monitor ID supplied to the monitor 
connector by the connected monitor can be read by 
the host. 


All I/O writes on the system bus are monitored by 
the HIGA. When an I/O write falls into the range of 
03C6-03C9, hex for the PS/2· based system or 
X3C6-X3C9 hex for the AT-based system, a VGA 
write to the onboard VGA DAC is generated, dupli- 
cating the data in the system's VGA DAC. I/O reads 
to 03C6-03C9 are ignored by the DVI Board to pre- 
vent two I/O devices from driving data at the same 
time. Using this eavesdropping programming tech- 
nique together with the system's Video Feature Bus 
allows the DVI Board to maintain an RGB video 
stream identical to the system's VGA RGB creating 
a VGA compatible mode that is transparent to the 
programmer or user. 
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The 
General 
Control 
Register 
contains 
the 
DINT 
(Disable 
Interrupt) 
bit that can be used to reassert 


• 
• 
• 
'11" 110:;\:; 
IIU 
space 
at t:SA~E-BASE+3F 
hex plus 
the 
POS registers 
through 
the 
POS setup 
mecha- 
nism in PS/2-based 
machines. 
A register 
map of the 
HIGA registers 
is shown 
in Table 2-2. 


Table 2-2. HIGA Register 
Map 


B1 
..................•.•.•.•.••.•..•.•••..... 
BS 
B7 
..................••.....••..•....•...•... 
BO 


PAROHIGH 
PAROLOW 


PAR1 HIGH 
PAR1 LOW 


PAR2HIGH 
PAR2LOW 


PAR3HIGH 
PAR3LOW 


ROM SK SELECT 
REGISTER 
DVI DEVICE 
QUICK 
ACCESS 


32·BIT·WRITE 
FIFa 
DATA 
HIGH 
32·BIT·WRITE 
FIFa 
DATA 
LOW 


32·BIT·WRITE 
FIFa 
ADDR 
CNTR 
BYTE 
32·BIT·WRITE 
FIFa 
CONTROL 


16·BIT·WRITE 
FIFa 
DATA 
HIGH 
16·BIT·WRITE 
FIFa 
DATA 
LOW 


16·aIT·WRITE 
FIFa 
ADDR 
CNTR 
BYTE 
16·BIT·WRITE 
FIFa 
CONTROL 


32·BIT·READ 
FIFa 
DATA 
HIGH 
32·BIT·READ 
FIFa 
DATA 
LOW 


32·BIT·READ 
FIFa 
ADDR 
CNTR 
BYTE 
32·BIT·READ 
FIFa 
CONTROL 


16·BIT·READ 
FIFa 
DATA 
HIGH 
16·BIT·READ 
FIFa 
DATA 
LOW 


16·BIT·READ 
FIFa 
ADDR 
CNTR 
BYTE 
16·BIT·READ 
FIFa 
CONTROL 


GENERAL 
STATUS 
REGISTER 
POSO 


GENERAL 
CONTROL 
REGISTER 
POS1 


1/0 PORT SWITCH 
REGISTER 
POS2 


POST ADDRESS 
SWITCH 
REGISTER 
POS3 


RESERVED 
POS4 


RESERVED 
POSS 


VGA TEST 
REGISTER 
3C9 
VGA TEST 
REGISTER 
3CS 


VGA TEST 
REGISTER 
3C7 
VGA TEST 
REGISTER 
3C6 


BASE+O 


BASE+2 


BASE+4 


BASE+6 


BASE+S 


BASE+20 


BASE+22 


BASE+24 


BASE+26 


BASE+2S 


BASE+2A 


BASE+2C 


BASE+2E 


BASE+30 


BASE+32 


BASE+34 


BASE+36 


BASE+3S 


BASE+3A 


BASE+3C 


BASE+3E 


The fast access 
registers 
for the FIFO data ports can become 
slow access registers 
if the access 
must cause 
a DVI Bus cycle in order for it to complete. 
Reading an empty read FIFO or writing a full write FIFO are the only 
examples 
of this condition. 
All other system 
registers 
and VRAM are in the Slow Access 
Group. 


1-134 
I 


2.7.1 PAGE ADDRESS REGISTERS 
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I b15 I b14 I b13 I b12 I b11 I b10 I b9 I b8 I b7 I be I b5 I b4 I b3 I b2 I b1 I bO I 


Figure 2-1. fARn 


Page Address Registers 0-3 
are read/write regis· 
ters 
located 
at 
BASE+ 0 
hex, 
BASE+ 2 
hex, 
BASE+ 4 hex and BASE+ 6 hex respectively. The 
PARs are used to map the logical (window) address 
space of the host indep~ndently into four pages of 
physical addresses in VRAM. The PARs are 16-bit 
registers, although only the necessary top PAR bits 
are used in each EMS Window Mode. 


2.7.2 DVI DEVICE QUICK ACCESS REGISTER 


b7 
bO 


~ 
1021101 
liDO I BASE+8 


Figure 2-2. DVI Device Quick Access Register 


The DVI Device Quick Access Register is a write- 
only register located at BASE+8 hex. This is an 
8-bit register with only the lowest three bits defined. 
The value of the 102, 101 and 100taken as a three 
bit field define a DVI Device ID which is used in con- 
juction with the Quick Access Method of bypassing 
the PARs to access a DVI Device Register. The 
Quick Access Register will be used to force the ac- 
cess to the DVI Device ID contained in this register. 
The Quick Access is enabled by the writing of the 
Quick Access Register and lasts for just one access 
through the EMS Window. 


The user is cautioned to keep in mind that the host 
CPU will turn one 16-bit access at an odd address 


into two 8-bit accesses and the Quick Access would 
be enabled for only the first 8-bit access. The user is 
also cautioned to keep in mind that the HIGA does 
not distinguish between host cycles performed by 
the CPU and those that are performed via DMA. 


2.7.3 ROM 8K SELECT REGISTER 


b7 
bO 
I 
0 I 
0 I 
0 I 
0 I RP31 RP21 RP1 I RPOI BASE+9 


Figure 2-3. ROM 8K Select Register 


The ROM 8K Select 8egister is an 8-bit write-only 
register located at BASE+ 9 hex. Only the lowest 
4 bits are defined. This register is used to select the 
8K page of POST ROM. On RESET the contents of 
this register is initialized to zero. 


2.7.4 FIFa CONTROL, ADDRESS AND DATA 
REGISTERS 


There are five registers associated with each of the 
FIFOs. 


2.7.4.1 32-Bit-Wrlte FIFa Registers 


The 32-Blt-Wrlte FIFa Control Register is a read/ 
write register located at BASE+ 22 hex. Each of the 
bits in this register is described below. 


~ 
TEST 
FULL 
FAST 
1~I---,R.c::s-,-l--,----,RSO'::"::""...J1 
BASE+ 22 WRITE 


Figure 2-4. 32-Blt-Wrlte FIFa Control Register 


I 
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AUTO (Bit 7 Read/Write)- 
The AUTO bit when set 


to a one will cause the FIFO Address Counter to 
increment after each DVI Bus cycle. 


TEST (Bit 6 Read/Write)- 
The TEST bit is used for 
diagnostics and for ensuring a clean initialization of 
the FIFO. When initializing the FIFO, the first write to 
the control register should have this bit set to a one. 


FULL (Bit 5 Read)- The FULL bit reflects the status 
of the data holding registers in the FIFO. If the FULL 
bit is on and data is written to the data ports then the 
access becomes a Slow Access. 


FAST (Bit 4 Read)-The 
FAST bit allows the FIFO 


to perform Next-Fast VRAM cycles when the FIFO is 
full and the FIFO Address Counter is pointing to an 
even long-word. The Next-Fast VRAM cycle allows 
the FIFO to deposit two 32-bit words in one DVI Bus 
cycle, saving the overhead of arbitration and transfer 
of control for the second 32-bit word. 


EMPTY (Bit 3 Read)- The EMPTY bit reflects the 
status of the data holding registers in the FIFO. A 
write FIFO should always be empty before any 
change is made to its FIFO Control Register that 
may cause a pending data operation to fail. Such 
operations include a change from AUTO to not 
AUTO (or the reverse), any change to the FIFO Ad- 
dress Counter and setting the TEST or TCLK bits. 
The 32-Bit-Write FIFO will try to empty the data reg- 
isters whenever the most significant byte of a 32-bit 
word is received from the host. If a transfer from the 
host ends on the first, second or third byte then the 
FIFO will not go empty until the host performs a 
Flush Command. Sending a Flush Command to a 
FIFO that would have normally gone empty or was 
already empty does no harm whatsoever. 


TCLK (Bit 5 Write)- The TCLK bit should be left as 
a zero for proper FIFO operation. 


RS2-RSO (Bits 2-0 Read/Write)-The 
RS2-0 bits 


are used as a field to select other registers in the 
FIFO. Table 2-3 indicates which registers are ac- 
cessed by each combination of RS2-0. 


Table 2·3. RS2-0 Register Selection 


RS2 
RS1 
RSO 
Register 


0 
0 
0 
Address Counter Byte 0 
(Least Significant) 


0 
0 
1 
Address Counter Byte 1 


0 
1 
0 
Address Counter Byte 2 
(Most Significant) 


0 
1 
1 
Test Register 


1 
0 
0 
Reserved 


1 
0 
1 
Reserved 


1 
1 
0 
Reserved 


1 
1 
1 
Reserved 


The 32·Blt·Wrlte 
FIFO Address 
Counter 
Byte 


Register 
is 
a 
write-only 
register 
located 
at 


BASE+23 hex. This register holds each byte that is 
written to the FIFO's Address Counter before an ac- 
cess is initiated. Together the three FIFO Address 
Counter Bytes hold the 24-bit byte address of the 
destination of the FIFO data. The FIFO Control Reg- 
ister and the FIFO Address Counter Byte Register 
can be written with the same 16-bit I/O operation. 


~~~~~~~~~=-~Bl~I~B~OJI 
••• 
+ro 
I 


Figure 2·5. 32·Blt-Wrlte FIFO Address Counter 


Byte Register (RS= 0,1,2) 
32·Blt-Wrlte FIFO Test Register (RS = 3) 


The 32·Blt·Wrlte 
FIFO Test Register 
is an a-bit 


read/write register located at BASE+ 23 hex that is 
accessed when bits RS[2:0] in the corresponding 
control register = 3(010). This register is used main- 
ly for Flush Commands and diagnostics, however, it 
can be useful for gathering the information needed 
when saving the state of the FIFO for later restora- 
tion. When the FIFO Test Register is written, a Flush 
Command is performed. (The data written is not im- 
portant.) When the FIFO Test Register is read, it re- 


I B151 B141 B131 B121 Bll I Bl0 I B9 I B8 I B7 I B61 
B51 
B4 I B3 I B2 I Bl·1 
Bol 


BASE + 21 
BASE + 20 


Figure 2·6. 32·Blt-Wrlte FIFO Data Registers 
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turns some information. BO will be zero when the 
first byte has been written to the FIFO Data Regis- 
ter, B1 will be zero when the second byte has been 
written and so on. 


The 32-Blt-Wrlte 
FIFO Data High and Low Regis- 
ters 
are two 8-bit write-only registers located at 
BASE+ 20 hex and BASE+ 21 hex. The FIFO data 
registers can be written 8 or 16 bits at a time. How- 
ever, when writing with byte operations, address 
BASE+ 21 hex must be written between any writes 
to BASE+ 20 hex. 


2.7.4.2 
16-Blt-Wrlte 
FIFO Registers 


The 16-Bit-Write FIFO registers are identical to that 
of the 32-Bit-Write FIFO described above except 
that the FAST bit in the 16-Bit-Write FIFO Control 
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Register has no meaning. Locations for the 16-Bit- 
Write FIFO Registers are listed below. 


16-Bit-Write 
FIFO Control 
Register 
= BASE+26 
hex 


16-Bit-Write 
FIFO Address 
Counter 
Byte Register 
= 
BASE + 27 hex 


16-Bit-Write 
FIFO Test Register 
= BASE+27 
hex 


16-Bit-Write 
FIFO Data High Register 
= BASE + 25 hex 


16-Bit-Write 
FIFO Data Low Register 
= BASE + 24 hex 


2.7.4.3 
32-Blt-Read 
FIFO Registers 


The 32-Blt-Read 
FIFO Control 
Register 
is an a-bit 
read/write register located at BASE+2A hex. Each 
of the bits in this register is described below. 


AUTO 
TEST I 
TCLK I 
FAST I 
x 


AUTO 
TEST 
1~L...._R_S_2--,-_R_S1-...J~ 
BASE+2A READ 


RS1 
~ 
BASE+2A WRITE 
RS2 


Figure 2-7. 32-Blt-Read 
FIFO Control 
Register 


I 


11 
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~;~-~g~ificant 
byi~-Of~'32~bit-;;~~d"i~-~~~d-'fr~;;; 
the host. If a transfer from the host ends on the first, 
second or third byte then the FIFO will contain some 
residual data. When in the AUTO mode the FIFO will 
always contain some residual data after the transfer 
is finished. This unwanted data is a 'by-product of 
reading ahead and trying to keep the FIFO full. 


FAST (Bit 4 Read)- The FAST bit allows the FIFO 
to perform Next-Fast VRAM cycles when the FIFO is 
empty and the FIFO Address Counter is pointing to 
an even long-word. The Next-Fast VRAM cycle al- 
lows the FIFO to fetch two 32-bit words in one DVI 
Bus cycle, saving the overhead of arbitration and 
transfer of control for the second 32-bit word. 


EMPTY (Bit 3 Read)- The EMPTY bit reflects the 
status of the data holding registers in the FIFO. If the 
EMPTY bit is on and data is read from the data ports 
then the access becomes a Slow Access. 


TCLK (Bit 5 Write)- The TCLK bit should be left as 
a zero for proper FIFO operation. 


RS2-RSO (Bits 2-0 
Read/Write)-The 
RS2-0 bits 
are used as a field to select other registers in the 
FIFO. 


• 
., 
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The 32-Blt-Read 
FIFO Address 
Counter 
Byte 
Register 
is 
a 
write-only 
register 
located 
at 
BASE+ 2B hex. This register holds each byte that is 
written to the FIFO's Address Counter before an ac- 
cess is initiated. Together the three FIFO Address 
Counter Bytes hold the 24-bit byte address of the 
source of the FIFO data. The FIFO Control Register 
and the FIFO Address Counter Byte Register can be 
written with the same 16-bit I/O operation. 


The 32-Blt-Read FIFO Test Register 
is an 8-bit 
read/write register located at BASE+ 2B hex that is 
accessed when RS[2:0) in the corresponding con- 
trol register =3 (011). This register is used for diag- 
nostic purposes only. 


~ 
B6 I B5 I B4 I B3 I B2 I B1 ~ 
BASE+2B 


Figure 2-8. 32-Blt-Read FIFO Address Counter 
Byte Register (RS=0,1,2) 
32-Blt-Read FIFO Test Register (RS3) 


The 32-Blt-Read FIFO Data High and Low Regis- 
ters are two 8-bit read-only data registers located at 
BASE+ 28 hex and BASE+29 hex. The FIFO data 
registers can be written 8 or 16 bits at a time. How- 
ever, when writing with byte operations, address 
BASE+ 29 hex must be written between any writes 
to BASE+ 28 hex. 


I B151 B141 B131 B121 B111 B10 I B91 
B81 
B71 
B61 
B5 I B4 ~ 


BASE+ 29 
BASE+ 28 


Figure 2-9. 32-Blt-Read FIFO Data Registers 
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2.7.4.4 
16-Blt-Read 
FIFO Registers 


The Registers for the 16-Bit-Read FIFO are identical 
to those of the 32-Bit-Read FIFO described above 
except that the FAST bit in the 16·Bit-Read FIFO 
Control Register has no meaning. Locations for the 
16-Bit-Read FIFO Registers are listed below. 


16-Bit-Read 
FIFO Control 
Register 
= BASE + 2E hex 


16-Bit-Read 
FIFO Address 
Counter 
Byte Register 
= 
BASE+2F 
hex 


16-Bit-Read 
FIFO Test Register 
= BASE+2F 
hex 


16-Bit-Read 
FIFO Data High Register 
= BASE+2D 
hex 


16-Bit-Read 
FIFO Data Low Register 
= BASE + 2C hex 


2.7.5 
GENERAL 
STATUS 
REGISTER 


Figure 2-10. General 
Status Register 


The General Status Register is an 8-bit read-only 
register located at BASE+ 31 hex. This register con- 
tains three dedicated bits that are used to identify 
interrupt sources and five general purpose input bits 
(GENI0-4) which correspond directly to the input 
pins of the same names. In the bit definitions below, 
the bit names and descriptions shown in brackets 
([ l) are examples of some useful functions that 
these pins can serve in a typical DVI environment. 
These bits may, of course, be assigned to any com- 
patible input at the user's discretion. 


2.7.6 
GENERAL 
CONTROL 
REGISTER 
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GENI3 (Bit 7)-General 
Purpose Input 3. 


[VFB (Bit 7)-VFB 
will be a zero if the DVI Board is 
connected to a Video Feature Bus.] 


GENI2 (Bit 6)-General 
Purpose Input 2. 


[CS2 (Bit 6)-CS2 
will be a zero if the DVI Board is 
hosting a Capture Board.] 


GENI1 (Bit 5)-General 
Purpose Input 1. 


GENIO (Bit 4)-General 
Purpose Input O. 


SINT (Bit 3)-8INT 
(SCSI Interrupt) is a read-only bit 
that reflects the status of the interrupt signal from 
the SCSI/Capture Subsystem. The SINT bit will be a 
one whenever the interrupt signal is active. 


AINT (Bit 2)-AINT 
(Audio Interrupt) is a read-only 
bit that reflects the status of the interrupt signal from 
the Audio Subsystem. The AINT bit will be a one 
whenever the interrupt signal is active. 


VINT (Bit 1)-VINT 
(Video Interrupt) is a read-only 
bit that reflects the status of the interrupt signal from 
the Video Subsystem (82750PB). The VINT bit will 
be a one whenever the interrupt signal is active. 


GENI4 (Bit O)-General Purpose Input 4. 


[DACT (Bit O)-DACT (DAC Test) bit is a read-only 
bit that reflects the status of the RGB signals on the 
Video Output Connector. The DACT bit will be a 
zero whenever one of the RGB levels is above the 
50% of full scale point.] 


BASE+33 


'--_...l....._--'--'-.c...:='-1..-=..::..:..c..:c..J...._--'l...-_...J. 
__ 
....L_---.J 
WRITE 


BASE + 33 READ 


Figure 2-11. General 
Control 
Register 


I 


11 
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The General Control Register is an 8-bit read/write 
register located at BASE+ 33 hex. Bit 0 changes 
definition depending on whether the register is being 
read or written. This register is reset to 0 by system 
reset. Four of the bits in this register are dedicated 
and three of the bits are General Purpose Output 
bits which correspond to the output pins of the same 
names. In the bit definitions shown below the bit 
names and descriptions show in brackets ([ l) are 
examples of some useful functions that these bits 
can serve in a typical DVI environment. They may, of 
course, be assigned to any compatible output at the 
user's discretion. 


DBEN (Bit 7)-DBEN 
(82750DB Enable) is used to 
reset the 82750DB device when it is a zero or to 
allow the 82750DB to run when it is a one. 


DINT (Bit 6)-DINT 
(Disable Interrupt) is used to re- 
assert interrupts at the end of an interrupt routine or 
to disable all of the interrupts from the DVI Board. 


GENOB (Bit 5)-General 
Purpose Output B. 


[PAL (Bit 5)-PAL 
(PAL/NTSC) selects the PAL 
Mode when a one, or NTSC Mode when a zero for 
the Y-C output circuit.] 


GENOA (Bit 4)-General 
Purpose Output A. 


[DDIG (Bit 4)-DDIG 
(Disable 82750DB Digital Out- 


puts) is used to disable the digital video data outputs 
on the 82750DB device. If the DVI Board uses the 
analog video data outputs from the 82750DB device, 
the DDIGbit should be set to a one. The power dissi- 
pation of the 82750DB will increase significantly if 
the DDIG bit is a zero, especially at frequencies 
above 30 MHz. The 82750DB digital outputs are 
wired directly to the Ditgital Display Connector when 
in ATMODE. The DDIG bit must be a zero to enable 
the use of this connector.] 


GENOC .(Bit2)-General 
Purpose Output C. 


PBRST (Bit 1)-PBRST 
(82750PB Reset) is used to 
reset the 82750PB device when a one, or to disable 
it when a zero. The PBRST bit is momentarily set to 
a one at power on time before being reset and held 
to a zero, initializing the 82750PB to its reset state. 
The contents of VRAM cannot be guaranteed after 
the PBRST bit has been toggled to a one. The 
PBRST bit, when a one, disables arbitration for the 
. 


2.7.8 
POST ADDRESS SWITCH REGISTER 


intel~ 


DVI Bus in the 82750PB device. The only operation 
that should be attempted when the PBRST bit is a 
one is the writing of a zero to the PBRST bit. 


BDRST (Bit 0 Write)-BDRST 
(Board Reset) is used 
to reset the entire DVI Board. This reset is so com- 
plete that this bit should only be used for debugging 
purposes. The contents of VRAM cannot be guaran- 
teed after the BDRST bit has been toggled to a one. 
Setting this bit to a one is the same as a power on 
reset except that the POS Registers, PAR Registers, 
82750PB and the General Control Register itself are 
not affected. Writing a 03 followed by a 00 to the 
General Control Register is as complete a reset at- 
tainable without powering down. The BDRST bit 
should be held as a one for at least 10 f.Ls to guaran- 
tee a full board reset. Note also that the BDRST bit 
cannot be read back. The BDRST bit, when a one, 
disables the DVI Bus. The only operation that should 
be attempted when the BDRST bit is a one is the 
writing of a zero to the BDRST bit. In normal opera- 
tion, the General Control Register should either be 
90 hex or BOhex. 


DRDY (Bit 0 Read)-DRDY 
(Capture Data Ready) is 
a read-only bit that reflects the status of the Data 
Ready bit from the capture board. 


NOTE: 
When trying to set or reset a bit in the General 
Control Register, do not forget to reset the DRDY 
bit before writing the data back out, or the system 
will get stuck in external wait state forcing you to 
use the main reset switch of the host! 


2.7.7 
I/O PORT SWITCH REGISTER 


b7 
bO 
I A9 I A8 I A7 I A6 I AS I A4 I A3 I A2 I BASE+3S 


Figure 2-12.1/0 Port Switch Register 


The I/O Port Switch Register is an 8-bit read-only 
register located at BASE+ 35 hex. The contents of 
this register determines the I/O 
BASE when in 
ATMODE. This register is normally read during the 
execution of the POST ROM. The value of bits 
A9-A2 are compared against the host address bits 
A9-A2 
by hardware. This register has no meaning 
when operating in non-ATMODE. 


~.................................... 
. 
~ 


ARP3 I 
ARP2 I 
ARP1 I 
ARPO I APEN I 
SW2 I 
SW1 I 
SWO I BASE + 37 
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Figure 2-13. POST Address Switch Control Register 
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RESET 
VALUE 
b7······················· 
. bO 


80 


00 


01 


30 


EF 


DC 


POS5 - BASE + 3A 
. 


POS4 - B)l.SED + 38 


POS3 - BASED + 36 


POS2 - BASE + 34 


POS1-BASE+32 


POSO - BASE + 30 


1 
EMS10 
EMS9 
EMS8 
EMS7 
EMS6 
EMS5 
EMS4 


EMS3 
EMS2 
EMSl 
EMSO 
VISl 
VISO 
AlSO 
CISO 


1/07 
1/06 
1/05 
1/04 
1/03 
1/02 
EW3 
PEN 


RP3 
RP2 
RPl 
RPO 
EW2 
EWl 
EWO 
CDEN 


MOST 
SIGNIFICANT 
ID BYTE 


LEAST 
SIGNIFICANT 
ID BYTE 


Figure 2·14. HIGA Configuration 
Registers 


The POST Address Switch Register is an 8-bit read- 
only register located at BASE+ 37 hex. ARP3-ARPO 
reflect the status of the POST Address Switches 
when in ATMODE. The APEN bit reflects the status 
of the POST ROM Enable Switch when in ATMODE. 
SW2-SWO are definable switches also when in 
ATMODE. This register has no meaning when in 
non-ATMODE. 


2.7.9 
CONFIGURATION 
REGISTERS 


Figure 2-14 shows the HIGA Configuration Registers 
for both the PS/2 and PC/AT implementations. The 
Configuration Registers are used to set the I/O port 
addresses (in non-ATMODE), the POST ROM ad- 
dress, the interrupt levels for the Audio Subsystem 
and the 82750PB and the EMS Window configura- 
tion. Additionaily, the configuration registers contain 
one control bit (CDEN) used to enable and disable 
the entire board. 


When in non-ATMODE the configuration registers 
are accessed as outlined in the respective PS/2 
Technical Reference Manual. The registers can also 
be accessed via the 
I/O 
port path. The non- 
ATMODE mode prevents the writing of these regis- 
ters through the I/O port path but not the reading of 
the registers. 


When operating in ATMODE the configuration regis- 
ters are accessed only through the I/O port address- 
es. The I/O port is contained in the I/O Port Switch 
Register. The configuration register POS3 has no 
meaning when in ATMODE, except for the EW3 and 
PEN bits. In addition, the RPO-3 bits in the configu- 
ration register POS2 have no meaning when in AT- 
MODE. Instead, these 
parameters 
are 
set 
via 
switches on the board. 


NOTE: 
The POS registers on PS/2-based implementations 
should not be written by any of the software drivers 
or application code. The writing of POS registers is 
reserved exclusively for the IBM power-on configu- 
ration program. A possible exception to this would 
be the POST program that may disable the POST 
ROM after execution in order to create space for 
the EMS Window. 


2.7.9.1 
I/O 
Port BASE Address 
Determination 


The I/O port BASEs in non-ATMODE differ slightly 
from the I/O port BASEs in ATMODE. Figure 2-15 
illustrates the differences between the two systems. 
On the PS/2 (MicroChannel Bus), the host I/O ad- 
dress is mapped directly into the I/O BASE. Bits A10 
through A15 are compared against POS3 register 
bits to select the I/O address range for the entire 
DVI Board. Bits A6 through A9 must be zeroes, while 
bits AO through A5 select specific functions within 
the BASE. 


A15 
A14 
A13 
A12 
All 
Al0 
A9 
A8 
A7 
A6 
A5 
A4 
A3 
A2 
Al 
AO 
0 
0 
0 
0 


A9 
A8 
A7 
A6 
A5 
A4 
A3 
A2 
A15 
A14 
A13 
A12 
All 
Al0 
Al 
AO 
0 
0 


non-ATMODE 


ATMODE 


Figure 2·15. DVI Board I/O Port Mapping 
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S~T would be 36 hex for the POS3 register. 


2.7.9.2 
P~SO 


POSOis an eight bit readlwrite register located at 
BASE+ 30 hex. When read, the eight bits in this reg- 
ister return the least significant byte of the DVI 
Board Identification during the PS/2 
power on se- 
quence. The value returned by this register is hard- 
wired in the HIGA. This register can be read in 
ATMODE through the appropriate 1/0 port. 


POSOis also used to enable and disable the EMS 
Window. Writing 55 hex to this register will enable 
the EMS Window, and writing 54 hex to this register 
will disable the EMS Window. 


2.7.9.3 
POS1 


POS1 is an eight bit read-only register located at 
BASE+ 32 hex. The eight bits in this register return 
the most significant byte of the DVI Board Indentifi- 
cation during the PS/2 power on sequence. The val- 
ue returned by this register is hard-wired in the 
HIGA. This register can be read in ATMODE through 
the appropriate 1/0 port. 


2.7.9.4 
POS2 


POS2 is an eight bit readlwrite register located at 
BASE+ 34 hex. Each of the bits in this register is 
described below. POS2 is reset to 30 hex by system 
reset. 


CDEN 
(Bit O)-When 
the GDEN bit is a zero, the 


entire DVI Board is disabled. In non-ATMODE, the 
HIGA will not respond to 1/0 or memory addresses 
and will not drive the interrupt lines. In ATMODE the 
HIGA will not respond to memory addresses and will 
not drive the interrupt lines. 
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BASE+ 36 hex. POS3 is the 110 Port Address Con- 
figuration Register. In non-ATMODE, this register is 
used to set the 1/0 addresses for the BASE func- 
tions. Each of the bits in this register is described 
below. POS3 is reset to 01 hex by system reset. 


PEN (Bit 0)- The PEN bit is used to open and close 
the POST ROM in the host's address space. When 
set to a 0, the space that was being used by the 
POST ROM can be reused for EMS Window space 
for VRAM. 


EW3 (Bit 1)-The 
EW3 bit (along with EWO-2 in 
POS2) set the EMS Window mode. The EMS Win- 
dow size can be one of 13 sizes ranging from 
o Mbytes to 16 Mbytes in size'. 


1/02-7 (Bits 2-7)-The 
1/07-1/02 
field is used to 
compare against the host's address bus bits A15- 
A10 respectively during 1/0 read or write operations 
when in non-ATMODE. If POS3 was initialized to CO 
then the 1/0 port BASE would be set to GOOO.The 
1/07-1/02 
bits have no meaning when operating in 
ATMODE. 


2.7.9.6 
POS4 


POS4 is an eight bit readlwrite register located at 
BASE+ 38 hex. Each of the bits in this register is 
described below. POS4 is reset to zero by system 
reset. 


CISO (Bit O)-CISO is the Interrupt Level Select bit 
for the GDROM Subsystem interrupts. When GISO= 
0, then the interrupt will be presented on the INT[1] 
pin. When GISO= 1, then the interrupt will be pre- 
sented on the INT[2] pin. 


AlSO (Bit 1)-AISO 
is the Interrupt Level Select bit 
for the Audio Subsystem interrupts. When AlSO = 0, 
then the interrupt will be presented on the INT[1] 
pin. When AlSO = 1, then the interrupt will be pre- 
sented on the INT[2] pin. 
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VISO-1 
(Bits 2-3)-VISO 
and VIS1 are the Interrupt 
Level Select bits for the 82750PB interrupts. Table 
2-5 indicates where the interrupt will be presented 
for each combination of VISOand VIS1. 


Table 2-5. 82750PB 
Interrupt 
Level Selection 


VIS1 
VISO 
Pin 


0 
0 
IN[O] 
0 
1 
IN[1] 
1 
0 
IN[2] 
1 
1 
IN[3] 


EMSO-3 
(Bits. 4-7)-EMSO-3 
(along with EMS4- 
10 in POS5) are used by the board to set the ad- 
dress of the EMS Window. The memory space is 
divided into 2048 8 Kbyte segments by the HIGA. 
The desired starting address (divided by 8192) of the 
EMS window would be written in POS5 and POS4 by 
the initialization software. 


2.7.10 
VGA TEST 
REGISTERS 


82750LH 


2.7.9.7 POS5 


POS5 is an eight bit read/write register located at 
BASE+ 3A hex. Each of the bits in this register is 
described below. POS5 is reset to 80 hex by system 
reset. 


EMS4-10 
(Bits 
0-6)-EMS4-10 
(along 
with 
EMSO-3 in POS4) are used by the board to set the 
address of the EMS Window. The memory space is 
divided into 2048 8 Kbyte segments by the HIGA. 
The desired starting address (divided by 8192) of the 
EMS window would be written in POS5 and POS4 by 
the initialization software. 


07 (Bit 7)-07 
is used by the PS/2 error recovery 
system and is set by hardware to indicate "no error 
information available on this board". This bit should 
always be set to a one whenever POS5 is written. 


rV~G~A_T~e~st~R~e~gis~te~r 
__ 3~C_7__ I~W~~L)~V~G~A_Te~s~t_R~~is_te_r 
__306 
__ ~\~~EE~~~ 
BASE+3C 


~V_G_A_T_e_st_R_e~gis_w_r 
__ 3_C_9__ I~W~~~I~V_G_A_Te_s_t_R~~is_te_r 
__3OB 
__ ~\~~EE~'~ 
BASE+3E 


Figure 2-16. VGA Test Registers 


The VGA Test Registers are four 8-bit read/write 
registers located at BASE+ 3C hex, BASE+ 3D hex, 
BASE+ 3E hex and BASE+ 3F hex. These registers 
provide an alternate programming path for testing 
the OVI Board's VGA OAC. Port BASE+ 3C is the 
same register found at 03C7, Port BASE+ 3D is the 
same register found at 03C9, Port BASE+ 3E is the 
same 
register 
found 
at 
03C6 
and 
Port 


I 


BASE+ 3F is the same register found at 03C8. The 
VGA Test Registers can be written or read at any 
time but should only be accessed with byte I/O op- 
erations. The (C03C)[33E4] type numbers in Figure 
2-16 represent examples of I/O port addresses for 
(non-ATMOOE)and [ATMOOE] if the BASE was set 
to COOOor 02E4 respectively. 
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3.0 
HARDWARE 
INTERFACE 


3.1 MicroChannellnterface 
Operation 
and Timing 


MicroChannel 
host 
cycles 
consist 
of four types 
of 
cycles: 
Memory 
Read, Memory 
Write, 1/0 Read and 
1/0 Write. 
Each of the four cycle types is explained 
in Figure 
3·1. The HIGA meets 
all of the timing 
re- 
quirements 
for the MicroChannel 
Bus as published 
by IBM. Refer to any IBM PS/2 Technical 
Reference 
Manual 
for detailed 
explanations. 


3.1.1 
1/0 READ 


In the 1/0 Read cycle timing diagram pictured 
in Fig· 
ure 3·1, ClK 
is the main timing source for the HIGA. 


The signals sources 
from the MicroChannel 
Bus are 
assumed 
to 
be 
completely 
asynchronous 
to 
the 
HIGA, 
therefore, 
ClK 
is used 
to synchronize 
the 
events 
on the MicroChannel 
Bus with internal 
HIGA 
operations. 


The cycle is not considered 
valid until CMD# 
is ac- 
tive. At that time, .the internal 
synchronizer 
starts to 
sample the cycle and turn the CMD# 
into a synchro- 
nous internal 
request 
for an operation. 
Also at that 
time, 
the 
internal 
host 
interface 
sequencer 
moves 
into state T1. 


The internal 
operation 
starts at the beginning 
of the 
third 
ClK 
(T1) since 
CMD#, 
address, 
status, 
etc. 
were valid before the rising edge of that ClK 
period. 
During T2 and T3 the internal 
read operation 
takes 
place. At the end of T3 the internal 
read operation 
is 
completed, 
CHRDY is asserted 
and the internal 
host 
interface 
sequencer 
moves into state T4. The inter- 
nal host interface 
sequencer 
repeats 
state T4 until 
the host negates 
CMD#. 


The HIGA drives the data bus with valid data from 
the end of T3 (CHRDY) 
until CMD# 
is negated 
by 
the host (all T4's). 
If the operation 
required 
the use 
of the DVI Bus, additional 
T3's would have been in· 
serted 
until the operation 
was completed. 
Examples 
of 1/0 operations 
that require the use of the DVI Bus 
are reading 
an empty 
read 
FIFO and writing 
a full 
write FIFO. 
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3.1.2 
I/O 
WRITE 


The 1/0 Write cycle timing 
diagram 
pictured 
in Fig· 
ure 3·2 is basically the same as the 1/0 Read opera- 
tion except 
the data bus is driven 
by the host. The 
data should 
be valid a minimum 
of 25 ns before T3 
and should 
remain valid until CMD# 
is negated. 


3.1.3 Memory Read 


The Memory 
Read cycle 
in Figure 
3·3 requires 
the 
use of the DVI Bus before 
it can be completed 
and 


. therefore 
must be extended. 
The cycle is extended 
by inserting 
additional 
T3 states 
until 
MSTRB # 
is 
received, 
indicating 
valid data on the DVI Bus. 


The number of ClK 
periods from BUSEN # active to 
CHRDY 
inactive 
is fixed by the programming 
of the 
memory timing in the VSCGA device. The number of 
ClK 
periods from BREQ# 
active to BUSEN# 
active 
is 
variable 
and 
depends 
on 
the 
activity 
of 
the 
82750PB. 
This time is known as the DVI Bus latency 
and can be as long as one 82750PB 
Next·Fast 
cycle 
(NXTFSn 
plus one regular 82750PB 
memory 
cycle. 


This latency time is the main reason for the FIFOs in 
the HIGA. The FIFOs allow fast 1/0 
Read or Write 
cycles on the host side while independently 
execut- 
ing DVI Bus cycles on the DVI Bus side. The FIFOs 
can also stack 
host data into 32·bit 
and 64·bit 
DVI 
Bus cycles. 


The operation 
and timing 
of a memory 
read to the 
POST ROM (an eight bit device) is different 
than the 
one shown in Figure 3·3 in two ways. First, the HIGA 
will 
not 
return 
the 
CDDS16# 
signal 
during 
POST 
ROM 
accesses. 
Second, 
sp-ecial steering 
logic 
in· 
side the HIGA logically 
connects 
the MD[7:0] 
lines 
to the SD[7:0] 
lines regardless 
of the byte address. 
Therefore, 
consecutive 
byte accesses 
do not proq- 
ress from MD[7:0] 
to MD[15:8] 
to MD[23:16] 
when 
accessing 
the POST ROM. 


Note 
that 
the 
MREQ# 
(82750PB) 
and 
RAS# 
(VSCGA) signals are not connected 
to the HIGA but 
nonetheless 
were thought 
to be useful signals to ln- 
clude in the timing diagrams. 


3.1.4 
MEMORY 
WRITE 


The Memory 
Write cycle timing 
diagram 
pictured 
in 
Figure 
3·4 
is basically 
the 
same 
as the 
Memory 
Read operation 
except the SD(15:0) data bus is driv- 
en by the host. 
. 
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Figure 3-1. MlcroChannelllO 
Read Timing 
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Figure 3-2. MlcroChannelllO 
Write Timing 


I 


1·145 


CDDSI6#~----------------------------------------------------------- 
SD[15:0l------------------------------------~--------------------c:VZA~L~IDLnDA~T~A~>---- 


BREQ#------------------~,~ 
_II---------------- 
VRAM#----------------~,~ 
_J,_ 


BUSEN#--------------------------~, 
_"------------ 
V1VALEN#------------------------------~,~ 
~,r---------- 


MREQ#l----------------------------------~'____'r------------------------------- 
MSTRB#------------------------~--------------------,'____'r-------------------- 
HRDY#----------------------------------------------------\' 
-J,r------------ 
RAS#l----------------------------------------~, 
'r------------------- 


MD[31:ol----------------------------~~V~A~LRIDcA~D~DmR~ES~S[::r------~~~--------------- 
241344-6 


NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 3-3. MlcroChannel 
Memory 
Read Timing 
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Jr- 
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V1VALEN.------~----------------------~---------------------------Jlr--------- 
~REQ.l--------------------------------~~r----------------------------- 
~STRB.--------------------------------------------~~r------------------- 
HROY.------------------------------------------------__\~ 
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RAS.l--------------------------------------~\ 
_I------------------ 


~D[3 1:0] ---------------------------{:::]VlALlJi[b 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 3·4. MicroChannel Memory Write Timing 
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3.2 
ISA Interface 
Operation 
and 
Timing 


PC/AT host cycles consist of four types of cycles: 
Memory Read, Memory Write, I/O Read and I/O 
Write. Each of the four cycle types is explained be- 
low. The HIGA meets all of the timing requirements 
for the ISA Bus as published by Intel. 


NOTE: 
The HIGA adheres strictly to the ISA specification 
by requiring BALE active before each cycle. Some 
implementations of the ISA Bus do not generate 
BALE before the second cycle of a 16-bit memory 
cycle converted to two 8-bit cycles. If the HIGA is 
to operate on these implementations properly, a 
7-bit latch is needed for the HIGA's A23-17 lines 
(latched by BALE) with the CMD# (BALE) pin tied 
high on the HIGA. 


3.2.1 I/O READ 


In the I/O Read cycle timing diagram pictured in Fig- 
ure 3-5, ClK is the main timing source for the HIGA. 
The signals sources from the ISA Bus are assumed 
to be completely asynchronous to the HIGA, there- 
fore, ClK is used to synchronize the events on the 
ISA Bus with internal HIGA operations. 


The cycle is not considered valid until MADE24 
(IODRC#) is active. At that time, the internal syn- 
chronizer starts to sample the cycle and turn the 
MADE24 (IODRC#) into a synchronous internal re- 
quest for an operation. Also at that time, the internal 
host interface sequencer moves into state T1. 


The internal operation starts at the beginning of the 
third ClK (T1) since MADE24 (IODRC#), address, 
status, etc. were valid before the rising edge of that 
ClK period. DuringT2 and T3 the internal read oper- 
ation takes place. At the end of T3 the internal read 
operation is completed, CHROY is asserted and the 
internal host interface sequencer moves into state 
T4. The internal host interface sequencer repeats 
state T4 until the host negates MADE24 (IODRC#). 


The HIGA drives the data bus with valid data from 
the end of T3 (CHRDY) until MADE24 (lODRC#) is 
negated by the host (all T4's). If the operation re- 
quired the use of the DVI Bus, additional T3's would 
have been inserted until the operation was complet- 
ed. Examples of I/O operations that require the use 
of the DVI Bus are reading an empty read FIFO- 
and wriiting a full write FIFO. 
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3.2.2 I/O WRITE 


The I/O Write cycl,etiming diagram pictured in Fig- 
ure 3-6 is basically the same as the I/O Read opera- 
tion except the data bus is driven by the host. The 
data should be valid a minimum of 25 ns before T3 
and should remain valid until M_IO 
(IOWRC#) is 
negated. 


3.2.3 
MEMORY 
READ 


The Memory Read cycle in Figure 3-7 requires the 
use of the DVI Bus before it can be completed and 
therefore must be extended. The cycle is extended 
by inserting additional T3 states until MSTRB# 
is 
received, indicating valid data on the DVI Bus. 


The number of ClK periods from BUSEN# active to 
CHRDY inactive is fixed by the programming of the 
memory timing in the VSCGA device. The number of 
ClK periods from BREQ# active to BUSEN# active 
is variable and depends on the activity of the 
82750PB. This time is known atI the DVI Bus latency 
and can be as long as one 82750PB Next-Fast cycle 
(NXTFST) plus one regular 82750PB memory cycle, 
This latency time is the main reason for the FIFOs in 
the HIGA. The FIFOs allow fast I/O Read or Write 
cycles on the host side while independently execut- 
ing OVI Bus cycles on the DVI Bus side. The FIFOs 
can also stack host data into 32-bit and 64-bit DVI 
Bus cycles. 


The operation and timing of a memory read to the 
POST ROM (an eight bit device) is different than the 
one shown in Figure 3-7 in two ways. First, the HIGA 
will not return the MEMCS16# signal during POST 
ROM accesses. Second, special steering logic in- 
side the HIGA logically connects the MD[7:0] lines 
to the SO[7:0] lines regardless of the byte address. 
Therefor, consecutive byte accesses do not prog- 
ress from MD[7:0] to MD[15:8] to MD[23:16] when 
accessing the POST ROM. 


Note that 
the 
MREQ# 
(82750PB) and 
RAS# 
(VSCGA) signals are not connected to the HIGA but 
nonetheless were thought to be useful signals to in- 
clude in the timing diagrams. 


3.2.4 
MEMORY 
WRITE 


The Memory Write cycle timing diagram pictured in 
Figure 3-8 is basically the same as the Memory 
Read operation except the SD[15:0] data bus is 
driven by the host. 
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Figure 3-5. ISA 1/0 Read Timing 


TO 
I 
TO I 
T1 I 
T2 I 
T3 I 


T4 I 
T4 I 
T4 I 
TO I 


ClK 


J 
>C 
A(23:0) 


COSETUP. 
~ 
~ 
(AEN) 


MIO 


(IOWRC) 
\- 
I 


CHROY 


(IOCHRO\,) 
, 
I 
'----- 


COOSl61 
~ 
r 
(IOCSl61) 


$0(15:0) 
( 
VAlIODATA 
>-- 


241344-9 


Figure 3-6. ISA 110Write Timing 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 3-7. ISA Memory 
Read Timing 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 3-8. ISA Memory Write Timing 
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3.3 
VGA Support Operation 
and 
Timing 


3.3.2 
VGA WRITE 


In the VGA Write cycle, the HIGA does not respond 
to the host 1/0 write cycle in any way other than 
passing the cycle through to the VGAWR# pin. (No- 
tice that the HIGA stays in state TO.) This passive 
method of eavesdropping on the 1/0 writes to the 
system's VGA DAC allows the VGA DAC controlled 
by the HIGA to be an exact copy of the system's 
VGA DAC without interfering with the timing of the 
host bus. Note that the address is the address of 
one of the system VGA DAC registers (03C6). 


3.3.1 
VGA READ 


The VGA Read cycle timing diagram pictured in Fig- 
ure 3-9 is the same as an 1/0 Read operation to an 
internal register except for two major differences. 
The 5D[15:0] data bus is driven by the VGA DAC 
(not by the HIGA) and the read operation is present 
on the VGARD# signal. Note that the address is the 
address of one of the internal VGA Test Registers. 
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Figure 3·9. MlcroChannel 
VGA Read Timing 
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Figure 3-10. MlcroChannel 
VGA Write Timing 
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Figure 3-11.ISA 
VGA Read Timing 
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Figure 3-12. ISA VGA Write Timing 
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uy 
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~Vllt~ 
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VML.t:I't 
!:IUtJ~ 
low followed by GAVAlEN one clock period later. 
The time that BUSEN# is active and GAVAlEN is 
high is used by the DVI Device to drive the address 
of the transaction onto the MD[31:0] lines and drive 
the VWE# and BENn# signals to their valid states. 
The first rising edge of ClK after GAVAlEN goes 
low is used to latch the MD[31:0] lines onto the 
VA[21:0] bus. 


The 82750PB, seeing V1VAlEN and VRAM# low, 
asserts the MREQ# signal which starts the actual 
memory cycle (RAS#). The MSTRB# signal is as- 
serted at the end of the memory cycle and the DVI 
Device accepts the data (for a read) or removes the 
data (for a write). At the same time MSTRB# is as- 
serted to the DVI Devices, MRDY# (not shown) is 
presented to the 82750PB. The MRDY# causes 
HRDY# from the 82750PB which in turn causes the 
HIGA to drop the BREQ# signal. BREQ# being off 
signals the end of the DVI Bus VRAM cycle to the 
82750PB which then takes back the DVI Bus by re- 
moving BUSEN#. 


3.4.2 
NEXT-FAST 
VRAM 
ACCESS 
CYCLE 


Figure 3-14 shows the timing for a Next-Fast DVI 
BUS VRAM cycle. The Next-Fast VRAM cycle trans- 
fers two 32-bit words to/from VRAM in one DVI Bus 
cycle. The NextFast cycle is distinguished from a 
normal VRAM cycle by MD[31] when the address of 
the transaction is presented on the MD[31:0] lines. 
If MD[31] is a one when GAVAlEN goes low then 
the cycle will be a Next-Fast cycle. The FIFOs in the 
HIGA use the Next-Fast cycle to transfer 64 bits of 
information when possible. 


The Next-Fast cycle is very similar to the single 
transfer VRAM cycle. When the Next-Fast cycle is 
executed the VRAM Controller (VSCGA) performs a 
two CAS page mode memory cycle instead of the 
single GAS memory cycle. The VRAM Controller 
also ORs on the VA[2] address line to increment the 
Column Address during the second CAS cycle. For 
this reason, all Next-Fast cycles must start on even 
32-bit boundaries. 
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er words, Next-Fast cycles can be requested but 
they cannot be demanded or assumed. 


3.4.3 
DVI DEVICE 
ACCESS 
CYCLE 


Figure 3-15 shows the timing for a typical DVI Bus 
DVI Device cycle. Since BUSEN# is active for this 
cycle we know the cycle is not being executed by 
the 82750PB, but is being executed by another DVI 
Device. A 
DVI Device cycle 
executed 
by the 
82750PB would have the same timing for DSTRB#, 
MREQ#, etc. but would not have any activity on the 
BUSEN#, V1VAlEN 
and GAVAlEN 
signals. The 
82750PB also puts the address of the transaction 
directly on the VA[21:0] bus. 


The DVI Device cycle RAS# is not generated, keep- 
ing VRAM off of the DVI Bus. Instead of RAS#, the 
VRAM Controller (VSCGA) generates DSTRB#. A 
DVI Device is selected by the DSTRB# signal along 
with the IDOthrough ID2 Device ID lines. The IDO, 
ID1, and ID2 Device ID lines are usually tied to the 
VA[171, VA[18] and VA[19] signals, giving each DVI 
Device 128 Kbytes of address space. 


The selected DVI Device will latch VWE# and the 
BEN[3:0] # 
signals 
on 
the 
leading 
edge 
of 
DSTRB#. If VWE# was latched high then the cycle 
is read and the selected DVI Device will drive data 
onto the MD[31:0] lines until DSTRB# is negated. If 
VWE# was latched low then the cycle is a write and 
the selected DVI Device will accept data from the 
MD[31:0] lines on the rising edge of ClK 
during 
MSTRB#. The length of DSTRB# (programmable 
in VSCGA) should be long enough to allow for the 
slowest DVI Device access time. 


3.4.4 
82750PB 
REGISTER 
ACCESS 
CYCLE 


Figure 3-16 shows the timing for a typical DVI Bus 
82750PB Register cycle. The 82750PB Register cy- 
cle is used to access internal registers in the 
82750PB. The major differences in this type of cycle 
is the absence of MREQ#, MSTRB#, DSTRB# and 
I 


82750LH 


RAS#. The DVI Bus data bus is driven by the HIGA 
for write cycles and by the 82750PB for read cycles. 
The HRDY# signal still initiates the end of the cycle 
and is asserted when the internal register access 


has been made. 82750PB Register cycles can only 
be performed by the HIGA since an 82750PB Regis- 
ter cycle must be started with VRAM# 
off and 


VREG#on. 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 3-13. DVI Bus DVI Device VRAM Access Timing 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 
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Figure 3-14. DVI Bus Net-Fast 
VRAM Access Timing 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 
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Figure 3-15. DVI Bus DVI Device Access Timing 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


1-158 


Figure 3-16. DVI Bus 82750PB 
Register 
Access Timing 
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wnlle 
tne FIFO is trying 
to gain access 
to the DVI 
Bus the host can be writing 
additional 
bytes to the 
FIFO. 
If the FIFO becomes 
full before 
the first ac- 
cess is granted 
and the VRAM 
address 
is an even 
long-word 
address, 
then the access 
will turn into a 
Next-Fast 
VRAM 
cycle. The Next-Fast 
VRAM 
cycle 
will allow 
the FIFO to write 
both long-words 
in the 
same DVI Bus cycle. This action forces 
the FIFO to 
become 
more efficient 
as the DVI Bus gets busier. 


The operation 
of the 32-Bit-Write 
FIFO can be modi- 
fied and controlled 
through 
the programmin'g 
of the 
32-Bit-Write 
FIFO Control 
Register. 
To initialize 
the 
FIFO, 
the first 
write 
to the 
FIFO's 
control 
register 
should 
have the TEST 
bit set to a one. All subse- 
quent writes to the control 
register 
should 
have the 
TEST bit reset to a 0, The FIFO's address 
counter 
is 
written 
or 
read 
with 
three 
I/O 
operations 
to 
the 
FIFO's 
Control 
Register 
and three I/O operations 
to 
the 
FIFO's 
Address 
Counter 
Byte 
Register. 
The 
FIFO Address 
Counter 
Bytes must be written 
in the 
order 
of low followed 
by middle 
followed 
by high. 


The 
low 
byte 
of the 
FIFO 
address 
counter 
is at 
RS =0, with the middle 
byte at RS = 1 and the high 
byte at RS =2. The FIFO Control 
Register 
and the 
FIFO Address 
Counter 
Byte Register 
can be written 
with the same 16-bit I/O operation. 
The three 
FIFO 
Address 
Counter 
Bytes hold the 24-bit byte address 
of the destination 
of the FIFO data. The transfer 
of 
data can start and end on any byte boundary. 
The 
FIFO data high and low registers 
hold the data and 
can be written 8 bits or 16 bits at a time. However, 
when writing with byte operations, 
data high register 
must be written 
between 
any writes to the data low 
register. 


The 
16-Bit-Write 
FIFO 
is a low performance 
path 
from the host to all of the devices 
on the DVI Bus, 


including 
VRAM. The 16-Bit-Write 
FIFO has just one 
16-bit 
buffer 
which 
is used 
to hold 
data while 
the 
FIFO 
waits 
for access 
to the 
DVI Bus.The 
16-Bit- 
Write 
FIFO will dump 
its data registers 
into VRAM 
whenever 
the most significant 
byte of the data regis- 


ter is written. 
The operation 
of the 16-Bit-Write 
FIFO 
is very similar 
to that 
of the 32-Bit-Write 
FIFO de- 
I 


tllIllJlY 
me uooateo 
oytes 
In the 
FIFO 
into VRAM, 


The programmer 
should wait until the FIFO is empty 
before accessing 
the FIFO after executing 
the Flush 
Command. 


4.1.2 USING THE READ FIFOs 


The 32-Bit-Read 
FIFO is a high performance 
path 
from VRAM to the host. The FIFO contains 
two long- 
word buffers designed 
to maximize the throughput 
of 
the host's 
bus and the DVI Bus. The FIFO will try to 
read the first two long-words 
of data as soon as the 
FIFO Address 
Counter is written. Assuming 
the FIFO 
is empty 
and the FIFO Address 
Counter 
is pointing 
to an even long-word 
address, 
the access 
will be a 
Next-Fast 
VRAM 
cycle. The Next-Fast 
VRAM 
cycle 
will allow 
the 
FIFO to read both 
long-words 
in the 
same DVI Bus cycle. 
(If a Next-Fast 
VRAM 
cycle 
is 
not possible, 
then the FIFO would 
require 
two 
DVI 
Bus cycles 
to fill the data buffers.) 
The host could 
now read all eight bytes from the FIFO without 
hav- 
ing additional 
wait states 
inserted. 


While 
the host 
is reading 
bytes 
or words 
from 
the 
FIFO with I/O reads, the FIFO is busy trying to read 
long-words 
from VRAM in an effort to keep the FIFO 
full. The operation 
of the 32-Bit-Read 
FIFO can be 
modified 
and controlled 
through 
the programming 
of 
the 32-Bit-Read 
FIFO Control 
Register. 
To initialize 
the FIFO, the first write to the FIFO's control 
register 
should 
have the TEST 
bit set to a one. All subse- 
quent writes 
to the conrol 
register 
should 
have the 
TEST bit reset to a 0. The FIFO's address 
counter 
is 
written 
or 
read 
with 
three 
I/O 
operations 
to 
the 
FIFO's 
Control 
Register 
and three I/O operations 
to 
the 
FIFO's 
Address 
Counter 
Byte 
Register. 
The 
FIFO Address 
Counter 
Bytes must be written 
in the 
order 
of low followed 
by middle 
followed 
by high. 
The 
low 
byte 
of the 
FIFO 
address 
counter 
is at 
RS = 0, with the middle 
byte at RS = 1 and the high 
byte at RS = 2. The FIFO will fetch data immediately 
after the high Address 
Counter 
Byte is written. 
The 
FIFO Control 
Register 
and the FIFO Address 
Coun- 
ter Byte Register 
can be written with the same 16-bit 
I/O 
operation. 
The 
three 
FIFO 
Address 
Counter 
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Bytes hold the 24-bit byte address of the destination 
of the FIFO data. The FIFO data registers hold the 
read data. 


The 16-Bit-Read FIFO is a low-performance path 
from all of the devices on the DVI Bus, including 
VRAM to the host. The 16-Bit-Read FIFO has just 
one 16-bit buffer which is used to hold data while the 
FIFO waits for the host to read the data. Operation 
of the 16-Bit-Read FIFO is very similar to that of the 
32-Bit-Read FIFO described above except that the 
data is now processed in 16-bit pieces. 


Both read FIFOs can be put into an Auto-increment 
Mode by setting the AUTO bit in the corresponding 
FIFO Control Register. When in Auto-increment 
Mode the VRAM address pointer is incremented af- 
ter each access through the FIFO. The Read FIFOs 
will not prefetch data unless they are in the Auto-in- 
crement Mode. When not in the Auto-increment 
Mode, the FIFO's VRAM pointer will remain un- 
changed following the access. Accessing the Read 
FIFOs when empty will cause inserted wait states 
until the FIFO has data. 
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4.2 Programming 
the VRAM Modes 


The EMS10-0 bits in the POS4 and POS5 registers 
set the location of the EMS Window while the 
EW3-0 bits in POS2 and POS3 set the EMS Win- 
dow size. After power-on or Board Reset the EMS 
Window is disabled. To enable the EMS Window, 55 
hex must be written to POS2 (BASE+ 30). The EMS 
Window can be disabled either by writing 54 hex to 
p~so,or by selecting EMS Mode F, or by executing 
a Board Reset. The remainder of this section ex- 
plains the operating modes of the EMS logic and in 
some cases gives examples of POS register set- 
tings. 


Figure 4-1 shows the OK byte EMS Window Mode or 
Mode F. Mode F is selected by writing an F value to 
the four EW3-0 bits in POS2 and POS3. The loca- 
tion of the EMS Window will not affect the operation 
of the board in the mode since the host address bus 
is logically disconnected from the VRAM address 
bus. It is impossible for the host to access VRAM 
through its memory address space if the board is in 
this mode. 


H~~-r--r--r--r--r--r--r--r--r--r--r--r--r--r--r--r--r--r--r--r--r--r--r~ 
'~'~~~~-L,-L,-L,-L,-L,-L,-L,-L,-L,-L,-L,-L,-L,-~-L,-~-L,-~-L,-~-L~ 
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Figure 4-1. OK Byte EMS Window Mode-Mode 
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Figure 4-2. 8K Byte EMS Window Mode-Mode 
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Figure 4-2 shows the 8 Kbyte EMS Window Mode or 
Mode O.Mode 0 is selected by writing a 0 value to 
the four EW3-0 bits (as a four bit field) in POS2 and 
POS3. The location of the EMS Window is deter- 
mined by the EMS10-0 
bits in POS4 and POS5. 
Since the EMS10-0 bits select on which 8K bound- 
ary the EMS Window begins and an 8K Window size 
must start on an 8K boundary, any value in the 
EMS10-0 
bits would be logically correct. If the 
EMS10-0 bits are written with a 68 hex value (as an 
11-bit field), then the EMS Window would start at 
host location 00000, because 68 hex x 2000 hex 
(Le. 8K) is 00000. 


In EMS Window Mode 0 the EMS Window is divided 
into four 2K byte pages controlled by four different 
Page Address Registers or PARs. The decode of 
host address bits A11 and A12 determines the PAR 
used during the access. The PARs are actually 
16-bit registers although only bits P3 through P15 
are used in the 8K Mode. The VRAM address gener- 
ated by the host access is determined by the con- 
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tents of the selected PAR as well as the state of the 
host's address bus, as is shown in the figure above. 
The Match Field in the same figure defines the bits 
of the host's address bus that must match the 
EMS10-0 bits. 


The Page Address Registers (PARs) are used in 
combination with the host's address bus to generate 
the VRAM address bus. In each EMS Mode only the 
necessary top bits of the PARs are used. The re- 
maining bits in the PARs can be written with any 
value without affecting the operation of the board. 


As an example, in order to access the registers in 
DVI Device 5 which begin at FAOOOO,a PAR is writ- 
ten with FAOO.The particular PAR used is depen- 
dent on the desired host address because the PARs 
are always selected by bits on the host's address 
bus. The FAOOvalue will work in any EMS Mode 
since in each EMS Mode only the necessary top bits 
of the PARs are used. The PARs can be written and 
read with either byte 1/0 or word 1/0 operations. 


PARn 


Hest 
~ ~~~~~~~~~~~~~~oLoL~~~~~.r.r~~ 


241344-21 


Figure 4-3. 16 KByte EMS Window Mode-Mode 
1 
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Figure 4-4. 2 MByte EMS Window Mode-Mode 
8 


Table 4-1. EMS Window Modes 


Mode 
Window 
EW3:0 
EMS10:0 
Page 
Size 
Size 


F 
OK 
1111 (F) 
DON'T CARE 
0 


0 
8K 
0000 (0) 
NNNNNNNNNN 
2K 


1 
16K 
0001 (1) 
NNNNNNNNNO 
4K 


2 
32K 
0010 (2) 
NNNNNNNNNOO 
8K 


3 
64K 
0011 (3) 
NNNNNNNNOOO 
16K 


4 
128K 
0100 (4) 
NNNNNNNOOOO 
32K 


5 
256K 
0101 (5) 
NNNNNNOOOOO 
64K 


6 
512K 
0110 (6) 
NNNNNOOOOOO 
128K 


7 
1M 
0111 (7) 
NNNNOOOOOOO 
256K 


8 
2M 
1000 (8) 
NNNOOOOOOOO 
512K 


9 
4M 
1001 (9) 
NNOOOOOOOOO 
1M 


A 
8M 
1010 (A) 
NOOOOOOOOOO 
2M 


B 
16M 
1011 (B) 
00000000000 
4M 
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4.3 
Using the Power-On-Self- 
Test 
ROM 


The 
Power-On-Self-Test 
ROM 
or POST 
ROM 
is a 
single 64 Kbyte ROM that is page mapped through a 
fixed 
size 
EMS 
Window 
in the 
memory 
address 
space 
between 
OCOOOOhex and ODFFFF 
hex. On 
8 Kbyte boundaries, 
any single 
ROM can appear 
in 
anyone 
of the sixteen 
8 Kbyte pages from OCOOOO 
hex to ODEOOO hex. 
In non-ATMODE, 
the 
POST 
ROM's 
address 
space 
is determined 
by the 
ROM 
Page Select 
bits in the HIGA's 
POS2 register 
(RPO- 
~P3). In ~ TMODE, 
the POST ROM's address 
space 
IS determined 
by the ROM Page Select Register. 
For 
both types of systems, 
the 1 of 8 POST ROM selec- 
tion is done with the ROM Select bits in the ROM 8K 
Select 
Register 
and is set to select 
ROM 0 by RE- 
SET. A POST ROM Enable bit in the POS3 register 
can be used to disable 
the POST ROM. When 
dis- 
abled, 
the 
DVI Board 
will 
not 
respond 
to memory 
accesses 
in the 
POST 
ROM 
address 
space. 
The 
POST ROM is enabled 
by RESET, 
but can be dis- 
abled by writing the POS3 register 
in non-ATMODE. 
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In ATMODE, 
the ROM Enable 
Switch 
or the POS3 
register 
can be used to disable 
the POST ROM. 


5.0 
ELECTRICAL 
DATA 


5.1 
D.C. Characteristics 


Maximum Ratings 


Table 5-1 contains 
stress ratings only, and function- 
• 
al operation 
at the maximums 
is not guaranteed. 
Ex- 
posure 
to Maximum 
Ratings 
may affect 
device 
reli- 
ability. Furthermore, 
although 
the 82750LH 
contains 
protective 
circuitry to resist damage from static elec- 
trical discharge, 
this device 
is sensitive 
to ESD lev- 
els above 
1000V. Always 
take precautions 
to avoid 
high static voltages 
or electric 
fields. 


Table 5-1 
Maximum 
Ratings 


Condition 
Maximum 
Requirement 


Maximum Operating 
Junction 
Temperature 
100·C 


Storage Temperature 
- 65·C to + 150·C 


Voltage on Any Pin with Respect to Ground 
-O.5Vto 
+7V 


Supply Voltage with Respect to Vss 
-O.5Vto 
+7V 


Input Current Clamp (VI < 0 or VI > Vcc) 
±20mA 


Output Current Clamp (Vo < 0 or vo > Vccl 
±20mA 


Continuous 
Output Current Low 
20mA 


Continuous 
Output Current High 
20mA 


Table 5-2. Recommended 
Operating 
Conditions 


Recommended 


Parameter 
Condition 


Mln 
Nom 
Max 


Supply Voltage 
(Vccl 
4.50V 
5.0V 
5.50V 


Operating 
Temperature 
Range 
O·C 
70·C 


I 
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Table 5·3. D.C. Characteristics 
VCC = 5V, TCASE = 25°C 


Symbol 
Parameter 
Mln 
Typ 
Max 
Units 
Notes 


VIL 
Input LOW Voltage 
0.8 
V 
vcc = 4.5V 


VIH 
Input HIGH Voltage 
2.0 
V 
vcc = 5.5V 


VOL(l) 
Output LOW Voltage 
0.5 
V 
VI = 0.1 vcc. IOL = 4 mA 


VOL(2) 
Output LOW Voltage 
0.5 
V 
VI = 0.1 vcc. IOL = 16 mA 


VOL(3) 
Output LOW Voltage 
0.5 
V 
VI = 0.1 VCC, IOL = 20 mA 


VOH(l) 
Output HIGH Voltage 
3.7 
V 
VI = 0.9 vcc. IOH = 4 mA 


VOH(2,3) 
Output HIGH Voltage 
3.7 
V 
VI = 0.9 vcc. IOH = 12 mA 


IIL(4) 
Input Leakage Current 
-70 
/loA 
VIL = DV 


IIL(5) 
Input Leakage Current 
±1 
/loA 
VIL = DV 


IOZ(6) 
Output Leakage Current 
-70 
/loA 
Vo = DV 


IOH(l) 
Output HIGH Current 
4 
mA 


IOH(2,3) 
Output HIGH Current 
12 
mA 


IOL(l) 
Output LOW Current 
4 
mA 


IOL(2) 
Output LOW Current 
16 
mA 


IOL(3) 
Output LOW Current 
20 
mA 


Icc 
Power Supply Current 
15 
mA- 


CIN 
Input Capacitance 
7 
pF 


COUT 
Output Capacitance 
34 
pF 


VT(7) 
Input Threshold 
Voltage 
1.3 
V 


NOTES: 
1. All output and bidirectional pins except INT[3:0)#, CDDS16#, CHRDY, CDSFDBK#, SD[15:0). 
2. INT[3:0), CDDS16#, CHRDY, CDSFDBK# only. 
3. SD[15:0) only. 
4. All input pins except RESET, M_IO, 51 #, SOli', MADE24, TESTPIN. 
5. RESET M_IO, 51#, SOli', MADE24, TESTPIN. 
6. Specified for MD[31:0), VWE#, BEN[3:0) # only. 
7. Specified for all input pins except RESET, M_IO, 51 #, 50#, MADE24. 
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Table 
5-4. ClK 
D.C. Characteristics 
vcc = 5V, TCASE = 25°C 


Symbol 
Parameter 
Mln 
Typ 
Max 
Units 
Notes 


VIL 
Input LOW Voltage 
0.9 
V 
VCC = 4.5V 


VIH 
Input HIGH Voltage 
3.85 
V 
VCC = 5.5V 


IlL 
Input LOW Leakage 
±1 
/LA 
VIH = vcc 


IIH 
Input HIGH Leakage 
±1 
/LA 
VIL = OV 


VT 
Input Threshold 
Voltage 
2.5 
V 


CIN 
Input Capacitance 
7 
pF 
11 
Output Delay and Rise Time Versus Load Capacitance 


nom+8 
/' 
/' V 


-: 
/' 
-: 


I 


Valid 


~~: 
nom+" 
(ns) 


nom 


nom-" 
25 
50 
75, 
100 


<1. (pr) 


241344-23 
nom = nominal 
value 
used in AC Characteristics 


Figure 
5-1. Typical 
Output 
Valid Delay 
versus 
load 
Capacitance 


I 


125 


RI •• 
Time 
(ns) 


./ / 
/ 
V 


./ 
V 
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Figure 
5-2. Typical 
Output 
Rise Time 
versus 
load 
Capacitance 
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t3 
CMD# 
Active Setup to ClK 
17 
ns 
5-1, -2, -3, -4, -5, -6 
2 


t4* 
A[23:0l, 
M_IQ 
Valid to CDSFDBK# 
Active 
60 
ns 
5-1, -2, -3, -4, -5, -6 


t5* 
A[23:ol, 
M_IQ 
Valid to CDDS16# 
Active 
55 
ns 
5-1, -2, -3, -4, -5, -6 


t6 
A[23:0l, 
M_IQ 
Valid to CHRDY Inactive 
60 
ns 
5-1, -2, -3, -4, -5, -6 


t7 
A[23:01, M_IQ 
Invalid to CDSFDBK# 
Inactive 
4 
40 
ns 
5-1, -2, -3, -4, -5, -6 


t8 
A[23:0l, 
M_IQ 
Invalid to CDDS16# 
Inactive 
2 
40 
ns 
5-1, -2, -3, -4, -5, -6 


t9 
ClK 
to CHRDY Active 
31 
ns 
5-1, -2, -3, -4, -5, -6 


t10 
ClK 
to READ DATA VALID 
9 
51 
ns 
5-1, -3, -5, 
1 


t11 
READ DATA VALID Hold from CMD# 
Inactive 
0 
ns 
5-1, -3, -5 
1 


t12 
WRITE DATA VALID Setup to ClK 
25 
ns 
5-2, -4,-6 
1 


t13 
WRITE DATA VALID Hold from CMD# 
Inactive 
30 
ns 
5-2, -4,-6 
1 


t14 
ClK 
to BREQ# 
Active 
9 
28 
ns 
5-3, -4 


t15 
CMD# 
Active to VRAM# 
Active 
4 
40 
ns 
5-3, -4 


t16 
CMD# 
Active to VGARD# 
Active 
35 
ns 
5-5 


t17 
CMD# 
Inactive to VGARD# 
Inactive 
7 
ns 
5-5 


t18 
CMD# 
Active to VGAWR# 
Active 
34 
ns 
5-6 


t19 
CMD# 
Inactive to VGAWR# 
Inactive 
7 
ns 
5-6 


NOTES: 
1. Measured 
with CL = 100 pF on SO[15:0). 


2. CMO# 
is asynchronous. 
CMO# 
setup time is specified 
only to guarantee 
timing 
shown. 
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Figure 5-1. MlcroChannelllO 
Read AC Timing 
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Figure 5-2. MlcroChannelllO 
Write AC Timing 
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ClK 


A[23:0] 


ILIO 


SI# 


SO# 


Ct.CO# 


CHROY 


COSFDBK# 


COOSI6# 


SO[15:0] 


BREQ# 


VRAt.C# 
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V1VAlEN# 


t.CREQ#l 


t.CSTRB# 


HROY# 


RAS#l 


t.C0[31:0] 


TO 
I 
TO 
I 
T1 
I 
T2 
IT3(1)1T3(2)1T3(3)1T3(4)1T3(5)1T3(6)1T3(7)1T3(8)!T3(9)! 
T4 
! U 
! u 
I 
TO 
! 


, 
, 
, 
, 
------------------------------~~~-------------------------- 
--------------------------------------~~~----------------- 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 
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Figure 5-3. MlcroChannel 
Memory 
Read Timing 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


I 


Figure 5-4. MlcroChannel 
Memory 
Write AC Timing 
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Figure 5-5. MlcroChannel 
VGA Read AC Timing 
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Figure 5-6. MlcroChannel 
VGA Write AC Timing 
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5.2.2 ISA BUS INTERFACE 
TIMING 


Table 
5-6. ISA Bus Interface 
AC Characteristics 


Symbol 
Parameter 
Mln 
Max 
Units 
Figure 
Note 
to 
A[23:01, CDSETUP# 
(AEN) Valid 
25 
ns 
5-7,-11 
to MADE24 
(IORDC#) 
Inactive 


t1 
MADE24 
(IOROC#) 
Inactive to 
26 
ns 
5-7, -8, -11, -12 
CHRDY (IOCHRDY) 
Inactive 


t2 
A[23:01, CDSETUP# 
(AEN) Valid 
30 
ns 
5-7, -8, -9, -10, -11, -12 
to CDDS16# 
(IOCS16#) 
Active 


t3 
ClK 
to CHRDY (IOCHRDY) 
Active 
31 
ns 
5-7, -8, -9, -10, -11, -12 


t4 
ClK 
to READ DATA VALID 
9 
51 
ns 
5-7, -9, -11 


t5 
A[23:0] 
CDSETUP# 
(AEN) Hold 
5 
ns 
5-7, -11 
from MADE24 
(IORDC#) 
Active 


t6 
MADE24 
(IORDC#) 
Active to 
7 
26 
ns 
5-7,-11 
CHRDY (IOCHRDY) 
Invalid 


t7 
READ DATA VALID Hold from 
0 
ns 
5-7, -11 
1 
MADE24 
(IORDC#) 
Active 


t8 
SD[15:0] 
Tri-State 
from MADE24 
30 
ns 
5-7,-11 
(IORDC #) Active 


t9 
A[23:0] 
CDSETUP# 
(AEN) Invalid 
5 
30 
ns 
5-7, -8, -11, -12 
to CDDS16# 
Inactive 


t10 
A[23:0], 
CDSETUP# 
(AEN) Valid 
25 
ns 
5-8, -12 
to M_IO 
(IOWRC#) 
Inactive 


t11 
M_IO 
(IOWRC#) 
Inactive to 
25 
ns 
5-8, -12 
CHRDY (IOCHRDY) 
Inactive 


t12 
WRITE DATA VALID Setup to ClK 
25 
ns 
5-8, -12 


t13 
A[23:0] 
CDSETUP# 
(AEN) Hold 
10 
ns 
5-8, -12 
from M_IO 
(IOWRC#) 
Active 


t14 
M_IO 
(IOWRC #) Active to 
25 
ns 
5-8, -12 
CHRDY (IOCHRDY) 
Invalid 


t15 
WRITE DATA VALID Hold from 
32 
ns 
5-8, -12 
1 
M_IO 
(IOWRC #) Active 


t16 
SO [15:0] Tri-State 
from M_IO 
30 
ns 
5-8, -12 
(IOWRC#) 
Active 


t17 
A[23:01, CDSETUP# 
(AEN) Valid 
39 
ns 
5-9 
Setup to SO# (MRDC#) 
Active 


t18 
A[23:0] 
Valid Hold from SO# 
22 
ns 
5-9 
(MRDC#) 
Active 


t19 
SO# (MRDC#) 
Active to CHRDY 
39 
ns 
5-9 
(IOCHRDY) 
Inactive 


I 
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Table 
5-6. ISA Bus Interface 
AC Characteristics 
(Continued) 


Symbol 
Parameter 
Mln 
Max 
Units 
Figure 
Note 


t20 
A[23:0] 
Invalid to CDSFDBK# 
(MEMCS16#) 
0 
ns 
5-9, -10 
Inactive 


t21 
ClK 
to BREQ# 
Active 
9 
28 
ns 
5-9 


t22 
CDSETUP# 
(AEN) Valid Hold from SO# 
8 
ns 
5-9 
(MRDC#) 
Inactive 


t23 
SO# (MRDC#) 
Inactive to CHRDY (IOCHRDY) 
8 
39 
ns 
5-9 
Invalid 


t25 
READ DATA VALID Hold from SO# (MRDC#) 
0 
ns 
5-9 
1 
Inactive 


t26 
SD[15:0] 
Tri-State 
from SO# (MRDC#) 
Inactive 
30 
ns 
5-9 


t27 
A[23:0], 
CDSETUP# 
(AEN) Valid Setup to S1 # 
39 
ns 
5-10 
(MWRC#) 
Active 


t28 
S1 # (MWRC#) 
Active to CHRDY (IOCHRDY) 
66 
ns 
5-10 
Inactive 


t29 
A[23:0] 
Valid Hold from S1 # (MWRC#) 
Active 
41 
ns 
5-10 


t31 
CDSETUP# 
(AEN) Valid Hold from S1 # 
8 
ns 
5-10 
(MWRC#) 
Inactive 


t32 
S1 # (MWRC#) 
Inactive to CHRDY (IOCHRDy) 
7 
40 
ns 
5-10 
Invalid 


t33 
WRITE DATA VALID Hold from S1 # (MWRC#) 
32 
ns 
5-10 
1 


Inactive 


t34 
SD[15:0] 
Tri-Statefrom 
S1 # (MWRC#) 
Inactive 
30 
ns 
5-10 


t35 
MADE24 
(IORDC#) 
Inactive to VGARD# 
Active 
42 
ns 
5-11 


t36 
MADE24 
(IORDC#) 
Active to VGARD# 
Inactive 
4 
24 
ns 
5-11 


t37 
M_IO 
(IOWRC#) 
Inactive to VGAWR# 
Active 
36 
ns 
5-12 


t38 
M_IO 
(IOWRC#) 
Active to VGAWR# 
Inactive 
8 
23 
ns 
5-12 


t39 
SO# (MRDC#) 
Active to VRAM# 
Active 
6 
55 
ns 
5-9 


t40 
S1# 
(MWRC#) 
Active to VRAM # Active 
6 
55 
ns 
5-10 


NOTE: 
1. Measured 
with CL = 100 pF on SO[15:0]. 
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Figure 5-7. ISA 1/0 Read AC Timing 
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Figure 5-8. ISA 1/0 Write AC Timing 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 5-9. ISA Memory 
Read AC Timing 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 5-10. ISA Memory 
Write AC Timing 
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Figure 5-11. ISA VGA Read AC Timing 
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Figure 5-12. ISA VGA Write AC Timing 
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5.2.3 
DVr BUS INTERFACE 
TIMING 


Table 
5-7. DVI Bus Intertace 
AC Characteristics 


Symbol 
Parameter 
Mln 
Max 
Units 
Figure 
Note 


to 
Cl.K to BREQ# 
Active 
28 
ns 
5·13, ·14, ·15, ·16 


t1 
Cl.K to V1VAlEN 
Inactive 
30 
ns 
5·13, ·14, ·15, ·16 


12 
Cl.K to GAVAlEN 
Inactive 
4 
30 
ns 
5·13, ·14, ·15, ·16 


t3 
V1VAlEN 
Inactive to VWE# I 
40 
ns 
5·13, ·15 
BEN[3:0] 
# Valid 


t4 
BUSEN # Active to VALID ADDRESS 
7 
ns 
5·13, ·14, ·15, ·16 
1 


t5 
MSTRB # Active Setup to elK 
10 
ns 
5·13, ·15 


t6 
WRITE DATA VALID Hold from cue 
60 
ns 
5·13, ·15 
1 


t7 
WRITE DATA VALID Hold from cue 
6 
ns 
5·13, ·15 
1 


t8 
READ DATA VALID Setup to cue 
0 
ns 
5·13, ·15 
1 


t9 
READ DATA VALID Hold from cue 
14 
ns 
5·13, ·15 
1 


t10 
Ol.K to BREQ# 
Inactive 
4 
30 
ns 
5·13, ·14, ·15, ·16 


t11 
Cl.K to V1VAlEN 
Active 
4 
30 
ns 
5·13, ·14, ·15, ·16 


t12 
elK 
to GAVAlEN 
Active 
28 
ns 
5·13, ·14, ·15, ·16 


t13 
VWE # IBEN [3:0] # Hold from 
2 
40 
ns 
5·13, ·15 
BUSEN# 
Inactive 


t14 
VWE # IBEN [3:0] # Setup to DSTRB # 
40 
ns 
5·14 
Active 


t15 
VWE # IBEN [3:0] # Hold from 
40 
ns 
5·14 
DSTRB# 
Active 


t16 
WRITE DATA VALID Setup to DSTRB# 
40 
ns 
5·14 
1 


Active 


t17 
WRITE DATA VALID Hold from 
20 
ns 
5·14 
1 


DSTRB# 
Active 


t18 
DSTRB# 
Active to READ DATA VALID 
60 
ns 
5·14 


t19 
READ DATA VALID Hold from 
0 
ns 
5·14 
1 


DSTRB# 
Inactive 


120 
DSTRB# 
Inactive to MD[31:0] 
Tri·State 
2 
15 
ns 
5·14 
2 


121 
BUSEN # Active to VWE # IBEN [3:0] # 
3 
55 
ns 
5·16 
Valid 


122 
HRDY # Active Setup to Cl.K 
0 
ns 
5·16 


NOTES: 
1. Measured 
with CL = 100 pF on MD[31:0). 


2. With VWE # high. 


I 
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.----r 


DST~ 
------------~----_+--~----~---------------4i.-------------4i-------------- 
~d~ 
~m~ 


. xC==4=========~i========~xc====== 


VA(21:O)'=====i====XC::t======t:=====::::;XC:=:=:= 


VWEtlBEN)3:0)l 


HRDV. ------------~--------------+----------------4----,~ 
~/~------------ 


RA~' --~--------~--------------+----------------4---------------------------- 
--.i18~: 
-------------i---~·::~~~~+=>------------~.U 
ft~::~. 
MD(31:OK~ad) 
~ 
VAU)o\OORESS 
~I----------------------- 
. 
---J. 
• 
'-- 
-----...:.~ 
~ 
~ 
~17.--- 


K-- V~~~E§ 
~C::::V~~~O@OA~~~~:~*:------------------------- 
MD(31:0KWrile) 


...: 
M 
! 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 5-13. DVI Bus DVI Device VRAM Access AC Timing 
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82750LH 


I 
TO I 
TO I 
T 1 I 
T2 
I 
T3 
I 
T4 
I 
T5 
I 
T6 
I 
T7 
I ra 
I 
T9 
I re 
I 
T9 
I 
TO I 
TO I 


elK 
~'\....~~~~~ 


VRA~# 
~~ 
~ 
__ -+ 
~ 
+-__~~ 
__J' 


#-1 
to I- 
tl0- 
I- 


BREQ 
~~ 
~ 
__ -+ 
~ 
+-__~r~------~--- 


_t12 


BUSEN# 


V1VALE N 


GAVAlE N 


~REQ# 1 


~STRB# 


DSTRB# 


VWE#/BEN[3:0] 
# 


VA[21:0 
]1 


HRDY# 


RAS#1 


~D[31 :O](Road ) 
. 


~D[31 :O](Write ) 


- 
f-tl3 


- 
tl1-- 
- 


-t2!- 
'\. 


-l t5 I-- 
-l t5 
- 
-------+--+---------~~~r---~~Jr---~----- 


_ t31- 
I 
I 


x 
x 


'\, 


VALID ADDRESS 


f--t6~1 
f--t6~1 


VALID ADDRESS >---i: 
VALID DATA 
--{{:::1V~Alg]ID[gDA~T~A:JI_------------- 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


Figure 5-14. DVI Bus Next-Fast 
VRAM Access AC Timing 


I 


tll 
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82750LH 
intel~ 


V1VALE 


GAVALE 


VWEIIO/BEN[3:0] 


VA[21:0 


~D[31 
:O](Road 


~D[31 
:O](Writ. 


I 
TO 
I 
TO 
I 
T 1 
I 
T2 
I 
T3 
I u 
I 
T5 
I 
T6 
I 
T7 
I 
T8 
I 
T9 
I 
TO 
I 
TO 
I 


CL K ~ 
~ 
~ 
~ 
~ 


110 " 
/ 
;~ 
- 
I--tl0 
I 


110 
- 
tl1- 
- 
I-tll 


N 
'l 
r 
- 
t21- 
- 
1-112 


N 
r 
, 


110 


110 
r--tI4- 
I--t151 


110 
I 
I 


]' 
X 
X 


110 
'\ 
/ 
, 
a= 
) 
VALID ADDRESS 
I--t18--L..:: 
t20 


VALID DATA 
~ 
~ 


f-t16 
~t17 
) 
VALID ADORESS 
VALID 
DATA 


VRA~ 


BREO 


BUSEN 


~REQIIO 


~STRB 


DSTRB 


HRDY 


RASIIO 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 
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Figure 5-15. DVI Bus DVI Device Access AC Timing 


I 


intel~ 
82750LH 


V1VAlEN 


GAVAlEN 


MSTRB# 


VWE'/BEN[3:O)' 


VA[21:O)' 


MOf31 :O)(Read) 


MD[31:O)(Wri1e) 


TO 
I 
TO 
I 
Tl 
T2 
Din 
I 
BID 
I 
n 
TB 
I 
T9 
TO 
I 
TO 
I 


cue 1 


VRAM. 
~r------------~----~--------------------------~----~~:~ 
_ 


...; 
10 
:.., 
...:..:. 'H:.,."-------~--------- 
BRECt H 
I.' 


----------~~~~i~--~------------------------~-----J~---------- 
+:,,- 
:. 
------------~--~.~~ 
-?';,p--;:s::.------- 


~Q~ 
~m~ 
: ~~ 
~--------~.-J/,r------- 


HRDYt 


~Q'~ 
======~!====~*=============tti==~------- 
====~====xc========~i~!==~x~=== 
-:~ 
-------------+--------------------------------~~ 
i . 


i 


/r----------- 


!' 
VALID AOOAESS 
j, 
, 
---------~+:=+i-w~~K==]Vi%~D~~~E~~::l__t 
,.~ 
~ 


17 t.- 


VAlDDATA 
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NOTE: 
1. Signal 
not physically 
connected 
to the HIGA. 


I 


Figure 5·16. DVI Bus 82750PB 
Register 
Access AC Timing 
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0.25 
HIN 
.J L ' 
0.SO±0.15 
~or' 
,. 
In 


18BB'd8BB'd8BB'dB'dll'll'db'WB'd8BB'd8BH8B888BB88 
I 


2S.00±0.10 
sq. 


31.20±0.25 
sq. 


NOTES: 
1. All dimensions 
in mm. 


2. Co-planarity 
within 
O.10mm. 


DETAIL 
A 
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6.2 
Package Thermal Specifications 


Thermal 
Impedance 
is defined 
as the ability to dissi- 
pate heat generated 
by an electronic 
device 
and is 
characterized 
by (JJA and 
(JJC. It is measured 
in de- 


grees 
Celcius 
per Watt. 
(JJA is the thermal 
imped- 
ance from the IC chip junction 
to still air ambient with 


the package 
mounted 
in a socket 
or directly 
mount- 
ed on a PC Board. 
(JJC is the thermal 
impedance 
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from the IC junction 
to the external 
package 
case. 


Measurements 
are typically 
taken using high air flow 


to simulate 
an infinite heat sink. The thermal 
charac- 
teristics 
of the 
160 lead PQFP package 
are as fol- 


lows: 


9JA = 60.0·C/W 


9JC = 18.O"C/W 


I 


82750LV 
Technical Specifications 
•• 
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rectly 
loaded 
into VRAM 
through 
the parallel 
port; 
(4) It prioritizes 
CD-ROM 
and video capture 
DMA re- 
quests and implements 
the DVI protocols 
to execute 
those 
requests; 
(5) It converts 
the time-multiplexed 
addresses 
on the DVI Data Bus to a separate 
Ad- 
dress Bus for use by the other DVI Bus components. 


The 
following 
figure 
shows 
a simplified 
block 
dia- 
gram of the VSCGA gate array interconnections 
in a 
typical 
DVI system. The DVI Bus and the VBUS con- 


tory operations 
are handled 
under 
software 
control 
but the main task of transferring 
data is automated 
by a state machine 
contained 
within the array. 


The Video Capture 
Bus uses the parallel data port of 
the VRAM to introduce 
video data into the DVI sys- 
tem. This is a departure 
from earlier systems 
which 
used the serial port for this function 
and was done to 
isolate the capture 
video from the DB display video, 
which also requires 
the serial VRAM data. 


VSCGA 
------ 
Bus 
Interfaces 
------ 
.1 Address 
1 
VRAM 
.. 
Mux 
I 
1 
I 
Interface 
VRAM 
------ 


SCSI 
SCSI Bus 
, 
CD ROM 
1 
Interface 
, 
SCSI 
Interface 
------ 
Capture 
Capture 
Bus 
, 
Capture 
I 
Control 
Interface 


" 
Subsystem 


DVI Bus 
VBUS 


NOTE: 
Manufactured 
and tested 
for Intel by Texas 
Instruments 
in accordance 
with Texas 
Instruments 
intemal 
standards. 


241345-1 
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1.0 
PIN DESCRIPTION 


1.1 Plnout 


82750LV 


82750LV 


.. 
12 
12 


13 


13 


13 


BRSTACk- 
:J 1 


DelK 
:J' 
RECSU· :JI 


RASO_ 
• 
VauS3 
RAS1. 


• 
CS2fLG 
RAS2. 


• 
BRST~ 
RAS3. 
0 
YVUS[L 
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, 


t 
.nW 
CAS1 
• 
, 
, 
[VDLOOO 
CAS2. 
, 


O[ 
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, 
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WUX 
;~ 


MXTfST. 
... 
, 
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, 
'e 
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m 
, 
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241345-2 
I 


""""z 
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• 


82750LV 
intel~ 


Vcc 
GNO 
VA6 
VA7 
VA8 
VA9 
VA10 
VA11 
SENO# 
SEN1# 
SEN2# 
SEN3# 
GNO 
WE# 
TROE# 
CASO# 
CAS1# 
CAS2# 
CAS3# 
GNO 
VCC 
V1CLK 
RASO# 
RAS1# 
RAS2# 
RAS3# 
VA23 
VA22 
VA21 
VA20 
IoIUX 
VA19 
VA18 
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VA14 
VA13 
VCC 
VA12 


ORVEN 
NC 
101031 
101030 
101029 
101028 
101027 
101026 
101025 
101024 
GNO 
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101018 
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82750LV - VSCGA 
TOP VIEW 


~N~.~~~~~O-N~.~~~~~O-N~.~~~~~O-N~.~~~~~O 
......... ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 


Figure 1·1_ 82750LV 
Plnout 


I 


infel~ 


Pin Name 
Location 
ACK# 
66 


ACKA# 
67 


ATN# 
64 
ATNA# 
65 


BEO# 
145 


BEU 
146 


BEU 
147 
BE3# 
148 


BRSTACK# 
134 
BRSTRQ# 
129 
BSY# 
75 
BUSEN# 
139 
CASO# 
16 
CAS 1# 
17 
CAS2# 
18 
CAS3# 
19 
CD# 
77 
CHANREQ# 
125 
CS2FLG 
128 
DBO 
42 
DB1 
44 


DB2 
46 


DB3 
48 
DB4 
52 
DB5 
54 
DB6 
56 


DB7 
58 
DBAO 
43 


DBA1 
45 
DBA2 
47 
DBA3 
49 
DBA4 
53 
DBA5 
55 
DBA6 
57 
DBA7 
59 
DBP 
62 


DBPA 
63 


DCLK 
133 
DRVEN 
120 
DSTRB# 
137 
I 


Table 1-1.Pin Cross Reference 
by Pin Name 


Pin Name 
Location 
EVEN_ODD 
132 


.GAVALEN 
144 


GND 
2 


GND 
13 


GND 
20 


GND 
41 


GND 
51 


GND 
61 


GND 
71 


GND 
82 
GND 
91 


GND 
100 
GND 
110 
GND 
121 
GND 
141 
HTIM 
131 
INT# 
124 
10# 
79 
MOO 
90 
MD1 
89 
MD2 
88 


MD3 
87 
MD4 
86 
MD5 
85 
MD6 
84 


MD7 
83 


MD8 
92 


MD9 
93 
MD10 
94 
MD11 
95 
MD12 
96 
MD13 
97 
MD14 
98 
MD15 
99 
MD16 
102 
MD17 
103 


MD18 
104 
MD19 
105 
MD20 
106 
MD21 
107 


Pin Name 
Location 


MD22 
108 


MD23 
109 


MD24 
111 


MD25 
112 


MD26 
113 


MD27 
114 


MD28 
115 


MD29 
116 


MD30 
117 


MD31 
118 
MRDY# 
142 
MREQ# 
150 
MSG# 
76 


MSTRB# 
138 


MUX 
31 


NC 
80 
NC 
119 


NC 
136 


NC 
160 
NXTFST# 
151 
OR_MEM 
127 
RASO# 
23 
RAS1# 
24 
RAS2# 
25 
RAS3# 
26 
REGSEL# 
135 
REQ# 
78 
RFSH# 
153 
RSTIN# 
72 
SEL# 
73 
SELA# 
74 
SENO# 
9 


SEN1# 
10 
SEN2# 
11 
SEN3# 
12 
SRST# 
68 
SRSTA# 
69 
SYSRST# 
126 
TRNFR# 
152 
TROE# 
15 


82750LV 


Pin Name 
Location 


V1CLK 
22 


VAO 
154 


VA1 
155 


VA2 
156 


VA3 
157 


VA4 
158 


VA5 
159 


VA6 
3 


VA7 
4 


VA8 
5 
VA9 
6 
VA10 
7 


VA11 
8 
VA12 
40 


VA13 
38 


VA14 
37 


VA15 
36 
VA16 
35 
VA17 
34 


VA18 
33 
VA19 
32 


VA20 
30 
VA21 
29 
VA22 
28 


VA23 
27 


VBUS3 
149 


Vcc 
1 


Vcc 
21 


Vcc 
39 


Vcc 
50 


Vcc 
60 


Vcc 
70 


Vcc 
81 


Vcc 
101 


Vcc 
122 


Vcc 
140 
VSCSEL# 
123 
VWE# 
143 
WE# 
14 
YVUSEL 
130S 
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82750LV 


Pin 
Location 
Name 


1 
VCC 
2 
GND 


3 
VA6 


4 
VA7 


5 
VA8 


6 
VA9 
7 
VA10 


8 
VA11 


9 
SENO# 


10 
SEN1# 


11 
SENU 
12 
SEN3# 


13 
GND 


14 
WE# 


15 
TROE# 


16 
CASO# 


17 
CAS1# 


18 
CAS2# 


19 
CAS3# 


20 
GND 


21 
VCC 
22 
V1CLK 


23 
RASO# 


24 
RAS1# 


25 
RAS2# 


26 
RAS3# 


27 
VA23 
28 
VA22 


29 
VA21 


30 
VA20 
31 
MUX 


32 
VA19 


33 
VA18 
34 
VA17 


35 
VA16 
36 
VA15 
37 
VA14 
38 
VA13 
39 
Vcc 
40 
VA12 
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Table 1-2. Pin Cross Reference 
by Pin Number 


Pin 
Location 
Name 


41 
GND 


42 
DBO 


43 
DBAO 
44 
DB1 


45 
DBA1 


46 
DB2 


47 
DBA2 


48 
DB3 


49 
DBA3 


50 
VCC 
51 
GND 


52 
DB4 


53 
DBA4 


54 
DB5 


55 
DBA5 


56 
DB6 
57 
DBA6 


58 
DB7 


59 
DBA7 


60 
Vcc 
61 
GND 


62 
DBP 
63 
DBPA 


64 
ATN# 


65 
ATNA# 


66 
ACK# 


67 
ACKA# 


68 
SRST# 


69 
SRSTA# 


70 
VCC 
71 
GND 


72 
RSTIN# 


73 
SEL# 


74 
SELA# 


75 
BSY# 


76 
MSG# 


77 
CD# 
78 
REQ# 


79 
10# 


80 
NC 


Pin 
Location 
Name 


81 
Vcc 
82 
GND 


83 
MD7 


84 
MD6 


85 
MD5 


86 
MD4 


87 
MD3 


88 
MD2 


89 
MD1 


90 
MOO 


91 
GND 


92 
MD8 


93 
MD9 


94 
MD10 


95 
MD11 


96 
MD12 
97 
MD13 


98 
MD14 


99 
MD15 


100 
GND 
101 
VCC 
102 
MD16 
103 
MD17 


104 
MD18 


105 
MD19 
106 
MD20 


107 
MD21 
108 
MD22 


109 
MD23 


110 
GND 


111 
MD24 
112 
MD25 
113 
MD26 
114 
MD27 
115 
MD28 


116 
MD29 


117 
MD30 
118 
MD31 
119 
NC 
120 
DRVEN 
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Pin 
Location 
Name 


121 
GND 


122 
VCC 
123 
VSCSEL# 


124 
INT# 


125 
CHANREQ# 


126 
SYSRST# 


127 
OR_MEM 


128 
CS2FLG 


129 
BRSTRQ# 


130 
YVUSEL 


131 
HTIM 


132 
EVEN_ODD 


133 
DCLK 


134 
BRSTACK# 


135 
REGSEL# 


136 
NC 


137 
DSTRB# 


138 
MSTRB# 


139 
BUSEN# 


140 
Vcc 
141 
GND 


142 
MRDY# 


143 
VWE# 


144 
GAVALEN 


145 
BEO# 


146 
BE1# 


147 
BE2# 


148 
BE3# 


149 
VBUS3 
150 
MREQ# 


151 
NXTFST# 


152 
TRNFR# 


153 
RFSH# 


154 
VAO 


155 
VA1 


156 
VA2 


157 
VA3 
158 
VA4 


159 
VA5SS 
160 
NC 
I 
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1.2 Pin Descriptions 


1.2.1 
DVI BUS PIN DESCRIPTIONS 


Symbol 
Type 
Name and Function 


BE[3:0]11 
BI 
BYTE ENABLES 
3•.0: These signals are used to decode 
the CAS signals on the 
VRAM interface. 
These lines are latched at the leading edge of each CAS cycle. 


CHANRQII 
0 
DMA CHANNEL 
REQUEST 
SIGNAL: 
This signal is output to the HIGA to indicate 
a request to perform 
a memory operation. 
After VSCSEl 
11 and BUSEN 11 are 
asserted 
CHANRQII 
will be negated. 


BUSENII 
I 
BUS ENABLE 
SIGNAL: 
This signal is output by the 82750PB 
in response 
to a 
CHANRQII 
indicating 
that access to the DVI Bus has been granted. 


MSTRBII 
0 
MEMORY 
STROBE 
SIGNAL: 
A memory strobe pulse is generated 
for each non- 
82750PB 
cycle and is provided 
as a data valid strobe for VRAM and register 
access. 


MRDYII 
0 
MEMORY 
READY SIGNAL: 
This signal is an output to the 82750PB 
indicating 
the 
end of a memory cycle. 


DSTRBII 
0 
DEVICE STROBE 
SIGNAL: 
A pulse is generated 
on this signal whenever 
an 
access is made to one of the eight DVI devices. This signal is provided 
as a data 
valid strobe for register accesses. 


DRVEN 
I 
DRIVE ENABLE 
SIGNAL: 
This signal is provided for testing purposes 
only. 
Bringing this signal low will switch all output and bidirectional 
pins their high 
impedance 
state. 


GAVAlEN 
I 
GATE ARRAY 
VRAM ADDRESS 
LATCH 
ENABLE: 
This signal is driven by the 
HIGA. The rising edge of the first V1 ClK 
following 
the negation 
of GAVAlEN 
latches address information 
from the MD[31 :0] Bus onto the VA[23:0] 
Bus. 


INTII 
0 
INTERRUPT 
SIGNAL: 
This signal is an output from the VSCGA that indicates 
to 
the Host that a service is being requested. 


MD[31:0] 
BI 
MEMORY 
DATA BUS: This bus is the data path between 
the Host and the 
VSCGA. the VSCGA and VRAM or other DVI devices. 


SYSRSTII 
I 
MASTER 
RESET SIGNAL: 
Asserting 
this signal will reset all of the internal 
logic of 


the VSCGA and should only be used for power-on 
initialization. 


V1ClK 
I 
MASTER 
CLOCK 
SIGNAL: 
This signal is driven by the KAGA and is used to drive 


the internal state machine 
and control 
logic in the VSCGA. 


VA[23:0] 
O/BI 
VRAM ADDRESS 
23 ••0: These signals are driven by the VSCGA for all non- 
82750PB 
cycles. They are driven active when BUSEN 11 is asserted 
by the 
82750PB. 
VA2·4 and VA 17·23 are bidirectional 
signals that are looped back and 
used as inputs to the VSCGA or are driven by the 82750PB 
and used by the CD· 
ROM subsystem 
logic. All other signals on this bus are outputs. 


VSCSElII 
I 
VRAM SCSI CAPTURE 
SELECTION 
SIGNAL: 
This signal is an input from the 
HIGA that indicates 
that the VSCGA is the next request to be serviced. 


VWEII 
BI 
VRAM WRITE 
ENABLE 
SIGNAL: 
This signal is driven by the requesting 
DVI 
device during DVI device and VRAM transfers 
to indicate the direction 
of the 
transfer. 
A high indicates 
a read and a low indicates 
a write. 


OR_MEM 
0 
OFL-MEM: 
This signal is an output that is used for diagnostic 
purposes 
only and 
should be left unconnected 
for normal operation. 


I 
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1.2.2 
SCSI INTERFACE 
PIN DESCRIPTIONS 


Symbol 
Type 
Name and Function 


CD# 
I 
CONTROL/DATA 
SIGNAL: This signal is an input to the VSCGA that indicates 
whether 
control or data information 
is present on the SCSI bus. A high indicates 
control 
information 
is present and a low indicates 
data is present. 


10# 
I 
INPUT/OUTPUT 
SIGNAL: This signal is an input to the VSCGA that controls 
the 
direction 
of data movement. 
A high indicates 
data input to the VSCGA and a low 
indicates 
data output from the VSCGA. 


MSG# 
I 
MESSAGE 
SIGNAL: This signal is an input to the VSCGA that is used during the 
message 
phase of a SCSI transfer. 


REO# 
I 
REQUEST 
SIGNAL: This signal is an input to the VSCGA that indicates 
a request 


for a REOI ACK data transfer 
handshake. 


BSY# 
I 
BUSY SIGNAL: This is an OR-tied input signal to the VSCGA that indicates 
whether 
the SCSI bus is being used. 


RSTIN# 
I 
RESET IN SIGNAL: This signal is SCSI RST input to the VSCGA. 


SRST# 
0 
SCSI RESET SIGNAL: These two signals are externally 
tied together 
to meet the 
SRSTA# 
0 
48 mA current drive requirement 
of the SCSI standard. 
Together 
these two signals 
form the RST output from the VSCGA. 


ACK# 
0 
ACKNOWLEDGE 
SIGNAL: These two signals are externally 
tied together 
to meet 


ACKA# 
0 
the 48 mA current drive requirement 
of the SCSI standard. 
Together 
these two 
signals form the acknowledge 
output from the VSCGA. 


ATN# 
0 
ATTENTION 
SIGNAL: These two signals are externally 
tied together 
to meet the 
ATNA# 
0 
48 mA current drive requirement 
of the SCSI standard. 
Together 
these two signals 
form the attention 
output from the VSCGA. 


SEL# 
BI 
SELECT SIGNAL: These two signals are externally 
tied together 
to meet the 
SELA# 
0 
48 mA current drive requirement 
of the SCSI standard. 
Together 
these two signals 
form the target select output from the VSCGA. SEL# 
can also be used by a target 


as the reselect 
input. 


DB[7:0,P] 
BI 
DATA BUS SIGNALS: 
Each of the corresponding 
bits of these two busses are 
DBA[7:0,P] 
0 
externally 
tied together 
to meet the 48 mA current drive requirement 
of the SCSI 
standard. 
Together 
these two busses form the eight-bit data plus parity bus. Data 
parity is odd. When outputting 
the VSCGA drives both busses. When inputting 
only 


DB[7:0,P] 
is used. 
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1.2.3 
VRAM 
INTERFACE 
PIN DESCRIPTIONS 


Symbol 
Type 
Name and Funcllon 


RAS[3:0] 
11 
0 
VRAM ROW ADDRESS 
SELECT 
SIGNALS: 
These signals are used to select the 
VRAM row address. The RAS lines are not completely 
decoded. 


CAS [3:0] 11 
0 
VRAM COLUMN 
ADDRESS 
SELECT 
SIGNAL~;: These signals are used to select 
the VRAM column address. The CAS lines are decoded 
by the Byte Enable lines. 


WEll 
0 
WRITE 
ENABLE 
SIGNAL: 
This write enable pin is used to allow for Write-per-bit 


operations 
and Write Enable. The DVI bus signa 
VWE 11 is latched at the 
beginning 
of a memory cycle and determines 
thEIstate of WE 11. If VWE 11 is 
latched 
low (active) then WEll 
will also be asserted 
low. 


TROEII 
0 
OUTPUT 
ENABLE 
SIGNAL: 
This signal is used to allow for transfer 
operations 
and output enable. The DVI Bus signal VWE 11 is latched at the beginning 
of a 
memory cycle and determines 
the state of TROE: 11. If VWE 11 is latched 
high 
(inactive) 
then TROE 11 will be asserted 
low. 


MUX 
0 
MUX SIGNAL: 
This signal transitions 
high to low on the falling edge of the system 
clock after RAS to signal a switch to the column 
address. 


MREQII 
I 
MEMORY 
REQUEST 
SIGNAL: 
This signal is an nput from the 82750PB 
that 
indicates 
the beginning 
of a memory cycle. 


NXTFSTII 
I 
NEXT-FAST 
ACCESS 
SIGNAL: 
This signal is ar input from the 82750PB 
that 
indicates 
that a fast page memory cycle may be clone. 


TRNFRII 
I 
TRANSFER 
SIGNAL: 
This signal is an input to the VSCGA indicating 
a transfer 
cycle. 


RFSHII 
I 
REFRESH 
SIGNAL: 
This signal is an input to the VSCGA that indicates 
a memory 
refresh cycle. 


SEN[3:0] 
11 
0 
SERIAL 
ENABLE 
SIGNALS: 
These signals are used to select the serial port on 
the 82750DB 
that is being addressed 
during a transfer, 


1.2.4 
CAPTURE 
INTERFACE 
PIN DESCRIPTIONS 


Symbol 
Type 
Name and Func lion 


VBUS3 
I 
VBUS3 SIGNAL: 
This signal indicates 
to the VBCGA that a transfer 
is taking 
place on the video bus. 


BRSTACKII 
0 
BURST ACKNOWLEDGE 
SIGNAL: 
This signa 
is output from the VSCGA in 
response 
to a BRSTRQII 
indicating 
to the Capture subsystem 
that it may drive 
the current data word in its FIFO onto the memory data lines. 


BRSTRQII 
I 
BURST REQUEST 
SIGNAL: 
This signal is an input from the Capture subsystem 


that indicates 
that there is captured 
video data in a FIFO ready to be transferred. 


DCLK 
0 
OAT A CLOCK 
SIGNAL: 
This signal- is driven by the VSCGA to indicate that a 
video data word has been stored to VRAM. DCLK is a free-running 
clock. 


HTIM 
I 
HORIZONTAL 
TIME SIGNAL: 
This signal carries an active high pulse from the 
Capture subsystem 
for each horizontal 
line. 


EVEN_ODD 
I 
EVEN/ODD 
SIGNAL: 
This signal is an input from the Capture subsystem 
that 


indicates 
to the VSCGA which field is currently 
being captured. 
Low indicates 
an 
even field and high indicates 
an odd field. 


I 
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1.2.4 
CAPTURE 
INTERFACE 
PIN DESCRIPTIONS 
(Continued) 


Symbol 
Type 
Name and Function 


YVUSEL 
I 
YUV SELECT 
SIGNAL: 
This signal is an input from the Capture subsystem 
that 
indicates 
the type of data to be transferred 
in a video data burst cycle. A Y burst is 
indicated 
when this signal is low and a UV burst is indicated 
when it is high. 


REGSEL# 
0 
REGISTER 
SELECT 
SIGNAL: 
This signal is an output from the VSCGA to the 
Capture subsystem 
that indicates 
that a register access is being performed. 


CS2FLG 
I 
CAPTURE 
FLAG SIGNAL: 
This signal is active high and is driven by the Capture 
subsystem 
to indicate when register read data is ready or when register write data 
has been accepted. 


2.0 
INTERNAL ARCHITECTURE 
the SCSI interface 
or from the Host Interface 
Gate 
Array. Once In memory 
it can be processed 
by the 
Intel 82750PB 
Pixel Processor 
(PB). The data can 
then be displayed 
by shifting 
it out of the VRAM seri- 
al port to the 
Intel 82750 
Display 
Processor 
(DB). 
Each of these operations 
is supported 
by the VRAM 
and Capture portion of the VSCGA gate array whose 
internal 
architecture 
is shown 
in Figure 
2-1. Before 
each of its component 
blocks 
is examined, 
the func- 
tionality 
of the VSCGA 
chip will be described. 


2.1 
VRAM and Capture 


2.1.1 
OVERVIEW 


Video 
data 
typically 
enters 
the 
VRAM 
through 
its 
parallel 
port, 
originating 
from 
the 
Capture 
Module, 


241345-4 


REGISTER 
t---~ 
AND VRAt.4 
I/F 
REGISTER 
t----1AND 
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CONTROL 


VSCGA 
REGISTERS 


t.4EWORY 
L-----------tI 
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REGISTER 


Figure 2·1. VRAM and Capture 
Gate Array 
Architecture 
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One 
function 
of the 
VSCGA 
chip 
is to 
drive 
the 
VRAM Address 
lines. It does this for all DVI Bus cy- 
cles where 
the Intel Pixel Processor 
is not the bus 
master. When addresses 
from other DVI devices are 
placed 
on 
the 
time-multiplexed 
DVI 
Data 
Bus, 
VSCGA 
latches 
the information 
and uses it to drive 
the VRAM Address 
Bus. The address 
data can also 
originate 
internally 
from 
video 
capture 
write 
cycles 
and SCSI data acquisition. 
Driving VRAM addresses 
is hardware-efficient 
since 
other 
chips 
driving 
the 
DVI 
Bus 
do 
not 
have 
to 
implement 
the 
entire 
VA(23:0) 
Bus as well as the MD(31:0) 
Bus. 


The 
VSCGA 
can 
support 
the 
accessing 
of up to 
4 MB of memory 
in 1 MB increments 
using 256K x 
4 VRAM 
chips, or up to 16 ME! of memory 
in 4 MB 
increments 
using 1 MB x 4 VRAM chips. Each Mega- 
byte of the lower density chips consists 
of a integrat- 
ed circuits making up the 32-bit wide data path. Chip 
selection 
is controlled 
by CAS on each chip which is 
in turn controlled 
by the byte enable 
lines, BE(3:0), 
on the 
DVI Bus. 
Each 
4 Megabytes 
of the 
higher 
density 
memory 
is similarly 
configured. 
The VSCGA 
chip always 
assumes 
that the maximum 
amount 
of 
memory 
is installed 
and generates 
VRAM cycles ac- 
cordingly. 
If this is not the case 
a software 
check 
may be in order to determine 
how much memory 
is 
available. 


The VSCGA 
monitors 
the 
DVI Bus for VRAM 
and 
DVI 
Bus 
Register 
accesses, 
and 
when 
detected, 
generates 
the timing and control 
to execute 
the cy- 
cles. The following 
operations 
are supported: 


Single 
VRAM 
Read-Write 
cycle. 
Either 
the 
Intel 
Pixel Processor 
or any other DVI Device can read or 
write to a single 32-bit VRAM 
location. 
This also in- 
cludes 
the ability to address 
16- and a-bit locations 
using BE(3:0). 


Dual 
VRAM 
Read-Write 
cycle. 
Either 
PB or any 
other 
DVI Device can read or write to two consecu- 
tive 32-bit VRAM locations 
(where the first word has 
an even 
word 
address) 
using 
page 
mode 
access. 


This is more efficient 
than a single cycle access 
be- 
cause less time is spent acquiring 
the DVI Bus from 
PB, asserting 
RAS and waiting 
between 
accesses. 


VRAM 
Transfer 
cycle. 
Transfer 
cycles 
are used to 
load the VRAM 
serial shift register 
with data for the 
Intel Display Processor. 
DB makes requests 
for vari- 
ous types 
of data transfers 
of PB, which 
then 
initi- 
ates those 
cycles. 


VRAM 
Refresh 
cycle. 
Refresh 
cycles 
are initiated 
by DB and executed 
by PB. Every output 
horizontal 
line time, DB issues a refresh request to PB over the 
VBUS. For each request. 
up to 15 refresh cycles are 
initiated 
by the Pixel Processor. 
During 
refresh 
cy- 
cles all banks of VRAM 
are addressed. 
I 
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Single 
DVI Register 
Read-Write 
cycle. 
Either the 
Intel Pixel Processor 
or any other 
DVI Device 
can 
read or write to a single 32-bit DVI Register 
location. 
This also lncludes the ability to address 
16- and a-bit 
locations 
using BE(3:0). 


Capture 
cycle. 
Bursts of captured 
video are stored 
in VRAM 
under 
DMA control. 
The 
length 
of these 
bursts 
is not us Jally limited 
by the gate array. 
Ad- 
dress pointers 
stored in the gate array are automati- 
cally updated as the data is processed. 
A write mask 
can be employed 
which 
prevents 
the writing 
of par- 
ticular 
areas of ':he memory. 
Additionally, 
write-per- 
• 
bit operations 
are supported 
which prevent 
the writ- 
ing of particular 
Ilit positions 
and assist the Intel Dis- 
play Processor 
in the mixing of graphics 
and video 
data. 


Burst 
Read cycle. 
Bursts of VRAM data are loaded 
into the Mask RJIM under DMA control. 
Either 2, 4 or 
a page-mode 
cycles 
are executed 
before 
the 
DVI 
Bus is relinquish sd. 


Video capture 
operations 
are supported 
in conjunc- 
tion with a video Capture 
Module 
containing 
a com- 
plete digitizing 
interface 
to analog YVU or RGB data 
in interlaced 
NT3C 
or PAL format. 
The video 
Cap- 
ture Module 
is relatively 
independent 
of the VSCGA, 
and, once 
programmed 
using the Capture 
Module 
Command 
and 
Status 
Register, 
will 
continuously 
generate 
digitized 
fields 
of video 
data 
and 
output 
them to the gate array without 
further 
commands. 


Captured 
video 
data 
is 
transferred 
to 
VRAM 
in 
bursts through 
a series of Capture 
Module 
requests 
to VSCGA for USH of the DVI Data Bus. Each request 
is passed 
on to the 
Host 
Interface 
Gate 
Array 
for 
prioritization, 
anc when bus access 
is granted, 
a se- 
quence 
of only one type of data is written 
to memo- 
ry. For YVU data with subsampled 
chroma, 
this 
is 
either 
Y data or a combination 
of VU data, 
select- 
able 
by the Capture 
Module 
each 
burst. 
For RGB 
and 24-bit YVU data, each component 
is sent for a 
minimum 
of an ontire 
field. 
These 
transfers 
are a 
high DVI priority 
and are interruptible 
only by a DB 
VRAM data transfer 
request. 


During video capture, 
the gate array monitors 
video 
timing 
signals 
from 
the Capture 
Module 
to control 
where 
data is stored 
in memory. 
VSCGA 
is able to 
keep data from successive 
fields 
and YVU compo- 
nents from within a field in separate 
areas of memo- 
ry as programmed 
by the 
Intel 
Pixel Processor 
or 
other 
DVI Bus Device. This operation 
is handled 
by 
two sets of Shadow 
Registers, 
one for even and one 
for odd fields, one set of Working 
Registers, 
a Point- 
er Counter, 
a VU Pitch Register 
and an Adder. 
Each 
set of Shadow 
Registers 
and 
the 
set of Working 
Registers 
contains 
one memory 
pointer 
for Y data, 
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one for VU data and one for Input Mask data. Imme- 
diately before even field processing begins, the pre- 
viously programmed Even Shadow Registers are 
used to load data into their respective Working Reg- 
isters. The information transferred consists of start- 
ing locations in VRAM in which to deposit Y and VU 
data and from which to read mask information. Dur- 
ing the field, as various data and timing signals are 
received from the Capture Module, the Pointer 
Counter selectively uses the Working Registers to 
generate updated addresses, which are used for 
DMA accesses to VRAM and then stored back in the 
respective registers. Before the completion of the 
even field, it is necessary to program the set of Odd 
Shadow Registers, which will automatically be used 
at the beginning of the next field. It is also possible 
to program the gate array to keep V and U data 
separate in memory. VSCGA accomplishes this by 
adding a fixed offset from the VU Pitch Register to 
the Pointer address only while processing U data. 


Captured data can be transferred to memory in six 
different formats. They are Y8, YVU9, YVU10, 
YVU12, YVU24 and RGB24. Y8 mode contains only 
8-bit luminance data. In YVU9 mode V and U data 
are subsampled .4:1 in both horizontal and vertical 
directions. In YVU10 mode, V and U data are sub- 
sampled 4:1 in the horizontal and 2:1 in the vertical 
direction. In YVU12 mode, V and U data are sub- 
sampled 4:1 in the horizontal and sampled each line 
vertically. The 24-bit modes are used for higher qual- 
ity still images. Capture' is accomplished over six 
fields. During the first two, R or Y data is stored. 
During the second two, G or V data is captured and 
during the third two, B or U data is transferred to 
memory. In these modes, the Odd and Even Shad- 
ow Registers alternate each field and can be loaded 
in such a way as to keep the three components sep- 
arated in VRAM. While the first three display formats 


are "VRAM-compatible" with the Intel Display Proc- 
essor, the last three cannot be directly displayed 
without first reformatting them in VRAM. 


Interlaced fields of data can easily be interleaved 
into complete frames as they are written into VRAM 
by making the proper choice of register parameters. 
For example, if a 640 x 480 (280H x 1 EOH)resolu- 
tion image with VU subsampled by 4 in each direc- 
tion is captured (YVU9 mode), setting the Y horizon- 
tal resolution for 2048 (800H) pixels and offsetting 
the odd and even fields by 1024 (400H) pixels will 
integrate the two luminance scans. Setting the com- 
bined chrominance horizontal resolution for 1024 
(400H), offsetting the U by 256 (100H) and offsetting 
the odd and even fields by 512 (200H) will integrate 
the two components of the two chrominance scans. 
A memory map of this structure is illustrated in Fig- 
ure 2-2. 


The foregoing discussion illustrates the capability of 
the VSCGA gate array. In actual system operation, 
the capture of interlaced video alternates between 
Y8 and YVU10 modes on successive fields with 
chrominance data captured only every other field. 
This results in VU data being temporally as well as 
spatially subsampled with only a minor effect on pic- 
ture quality. 


2.1.2 REGISTER CONFIGURATION 


There are ten memdry-mapped DVI Registers in the 
VRAM and Capture portion of the VSCGA array. Ta- 
ble 2-1 shows a listing of their names, addresses, 
lengths and accessibility wom the DVI Bus. All are 
located in the range from FCOOOOHto FC0021H, 
conforming to a DVI Device ID of 6. Each can be 
accessed with either byte, word or long word opera- 
tions. 


Table 2-1. VRAM and Capture Register Configuration 


Register 
DVI 
DVI 
Regl.ter 
Register Description 
Address 
R/W 
Length 


Y_EVEN 
FCOOOOH 
R/W 
32 Bits 
Even Field Y Shadow Register 


VU_EVEN 
FCOO04H 
R/W 
32 Bits 
Even Field VU Shadow Register 


MASLEVEN 
FCOO08H 
R/W 
32 Bits 
Even Field Mask Shadow Register 


V_ODD 
FCOOOCH 
R/W 
32 Bits 
Odd Field Y Shadow Register 


VU_ODD 
FC0010H 
R/W 
32 Bits 
Odd Field VU Shadow Register 


MASLODD 
FC0014H 
R/W 
32 Bits 
Odd Field Mask Shadow Register 


VU_PITCH 
FC0018H 
R/W 
8 Bits 
V-U Interleave Register 


CAP_CST 
FC001AH 
R/W 
16 Bits 
Capture Command and Status 


MEM_CMD 
FC001CH 
W 
32 Bits 
Memory Command 


CM_CST 
FC0020H 
R/W 
16 Bits 
Capture Module CMD and Status 
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Figure 2-2. YVU Memory 
Map 


2.1.3 
MEMORY 
COMMAND 
REGISTER 


This write-only 
register 
is used to program the timing 
parameters 
used in memory 
and register 
accesses 
to accommodate 
different 
speed 
VRAMs 
and vari- 
ous system 
clock 
rates. It also determines 
whether 
the 
1 or 4 Megabyte 
VRAM 
addressing 
scheme 
is 
implemented. 
Finally, it stores the write-per-bit 
mask 
used to selectively 
write captured 
video data. 


As the Pointer Counter 
develops 
addresses 
to write 
captured 
video 
data 
to 
memory, 
these 
addresses 
are 
slrnultaneously 
used 
as pointers 
into 
a Mask 
RAM which 
mal' be used to selectively 
inhibit these 
write cycles. 
Each bit of this 24-byte 
RAM controls 
the write access to a single V and U pixel and to the 
corresponding 
4 x 4 or a 4 x 2 array of Y pixels, for 
the 
YVU9 
and 
YVU10 
modes 
respectively. 
If this 
mode is enabled, 
data will not be written 
to masked 
locations 
even 
:hough DVI Bus cycles 
will be con- 
sumed 
in rnaklrq 
the data 
available 
at the 
VRAM 
inputs. The nurr ber of bits of RAM that are used is 
dependent 
on t le 
horizontal 
resolution 
of the cap- 
tured 
image. 
For example, 
an image with Y resolu- 
tion 
of 256 x 2 ~O requires 
only 
256 
-i- 
4 or eight 


2.1.4 
REGISTER 
AND VRAM 
INTERFACE 


This logic block 
accepts 
the timing 
parameters 
and 
write-per-blt 
mask 
information 
from 
the 
Memory 
Command 
Register 
and write mask information 
from 
the Mask RAM and generates 
the signals necessary 
to access 
VRAM and DVI registers. 


I 
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bytes, while one with a horizontal resolution of 768 
needs the full 24-byte storage. When this RAM is in 
use, it is automatically loaded at the beginning of the 
first video line using the appropriate Odd or Even 
Shadow Mask Register. Its data is reused until it is 
updated at the beginning of every two or four lines, 
depending on the vertical chroma resolution. 


2.1.6 
SHADOW 
REGISTERS 
AND MUX 


These six read-write registers are used to store two 
sets of initial pointers to VRAM for captured Y and 
VU video and mask data: one set for even and one 
set for odd fields. They.also store information indi- 
cating the number of bytes of memory to allocate for 
each horizontal line of data. This number is used to 
indicate to the Pointer Counter how much to incre- 
ment the Working Register at the end of each line 
and is always a power of two regardless of the actu- 
al resolution. These registers also contain additional 
information which determines other operational de- 
tails of their use. 


The Mux allows data from the DVI Data Bus to be 
loaded into the Shadow Registers and, in one mode 
of operation, permits the Shadow Registers to be 
continuously updated from the Pointer Counter dur- 
ing a field. This allows the DVI Bus to monitor the 
state of various address counts in real time since 
neither the Working Counters nor the Pointer Coun- 
ter are accessible to that Bus. 


2.1.7 
WORKING 
REGISTERS 


Each of these three registers are loaded, one at a 
time, at the beginning of each field of captured vid- 
eo, from its respective Even or Odd Shadow Regis- 
ter. Each 32-bit quantity is passed through the Point- 
er Counter, which is transparent for this operation. 
(Refer to Figure 2-1.) Depending on the program- 
ming mode, the pixel portion of the count may be set 
to 0 as it is loaded. The Y data is transferred first, 
followed by VU and finally by the Input Mask data. If 
the input mask mode is selected, the Mask RAM is 
immediately loaded using the Input Mask Working 
Register as an address source, which is updated 
each DMA read cycle by the Pointer Counter. Only 
after the Mask RAM is loaded can burst requests 
from the Capture Module be processed. As each re- 
quest is received the data type is specified so that 
the appropriate Working Register (Y or VU) is loaded 
into the Pointer Counter, used as a VRAM write ad- 
dress and updated after each DMA write cycle. At 
the end of each burst, the Working Register contains 
the location of the next pixel to be written. This value 
is ready to be reloaded into the Pointer Counter the 
next time the same type of burst data is requested 
during that line. At the end of each horizontal line, 
the three Wor~ing Register outputs are sequentially 
passed through the Pointer Counter and their line 
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counts are selectively incremented before being re- 
stored in those same registers. The decision to in- 
crement each register depends on the status of an 
activity monitor associated with each Working Reg- 
ister. The Y and VU activity monitors detect the cap- 
ture of the respective data type during each horizon- 
tal line. The Mask monitor detects the fetching of the 
mask data from VRAM during that time. Only when 
the monitor outputs are asserted are the Working 
Register line counts incremented. Depending on the 
programming mode, the pixel count may either be 
set to 0 or loaded with the corresponding bits from 
the respective Shadow Register. 


2.1.8 
POINTER 
COUNTER 


This is a flexible logic block that accepts and pro- 
cesses 32-bit inputs from the six Shadow Registers 
and the three Working Registers. It then processes 
those inputs and makes them available to those 
same registers and to the DVI Address Bus through 
the VU Pitch Register Adder and the Address Output 
Mux. The processing can be minimal, such as at the 
beginning of a field when the Shadow Registers are 
used to load the Working Registers; or it can be 
more substantial, such as during a horizontal line 
when pixel counts are incremented (by four for each 
4-byte read) or at the end of each horizontal line 
when the line count is incremented. The bit position 
in the 24-bit Counter where pixel count ends and line 
count begins is dependent on the horizontal resolu- 
tion, the video component, the amount of subsam- 
pling of that component and whether interleaving of 
the interlaced data is being performed. The bit posi- 
tion information is programmed into each Shadow 
Register and passes through the various processing 
loops along with the address. 


2.1.9 
VU INTERLEAVE 
REGISTER 
AND ADDER 


This register contains one byte of storage which, 
when enabled, allows the V and U data from sub- 
sampled chroma fields to be accumulated in distinct 
areas of VRAM. The VU data comes from the Cap- 
ture Module alternating between four bytes of V and 
four bytes of U data. The VU Interleave Register 
contents are multiplied by eight and then combined 
only with addresses for U data in the Adder (V ad- 
dresses pass through unmodified). The result is 
used to determine the VRAM location for the cap- 
tured data. Note that the eight bits of pitch are not 
sufficient to totally separate the U and V compo- 
nents in memory, but are enough to allow them to be 
placed side-by-side as shown in Figure 2-2. 


2.1.10 
DATA 
BUS MUX 


This block is a bidirectional interface to the DVI Data 
Bus from both the SCSI and the VRAM and Capture 
I 


portions 
of the VSCGA gate array. In this portion 
of 
the array it is used to read and write 
register 
data 
and to pass the time-multiplexed 
address data to the 
Address 
Latch. 


2.1.11 
ADDRESS 
LATCH 


This block latches the time-multiplexed 
address data 
from the DVI Data Bus, MD(31:0) 
and supplies 
it to 
the 
DVI Address 
Bus, VA(23:0). 
This 
allows 
other 
DVI 
Bus 
components 
to 
implement 
only 
the 
MD(31 :0) signals 
for 
output 
addressing 
and 
data 
transfers 
and the few VA bits they need to decode 
their own register 
addresses. 


2.1.12 
ADDRESS 
OUTPUT 
MUX 


This block 
selects 
between 
the Address 
Latch and 
the Pointer 
Counter 
(or Adder) 
outputs 
as a source 
for the DVI Address 
Bus. 


2.1.13 
CAPTURE 
COMMAND 
AND STATUS 
REGISTER 


This register 
controls 
several 
of the parameters 
for 
capturing 
data 
and allows 
the 
DVI Bus to monitor 
some 
of the control 
signals 
from the Capture 
Mod- 
ule. 


2.1.14 
CAPTURE 
MODULE 
COMMAND 
AND 
STATUS 
REGISTER 


This 
Register 
is 
located 
in the 
Capture 
Module. 
VSCGA provides 
the control 
to interface 
this register 
to the DVI Bus by decoding 
a bus strobe. 


2.2 
SCSI Interface 


2.2.1 
OVERVIEW 


This 
portion 
of VSCGA 
contains 
a DMA 
interface 
from the SCSI Bus to the DVI Bus. This capability 
is 
important 
because 
it provides 
a high-speed 
path for 
the input of CD-ROM 
data, a major source 
of com- 
pressed 
DVI images. 
The DVI Bus interface 
allows 
the operations 
preliminary 
to the data transfer 
to be 
set up by software, 
using a set of registers 
inside the 
gate array, but the DMA transfer 
itself can be auto- 


matically 
carried 
out by a State 
Machine 
within 
the 
array. 
Up to sixteen 
Megabytes 
of VRAM 
memory 
can be addressed 
by the DMA controller, 
although 
the topmost 
Megabyte 
is reserved 
for DVI Bus De- 


vices. 
The VSCGA 
implements 
a complete 
single- 
initiator, 
multiple-target, 
single-ended 
SCSI Bus with 
parity, capable 
of controlling 
up to 8 devices. 
Being 
a single initiator 
system 
implies that neither an Arbi- 
I 
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tration 
nor a REselection 
Phase is implemented 
on 
the SCSI Bus and results 
in a simplified 
protocol. 
In 
the rest of this 
specification 
the CD-ROM 
interface 
and operation 
1{ill be used 
as an example 
of the 
Target 
Device. 
~Iease keep 
in mind that 
the inter- 
face is general 
md can handle 
both read and write 
DMA operation! 
. For a complete 
description 
of the 
SCSI standard, 
'efer to the ANSI document 
X3.131. 


The VSCGA DN A Controller 
has the ability to gather 
discrete 
data bl ocks from 
the CD-ROM 
Subsystem 
and form them 
nto a continuous 
data stream 
or to 
split a contlnuoi 
s data stream into discrete 
blocks. A 11 
typical 
CD-ROIi 
data 
transfer 
works 
as 
follows: 


Once 
the 
prelilninary 
Target 
Device 
protocol 
has 
been completee 
(Target 
selected) 
and the location 
of the first Chaill 
Block in VRAM 
has been stored 
in 
the array, the DIAA Controller 
is enabled 
and fetches 
and internally 
stores this block of data. Each of the 
memory-contigL 
ous 
12-byte 
Chain 
Blocks 
consists 
of an initial byte address to load the incoming 
data, a 
byte count and control information. 
This information 
is used by the Controller 
to direct 
data output 
from 
the CD-ROM to one or more areas in VRAM. As the 
8-bit data is accumulated 
into 32-bit words and writ- 
ten into VRAM, 
the DMA Byte and Address 
Register 
counts 
are updated. This continues 
until byte count 
reaches 
0, at which time a new Chain Block is auto- 
matically 
fetched. 
The 
process 
completes 
when 
a 
terminator 
flag 
is found 
to 
be set 
in the 
fetched 
Chain Block. Once the CD-ROM 
completes 
its data 
transfer, 
(which 
may 
span 
more 
than 
one 
Chain 
Block operation I, the Target 
status 
is read followed 
by a message 
from the 
Target 
indicating 
that 
the 
read 
is comple1 e. Several 
registers 
in VSCGA 
are 
used to send cc mmands 
and messages 
out, accept 
status 
and 
inpL t messages 
and 
monitor 
the 
SCSI 
Bus and State' 'I.1achine operation. 
An internal 
Pat- 


tern Generator 
is also included 
for self-test. 


A detailed 
blocl 
diagram 
of the SCSI portion 
of the 
VSCGA gate an ay is shown 
in Figure 2-4. 


2.2.2 
CHAIN 
BLOCKS 


In order to perform 
a CD-ROM 
DMA transfer, 
a set 
of Chain Blocks 
exactly 
matching 
the retrieved 
data 
organization 
must be stored 
contiguously 
in VRAM. 
The format 
of each Chain Block 
is shown 
in Figure 
2-3. In the lowest' address 
is the byte count, which is 
a 24-bit 
number 
in two's 
complement 
format. 
The 
most significant 
byte of the word should be set to FF 
for future compatibility. 
The next higher address con- 
tains the the 24-bit Byte Address 
of the starting 
loca- 
tion for stored data. The most significant 
byte of this 
word should 
be O. The lowest 
byte of the next word 
contains 
control 
information, 
including 
whether 
this 
Chain Block is the last one in the transfer. 
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31 
23 
15 
7 
0 


I 


FF 


I 


Byte Count 


I 


0 
Byte Address 


0 
0 
0 
Control 


Figure 2-3. Chain Block Format 
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Figure 2-4. SCSI Gate Array Architecture 
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2.2.3 
REGISTER 
CONFIGURATION 


There are nine memory-mapped 
DVI Registers 
in the 
SCSI portion of the VSCGA array. Table 2-2 shows a 
listing of their names, addresses, 
lengths and acces- 
sibility from the DVI Bus. All are located 
in the range 
from F80000H 
to F80021 H, conforming 
to a DVI De- 
vice ID of 4. Each can be accessed 
with either byte, 
word or long word operations. 


2.2.4 
CHAIN 
BLOCK 
ADDRESS 
REGISTER 


At 
the 
start 
of 
a CD-ROM 
transfer, 
this 
register 
stores 
the address 
of the initial Chain Block. 
When 
the DMA process 
begins, 
Chain Block data is auto- 
matically 
read from VRAM in three long words fetch- 
es. After 
each 
one, the register 
is incremented 
by 


four to point to the next long word to be fetched. 


2.2.5 
BYTE 
COUNT 
REGISTER 


This register 
is automatically 
loaded at the beginning 
of a DMA transfer 
from the current 
Chain Block with 
the 24-bit two's complement 
byte count correspond- 


ing to the number 
of CD-ROM 
bytes that are to be 
stored 
in a given 
area of memory. 
As the transfer 
progresses, 
each 
four bytes 
is accumulated 
in the 
DMA Data 
In Register 
and it is then transferred 
to 
VRAM. The Byte Count register 
is then incremented 
by four and tested 
for zero. If zero, the next Chain 
Block 
is fetched 
unless 
the 
last 
block 
has 
been 
reached. 
If not zero, the transfer 
continues. 


2.2.6 
BYTE 
ADDRESS 
REGISTER 


This register is automatically 
loaded from the current 
Chain Block at the beginning 
of a DMA transfer 
with 
the initial 24-bit byte VRAM address to begin loading 
the CD-ROM 
data. After each transfer 
to VRAM, it is 


82750LV 


incremented 
by four to point to the byte address 
of 
the next long 
W( Ird. 


2.2.7 
DMA COI~TROL 
REGISTER 


The lower byte (If this 16-bit register 
is automatically 
loaded 
from 
the current 
Chain 
Block 
at the 
begin- 
ning of a DMA tl ansfer with control 
information. 
The 
upper 
byte 
con ains 
additional 
control 
information 
loaded 
directly 
f 'orn the DVI Bus. 


2.2.8 
COMMA~ ID REGISTER 


This 16-bit regis :er is used to control 
the activity 
on 
the SCSI Bus. 


2.2.9 
STATUS 
REGISTER 


This 
read-only 
Iegister 
monitors 
the 
status 
of the 
SCSI Bus. 


2.2.10 
READ SCSI DATA 
REGISTER 


This one byte re.lister is a buffer which allows status 
and message 
in ormation 
on the SCSI Data Bus to 
be read. 


2.2.11 
WRITE 
(,~ENERAL REGISTER 


This 
one 
byte 
register 
stores 
command 
and 
mes- 
sage data as well as the Target 
ID to be placed 
on 
the SCSI Bus. 


2.2.12 
STATE 
REGISTER 


This register 
allows for the monitoring 
of the internal 
State Machine 
and DMA Controller. 


Table 
2-2. SCSI Register 
Configuration 


Register 
DVI 
DVI 
Register 
Address 
R/W 
Length 


Chain Block Address 
Register 
F80000H 
R/W 
32 Bits 


Byte Count Register 
F80004H 
R/W 
32 Bits 


Byte Address 
Register 
F80008H 
R/W 
32 Bits 


DMA Control 
Register 
F8000CH 
R/W 
16 Bits 


Command 
Register 
F80010H 
R/W 
16 Bits 


Status Register 
F80014H 
R/W 
16 Bits 


Read SCSI Data Register 
F80018H 
R/W 
8 Bits 


Write General Register 
F8001CH 
W 
6 Bits 


State Register 
F80020H 
R/W 
16 Bits 
I 


• 
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2.2.3 
REGISTER 
CONFIGURATION 


There are nine memory-mapped 
DVI Registers 
in the 
SCSI portion of the VSCGA array. Table 2-2 shows a 
listing of their names, addresses, 
lengths and acces- 
sibility from the DVI Bus. All are located 
in the range 
from F80000H 
to F80021 H, conforming 
to a DVI De- 
vice ID of 4. Each can be accessed 
with either byte, 
word or long word operations. 


2.2.4 
CHAIN 
BLOCK 
ADDRESS 
REGISTER 


At 
the 
start 
of 
a CD-ROM 
transfer, 
this 
register 
stores 
the address 
of the initial Chain Block. 
When 
the DMA process 
begins, 
Chain Block data is auto- 
matically 
read from VRAM in three long words fetch- 
es. After 
each 
one, the register 
is incremented 
by 
four to point to the next long word to be fetched. 


2.2.5 
BYTE COUNT 
REGISTER 


This register 
is automatically 
loaded at the beginning 
of a DMA transfer 
from the current 
Chain Block with 
the 24-bit two's complement 
byte count correspond- 
ing to the number 
of CD·ROM 
bytes that are to be 
stored 
in a given 
area of memory. 
As the transfer 
progresses, 
each 
four 
bytes 
is accumulated 
in the 
DMA Data In Register 
and it is then transferred 
to 
VRAM. The Byte Count register 
is then incremented 
by four and tested 
for zero. If zero, the next Chain 
Block 
is fetched 
unless 
the 
last 
block 
has 
been 
reached. 
If not zero, the transfer 
continues. 


2.2.6 
BYTE ADDRESS 
REGISTER 


This register is automatically 
loaded from the current 
Chain Block at the beginning 
of a DMA transfer 
with 
the initial 24-bit byte VRAM address to begin loading 
the CD-ROM 
data. After each transfer 
to VRAM, it is 
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incremented 
by four to point to the byte address 
of 
the next long word. 


2.2.7 
DMA CONTROL 
REGISTER 


Thelower byte of this 16-bit register 
is automatically 
loaded 
from 
the current 
Chain 
Block 
at the begin- 
ning of a DMA transfer 
with control 
information. 
The 
upper 
byte 
contains 
additional 
control 
information 
loaded 
directly 
from the DVI Bus. 


2.2.8 
COMMAND 
REGISTER 


This 16-bit register 
is used to control 
the activity 
on 
the SCSI Bus. 


2.2.9 
STATUS 
REGISTER 


This 
read-only 
register 
monitors 
the 
status 
of the 
SCSI Bus. 


2.2.10 
READ SCSI DATA 
REGISTER 


This one byte register 
is a buffer which allows status 
and message 
information 
on the SCSI Data Bus to 
be read. 


2.2.11 
WRITE 
GENERAL 
REGISTER 


This 
one 
byte 
register 
stores 
command 
and 
mes- 
sage data as well as the Target 
ID to be placed 
on 
the SCSI Bus. 


2.2.12 
STATE 
REGISTER 


This register allows for the monitoring 
of the internal 
State Machine 
and DMA Controller. 


Table 2-2. SCSI Register 
Configuration 


Register 
DVI 
DVI 
Register 
Address 
R/W 
Length 


Chain Block Address 
Register 
F80000H 
R/W 
32 Bits 


Byte Count Register 
F80004H 
R/W 
32 Bits 


Byte Address 
Register 
F80008H 
R/W 
32 Bits 


DMA Control 
Register 
F8000CH 
R/W 
16 Bits 


Command 
Register 
F80010H 
R/W 
16 Bits 


Status Register 
F80014H 
R/W 
16 Bits 


Read SCSI Data Register 
F80018H 
R/W 
8 Bits 


Write General Register 
F8001CH 
W 
6 Bits 


State Register 
F80020H 
R/W 
16 Bits 
I 
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CAPTUR,E 
SUBSYSTEM 
VSCGA 


DCLK 
Data 
Clock 


PG. DB & 


BRSTRO- 
Burst 
Re~uest 
DCLK 
SYSRST- 


Reset 
DVIBUS 


BRSTACK- 
Burst 
Acknowle~e 
BRSTRO- 
V1CLK 
DVI Bus Clock 
RESET- 


YVUSEL 
Y /VU- 
Select 
BRSTACK- 
DSTRB- 
Device Strobe 
V1CLK 


HTIW 
Horizontal 
Time 
YVUSEL 
WSTRB- 
Wemorv 
Strobe 
DSTRB- 


EVEN_ODD 
Even 
Odd- 
Field 
HTIW 
GAVALEN 
Gate 
Arrav 
Valid 
Latch 
Enable 
WSTRB- 


REGSEL- 
Re" ister 
Select 
EVEN_ODD 
BUSEN- 
Bus 
Enable 
GAVALEN 


DRDY 
Data 
R~ister 
Reacl~ 
REGSEL- 
VWE- 
VRAM Write 
Enable 
BUSEN# 


.." 
CS2FLG 
WO(31 :0) 
Wemorv 
Data 
31:0 
VWE# 


BE- 
(3:0) 


Bvte 
Enable 
WO(31:0) 


SCSI BUS 


WROY- 
Wemorv 
Ready 
BE# (3:0) 


NXTFST# 
Next 
Fast 
Cvcle 
WROY# 


DB (7:0) 
'--- 
SCSI Data 
Bus 
TRNFR# 
Transfer 
Cvcle 
NXTFST# 


OBP- 
SCSI Data 
Bus 
Parity 
OB(7:0) 
RFSH# 
Refresh 
Cvcle 
TRNFR# 


ACK# 
SCSI Acknowled~e 
OBP# 
MREO# 
Wemorv 
Reauest 
RFSH# 


ATN# 
SCSI Attention 
ACK# 
INT# 
cceture 
Interruet 
WREO# 


SEL- 
SCSI Select 
ATN# 
CHANREO# 
cceture 
Re"uest 
CINT# 


C/O- 
SCSI ControTlData· - SEL# 
VSCSEL# 
Canture 
Acknowledge 
CREO# 


I/O- 
SCSI In 
ut 
Outout 
CO# 
VBUS3 
Video 
Bus 
bit 
3 
CACK# 


WSG 
SCSI Wessaoe 
10- 


VBUS[3] 


SCREO- 
SCSI Reauest 
WSG# 


BSY- 
SCSI Busv 
REO# 


RST# 
SCSI Reset 
BSY# 
ADDRESS 


t.. 
RSTIN# 
VA (23:0) 


VRAW Address 
MUX 


RST# 
WUX 
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Figure 3-2. Latching 
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3.2 
VRAM and Capture 


For additional information about the capture pro- 
cess, refer to the ActionMedia 11 Capture Bus Speci- 
fication. 
' 


I 


3.2.1 VRAM AND REGISTER ACCESS 


In addition to latching the VA(23:0) Address Bus, the 
VSCGA chip supports DVI VRAM and Register ac- 
cesses in several ways: (1) It monitors the DVI 
Address Bus and determines whether the address 
corresponds to the VRAM or register space. All ad- 
dresses below FOOOOOHpoint to VRAM and conse- 
quently the gate array outputs a memory strobe 
(MSTRB#) at the appropriate time during those cy- 
cles. For multiple cycle accesses this signal indi- 
cates when the address may be changed. Similarly, 
for higher addresses up to the maximum FFFFFFH, 
the chip outputs a data strobe (DSTRB#) indicating 
a register access. This signal allows other bus inter- 
faces to develop chip enables without having to de- 
code the address lines. (2) It monitors the DVI Con- 
trol Bus, determines the bus master (the Intel 82750 
Pixel Processor or other DVI Device) and for VRAM 
cycles determines the type of cycle. Single read- 
write, Dual page-mode read-write (NXTFST and 
BUSFAST), transfer, refresh and continuous page- 
mode read-write cycles for capture are all support- 
ed. (3) It allows the programmer to independently 
select timing parameters for a variety VRAM read 
and write and register control signals, including 
CAS#, RAS#, MSTRB# and DSTRB#. This fea- 
ture accommodates a wide variety of clock and 
VRAM speeds. 


The VRAM addressing is organized such that the 
four RAS# outputs reflect the most significant bits 
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of memory, only one of which can be active at a time 
(except during refresh), while the four CAS# outputs 
represent the byte enables (BE3# ... 
BEO#) and 
are not mutually exclusive. The CAS# outputs each 
enable 8 output bits and will all be active simulta- 
neously when a 32-bit word is being addressed. 


Single Cycle VRAM Access 


A single VRAM read cycle followed by a single 
VRAM write cycle is shown in Figure 3-3. For sim- 
plicity, the diagram does not show the bus master- 
ship acquisition required for non-PB-initiated cycles. 
The read cycle begins with the assertion of MREQ# 
by PB (regardless of bus ownership.) Once the cycle 
is begun, time is allowed for the row address to ' 
propagate through the external Address Mux at 
which time RAS# is asserted. Shortly thereafter, the 
Mux output is switched from high to low, causing the 
column address to appear at the output of the exter- 
nal Address Mux. After a short delay CAS# and 
TROE# are asserted, completing the address and 
enabling the VRAM output. One V1ClK cycle later 
MRDY# 
is asserted indicating that data can be 
latched at the next rising edge of V1ClK. 


The second cycle in Figure 3-3 is a write cycle. The 
timing is very similar to the read cycle except that 
TROE# remains unasserted and the WE# output is 
enabled. In this cycle the assertion of MRDY# indi- 
cates that the data will be latched in VRAM at the 
rising edge of that signal. Note that while the write 
cycle request is asserted at the earliest possible op- 
portunity, the cycle is delayed by one V1ClK pulse. 
Whenever two or more contiguous requests are re- 
ceived, VSCGA automatically delays the generation 
of the second cycle to meet the minimum VRAM 
timing requirement for RAS# to be unasserted be- 
tween cycles (RAS Precharge). 
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Figure 3-3. Single VRAM Read and Write Cycles 
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Dual-Cycle 
Page-Mode 
VRAM 
Access 


There 
are two very similar 
dual page-mode 
VRAM 
cycles 
supported 
by 
VSCGA. 
The 
first 
is 
the 
NXTFST 
cycle which is one initiated 
by the PB. The 
second 
is a BUSFAST 
cycle which is initiated 
by any 
other DVI Device. A NEXTFAST 
read cycle is shown 
in Figure 3-4. The gate array looks for the simulta- 
neous 
assertion 
of MREQ# 
and NXTFST# 
as the 
V1 ClK 
goes high to generate 
this cycle. The wave- 


forms are similar to that of the single read cycle with 
the following 
exceptions: 
(1) There 
are two CAS# 
strobes 
for the 
RAS # 
strobe. 
(2) The first 
access 
must be an even word address 
(VA2 = 0) and the 
second 
the next higher odd address 
(VA2 = 1). (3) 
There 
is no delay due to RAS# 
being unasserted. 


Note that there is an MRDY # strobe 
after each ac- 
cess to indicate 
that data is available 
and the cycle 
may be ended. 
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Figure 
3-4. NXTFST 
Dual Page-Mode 
Read Cycle 
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A BUSFAST read cycle is shown in Figure 3-5. Since 
this cycle is initiated by a DVI Bus Device, the bus 
mastership is first acquired through a request pro- 
cess (not shown) which results in the BUSEN# sig- 
nal being asserted and enabling the output drivers 
during the read cycle. The GAVALEN low transition 
causes the least significant 24-bit address on the 
MD(31:0) Bus to be latched and MD31 = 1 causes 
the BUSFAST cycle to be generated by the gate ar- 
ray. The cycle proceeds identically to the NXTFST 
cycle with the exception that (1) the MSTRB# signal 
is used by the bus master to indicate that the read 
data can be latched and the address incremented 
and (2) the MRDY# signal is asserted only once at 
the end of the dual cycle to indicate to the PB that 
the cycle is complete. 


Transfer 
Cycle 


Transfer cycles are used to load the VRAM serial 
shift register with data for DB. DB requests the 
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transfer of different types of data but has no knowl- 
edge of memory addresses or pointers. These point- 
ers are maintained by PB, which schedules a trans- 
fer cycle upon receipt of any of several transfer 
codes over the VBUS. Since there is no mechanism 
for DB to get feedback as to when the transfer is 
performed, it waits a programmable period of time 
and assumes it is complete. In order to minimize this 
period, the PB makes transfers the highest priority 
operation it can perform on the memory system. 


A transfer cycle appears similar to a single read cy- 
cle except that TROE# is asserted before the falling 
edge of RAS# as shown in Figure 3-6. Some pro- 
grammable time after the end of the memory cycle, 
DB begins to deliver serial clocks to the VRAMs and 
the data comes out of the serial port one 32-bit word 
at a time. The serial port being addressed is enabled 
by one of the SE# (3:0) outputs, which are decoded 
by the gate array in the same manner as the RAS# 
signals. 
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Figure 3-5. BUSFAST 
Dual.Page-Mode 
Read Cycle 
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Refresh 
Cycle 


Refresh cycles are initiated by DB and executed by 
PB. DB issues a refresh request over the VBUS to 
PB which generates a programmable number (up to 
15) of refresh cycles for each DB request. These 


cycles are distinguished by RFRSH# being asserted 
simultaneously with MREQ# as shown in Figure 3-7. 
VSCGA generates a Refresh cycle by asserting 
CAS# before RAS#. All banks of VRAM go active 
regardless of the state of the address bus. 
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FIgure 3-6. Transfer 
Cycle 
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Figure 3-7. Refresh 
Cycle 
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Register 
Cycles 


Each of the DVI Bus Devices 
contains 
one or more 
addressable 
registers 
used for control 
and monitor- 
ing of the Device's 
functions. 
These registers 
are all 


in the range from FOOOOOHto FFFFFFH. 
Whenever 
any device 
in that 
range 
is accessed, 
the VSCGA 


generates 
a register cycle as shown in Figure 3-8 for 
a read cycle. The acquisition 
of bus mastership 
for 
non-PB-accesses 
is omitted 
from 
the figure. 
Once 
the gate array decodes 
the four most significant 
ad- 
dress bits, it asserts the DSTRB # pulse used by the 
other DVI Bus Device address 
decoders. 
The length 
of the strobe 
is programmable. 
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Figure 3-8. Register 
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3.2.2 
CAPTURE 
BURST 


When a Capture Module is ready to transfer Y or VU 
data into a section of VRAM, it sets the YVUSEl 
signal to the appropriate level (0 for Y, 1 for VU) and 
asserts the BRSTREQ# line to request the use of 
the DVI Bus as shown in Figure 3·8. The VSCGA 
passes the request on by asserting the CHANREQ# 
output and receives a grant of its request when 
BUSEN# and VSCSEl# 
are simultaneously assert- 
ed by the Host Interface Gate Array. VSCGA, in turn, 
asserts BRSTACK#, which is monitored by the Cap- 
ture Module, indicating that the MD(31:0) Bus may 
be driven with the captured data. DClK, which is a 
free-running clock at half the V1ClK 
frequency 
when captured data is not being transferred, is 
phased to the incoming data and used as a hand- 
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shaking signal to indicate when new data may be 
placed on the Bus. When data is being transferred 
DClK goes to a 1 for one V1ClK cycle and then 
goes to a 0 for a programmed length of time. The 
page mode access continues until the Capture Mod- 
ule completes its transfer, as shown in Figure 3-8, or 
until the DB requests a transfer cycle by setting 
VBUS3 to a O. This is shown in Figure 3-9. In this 
latter case note that the BRSTREQ# stays asserted 
even though the BRSTACK# is terminated. After 
the transfer cycle completes the request will still be 
asserted and the process of acquiring the DVI bus 
will begin again. 


During the transfer of captured data the BE#(3:0) 
lines are ignored. All CAS strobes are generated un- 
less inhibited by the Input Mask (if selected). 
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Figure 3-9. Capture 
Burst (Transfer 
Terminated) 
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3.2.3 
MASKING 


During the capture of data there are two ways of 
masking the data written to VRAM. The first, input 
masking, entails selectively preventing rectangular 
areas of pixels (as small as 4 x 2 for Y data) from 
being written to memory by inhibiting CAS strobes. 
The decision to enable the strobe is based upon a 
reduced resolution, single-bit memory map. When 
input masking is selected, this map is periodically 
automatically loaded at the beginning of a captured 
horizontal video line into the Mask RAM. The sec- 
ond type of masking is write-per-bit masking, which 
uses an eight-bit value stored in the Memory Control 
Register to selectively inhibit bits in every byte that is 
written to VRAM. When this mode is enabled, 
VSCGA places the mask data (replicated four times) 
on the DVI Data Bus and asserts the WE# line be- 
fore RAS# is asserted for the capture burst. The 
VRAM chips will continue to mask data writes only 
during that burst. The procedure must be repeated 
each time RAS is asserted to continue the masking. 


3.2.4 
CAPTURE 
MODULE 
TIMING 
SIGNALS 


In addition to the signals already discussed, there 
are several lines linking the Capture Module with 
VSCGA which support the capture operation: (1) 
HTIM is a pulse driven by the Capture Module for 
each horizontal line. It is timed to occur just before 
the active data. (2) EVEN/ODD is a signal driven by 
the Capture Module to indicate which field is current- 
ly being captured. A 0 indicates an even field while a 
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1 indicates an odd field. The EVEN/ODD transition 
is expected to occur between HTIM pulses. (3) 
REGSEL# is an output signal to the Capture Module 
whose assertion indicates that a register access op- 
eration is being performed. It is the logical "AND" of 
DSTRB and (address = FC0020H). Data written to 
the Capture Module should be latched from the 
MD(31:0) 
Bus 
on 
the 
trailing 
edge 
of 
REGSEL#. Data read from the Module should be 
enabled on the leading edge of REGSEL# for as 
long as REGSEL# is active. (4) CS2FLG is an input 
handshaking signal indicating that the Capture Mod- 
ule has register data available to be read or that 
register write data has been accepted. 


3.3 
SCSI Interface 


3.3.1 
SCSI PROTOCOL 


In the process of VSCGA as the Initiator command- 
ing a CD-ROM Target to perform its various func- 
tions, the SCSI Bus goes through several states. A 
simple state diagram encompassing the sequences 
for Data In operations encountered in a Single-Initia- 
tor system is shown in Figure 3-10. In the figure, 
I(nitiator), and T(arget) followed by a colon and a 
signal name indicate the assertion or negation of a 
signal and its source. Signal sequences within dot- 
ted boxes may be repeated in their entirety. Arrows 
within states indicate the direction of data flow. The 
following paragraphs describe how the phases are 
traversed during normal CD-ROM operation: 
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Figure 3-10. SCSI Bus Phase Diagram 


BUS FREE PHASE. In the diagram the system starts 
off in the BUS FREE state with none of the control 
signals asserted. 


SELECTION 
PHASE. Because this is a Single-Initia- 
tor system, there is no Arbitration Phase, and the 
Initiator can go directly into the Selection Phase.The 
ID of the Target Device (one of eight possible single 
bit selections) is placed onto the SCSI Bus. Next, 
SEL is asserted to indicate that the Target ID is val- 
id. The selected Target then asserts BSY after 
which time the Initiator releases SEL and the selec- 
tion process is complete. 


COMMAND 
PHASE. Without further direction, the 
Target now asserts CID and negates MSG and 1/0 
to put the SCSI Bus into the Command Phase. It 
then asserts the REO line to receive command data. 
This is placed on the Data Bus by the Initiator, fol- 
lowed by the assertion of ACK. The Target receives 
ACK, reads the command data and releases REO. 
In response the Initiator releases ACK and may 
place new command data on the Bus. The target 
may continue to request command bytes by assert- 
ing REO again. The number of command bytes to be 
transferred is contained in the first command byte. 


I 


DATA 
IN PHASE. Depending on the nature of the 
command, the SCSI Bus may enter either the DATA 
IN or the STATUS PHASE. If the DATA IN PHASE is 
selected, the Target asserts 1/0 and negates CID 
and MSG. It then places retrieved data on the Data 
Bus and asserts REO. The Initiator reads the Data 
Bus and then asserts ACK. When the Target detects 
ACK it negates REO and waits for the Initiator to 
negate ACK. The Target may then continue to pro- 
vide data by asserting REO again. The Initiator 
"knows" how many bytes of data to expect by virtue 
of the original command. 


STATUS 
PHASE. This phase is entered either after 
the DATA IN PHASE or directly from the COMMAND 
PHASE depending on the command. In either case 
the Target asserts CID and 1/0 and negates MSG. 
The Target then places Status Data on the Data Bus 
and asserts REO. The process continues as in the 
DATA IN PHASE. 


MESSAGE 
PHASE. This phase follows the STATUS 
PHASE and typically indicates that the command is 
complete. The Target asserts CID, 1/0 and MSG. It 
places the Message Data on the Data Bus and as- 
serts REO. The process continues as in the DATA 
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IN PHASE. Afterwards the Initiator and Target re- 
lease all lines and the SCSI Bus returns to the BUS 
FREE PHASE. 


3.3.2 SCSI BUS DRIVERS 


There are thirteen signal lines that must be driven by 
the VSCGA on the SCSI Bus. Each requires a drive 
capability of 48 mA. In order to achieve that current 
capability output drivers in the chip are paired. The 
pin names of the two drivers are the same except 
that one has an "A" suffix. 
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4.0 
PROGRAMMING 
INFORMATION 


4.1 
VRAM and Capture 


This section describes the details of the ten VRAM 
and Capture registers addressable from the DVI 
Bus. 


4.1.1 Y SHADOW REGISTERS 
(Y_EVEN, Y_ODD) 


There are two 32-bit Y Shadow Registers, one which 
is used during even fields and the other during odd 
fields. Figure 4-1 identifies each bit of these regis- 
ters as well as its accessibility. These registers are 
used during all modes of capture and are pointers 
into VRAM for the location to start storing the cap- 
tured Y, R, G or B data. 
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Figure 4-1. Video Capture Y Shadow Registers 
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YA23 ..YAO· 
Y Address 
Pointer 
(Bits 
23:0) 


These 24 bits represent 
the starting 
location 
for stor- 


ing captured 
Y pixels stored in VRAM. Once this val- 


ue is loaded 
into the Working 
Register 
it becomes 


the 
current 
Y address 
pointer 
into VRAM 
for that 


field and is incremented 
by one for each pixel and by 


the horizontal 
line length (next higher power of two) 


for each received 
HTIM pulse preceded 
by active Y 


data. 
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OFF3 •.0FFO • Y OFFSET 
(Bits 
27:24) 


These 
four 
bits 
determine 
where 
in the 
YA(23:0) 


pointer that the pixel counter 
ends and the least sig- 


nificant 
bit of the horizontal 
line counter 
begins. Ta- 


ble 4-1 indicates 
where the LSB of the line counter 
is 


located 
for all values of OFF(3:0). 
Note that for val- 


ues of OFF 
greater 
than 9 there 
is no change 
in 


functionality. 


Table 
4·1. LSB of Line Counter 
vs Offset 


Offset~-r 
__ +-~ __~ __r--+ 
__ ~ 
__ +-__+-__+-__+-__+-__+-__+-__r-~ 
• 


LSB 


I 


1-217 


82750LV 


LSB Mode (Bit 30) 


The LSB bit allows the pixel counter to either be 
initialized to a value or cleared to zero when it is 
loaded into the Pointer Counter. If this bit is set to O. 
bits of lesser significance than the LSB of the line 
counter are set to a zero at the beginning of each 
line regardless of the pixel count in the Working or 


. Shadow Register. When this bit is set to a 1. the 
register pixel count will be loaded into Pointer Coun- 
ter at that time. This mode is useful for offsetting 
images or preserving that offset when the Capture 
Module is producing cropped data. 


TRAN Mode (Bit 31) 


The TRAN bit. when set to a 1. allows the Y Shadow 
Register to be updated at the beginning of each line. 
This feature is mainly for diagnostic purposes but 
could be used to allow the programmer to monitor 
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the capture progress within a field. Since neither the 
Working Registers nor the Pointer Counter are ad- 
dressable the only way to accomplish this is to read 
a continuously updated Shadow Register. The only 
difficulty is that the read may coincide with a register 
update and the resulting data may be erroneous. 
This can be overcome with multiple software reads 
or with a simple external hardware fix which pre- 
vents updating during a read. 


4.1.2 VU Shadow Registers 
(VU_EVEN, VU_ODD) 


There are two 32-bit VU Shadow Registers, one 
which is used during even fields and the other during 
odd fields. Figure 4-2 identifies each bit of these reg- 
isters as well as its accessibility. These registers are 
used during YVU9, YVU10 and YVU12 capture 
modes and are pointers into VRAM for the location 
to start storing the captured V and U data. These 
registers are defined exactly the same as the Y 
Shadow Registers. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
0 


IVUA151vUA141vUA131vUA121vUAl11vUA101 
VUA91 vUAel 
VUA71 VUA6 ~ 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


31 
30 
29 
tsa I TRAN I 
1 


R/W 
R/W 
R/W 


29 
V 
29 
~ 
U 
~ 
U 
~ 
~ 
W 
W 
D 
~ 


o 
I OFF31 OFF21 OFFl 
I OFFO I VUA23 I VUA221VUA21IvUA201 VUA19 I VUAlel VUA17 I VUA161 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


Figure 4-2. Video Capture VU Shadow Registers 
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4.1.3 Mask Shadow Registers 
(VU_EVEN, VU_ODD) 


There are two 32·bit Mask Shadow Registers, one 
which is used during even fields and the other during 
odd fields. Figure 4·3 identifies each bit of these req- 
isters as well as its accessibility. These registers are 
used during YVU9, YVU10 and YVU12 capture 
modes and are pointers into VRAM for the location 
of the Input Mask bit Map. These registers are de- 
fined exactly the same as the Y and VU Shadow 
Registers except for bits 28 and 29 which control the 
phase of the line counter controlling the mask fetch. 
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MOD1, MODO(Bits 29:28) 


Each 1, 2 or 4 lines, depending on the VU subsam- 
pling mode, mask data is fetched from VRAM and 
loaded into the Mask RAM. These bits control the 
starting count for the 
-7- 4 counter which controls 
that process. This feature gives the programmer the 
ability to vertically phase the Input Mask with the 
captured image without modifying the VRAM mask 
data. The relationship between the MOD(1:0) field 
and the starting line is shown below: 


Starting 
MOD1 
MODO 
Line 


0 
0 
0 
1 
1 
1 
2 
1 
0 
3 
0 
1 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 


~ 
MA14! 
MA13 ~ 
MAll! 
MAlO! 
MA9 ! MAe ! MA7 ! MA6 ! MAS ! MM 
! MA3 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


2 
1 
0 


MA2! 
MAl 
~ 


R/W 
R/W 
R/W 


31 
30 
29 
28 
27 
26 
25 
24 
23 
22 
21 
20 
19 
18 
17 
16 


~ 
TRAN ! MODl ! MODO! OFF3! 
OFF2! 
OFFl 
! OFFO ~l 
MA20! 
MA19! 
MAle! 
MAl?! 
MA16! 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


Figure 4-3. Video Capture Input Mask Shadow Registers 


4.1.4 V-U INTERLEAVE REGISTER (VU_PITCH) 


This 16·bit register is used in YVU9, YVU10 and YVU12 modes of capture to separate the intertwined V and U 
data into adjacent areas of VRAM. Figure 4-4 identifies each bit of this register as well as its accessibility. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
0 


~ 
WBP6! 
WBPS! WBP4! 
WBP3! 
WBP2! WBPl ~l 
VUO?! 
VU06! 
vuos! 
VU04! 
VU03! 
vuo21~ 


R 
R 
R 
R 
R 
R 
R 
R 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


Figure 4-4. VU Interleave Register 
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VU07 ..VUOO - VU Offset 
(Bits 7:0) 


When the Capture 
Module 
sends bursts of captured 
chroma 
data to VRAM, 
the repeated 
eight-byte 
for- 
mat 
consists 
of four 
bytes 
of V followed 
by four 
bytes of U data. VSCGA keeps track of the incoming 
data format and allows the contents 
of the VU Offset 
field to be added only to the U data addresses. 
Be- 
fore the addition, 
the register contents 
are multiplied 
by eight. Since the field is one byte wide, a maximum 
offset 
of 2048 bytes is achievable. 
This is not suffi- 
cient to totally 
separate 
the two components 
but is 
enough 
to allow them to be placed 
side-by-side 
as 
shown 
in Figure 2-2. 
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WPB7 ..WPBO - Wrlte-per-Blt 
(Bits 15:8) 


This read-only 
field is used for diagnostic 
purposes 
and contains 
the Write-per-Bit 
field information 
from 
the Memory 
Command 
Register. 


4.1.5' CAPTURE 
COMMAND 
AND STATUS 
REGISTER 
(CAP _CST) 


This 16-bit register 
is used to set up the parameters 
for the video capture 
operation. 
Figure 4-5 identifies 
each bit of this register 
as well as its accessibility. 


A 
A 
R/W 
A/W 
A 
A/W 
A/W 


Figure 4-5. Video Capture 
Command 
and Status Register 
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IHT - Capture Interrupt (Bit 15) 


When this bit is a 1, the gate array is asserting a 
Capture Interrupt because of an EVEN/ODD field bit 
transition (either direction). For this to occur, the 
Even/Odd Interrupt Enable (EOIE) must be enabled. 
Writing a 1 to this location will reset the bit to a 0 but 
will not clear the interrupt. Reading the SCSI Status 
Register (F8000CH) will reset the bit and clear the 
interrupt. 


RDY - Capture Module Ready (Bit 14) 


This read-only bit reflects the status of CS2FLAG pin 
which, in turn, reflects the status of the Capture 
Module Command and Status Register. When this 
bit is a 1, it indicates either that write data has been 
accepted or that read data is valid. 


V-G - Video Gate (Bit 13) 


When this bit is a 1, the gate array is in the process 
of processing a burst of data from the Capture Mod- 
ule. This read-only bit is for diagnostic purposes. 


UPM - Update Pointer Mode (Bit 12) 


When this bit is set to a 1, the Shadow Registers will 
be updated from the Working Registers on every oc- 
currence of the HTIM pulse. This is equivalent to 
setting the TRAN bit to a 1 on all the Shadow Regis- 
ters. This bit becomes effective on the following 
transition of the EVEN/ODD signal. When this bit is 
set to a 0, the Shadow Registers will be updated 
from VRAM once per field (even or odd each time). 


VUD - VU Data Delnterleave (Bit 11) 


When this bit is set to a 1 the U and V data bursts 
will be deinterleaved as they are written into VRAM. 
This bit becomes 
effective on the following transition 
of the EVEN/ODD signal. 


HTIM - Horizontal Timing (Bit 10) 


This read-only bit indicates the status of the HTIM 
signal on the Video Capture Bus. 


ORX - Overrun X (Bit 9) 


This read-only bit is set when an HTIM pulse is as- 
serted before the bursts from previous video line 
have been written to VRAM and the capture process 
has been interrupted to perform a TRANSFER cycle. 
Setting the OR bit to a 1 will clear this condition. 
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ORM - Overrun Mask (Bit 8) 


This read-only bit is set when an overrun occurs dur- 
ing a transfer of mask data. This probably indicates 
a serious problem since it is the first data to be 
transferred after an HTIM pulse. Setting the OR bit 
to a 1 will clear this condition. 


ORY - Overrun Y (Bit 7) 


This read-only bit is set when an overrun occurs dur- 
ing a transfer of Y data. Setting the OR bit to a 1 will 
clear this condition. 


ORVU - Overrun VU (Bit 6) 


This read-only bit is set when an overrun occurs dur- 
ing a transfer of VU data. Setting the OR bit to a 1 
will clear this condition. 


OR - Overrun (Bit 5) 


This bit is set when an HTIM pulse is asserted be- 
fore the bursts from previous video line have been 
written to VRAM. At least one of the preceding four 
bits must be set as well. Setting this bit to a 1 will 
clear this condition and the preceding four bits as 
well. 


OME - Output Mask Enable (Bit 4) 


Setting this bit to a one enables the Write-per-Bit 
capture feature of.the gate array. This bit becomes 
effective on the following transition of the EVEN/ 
·000 signal. 


4x2# - 4 x 4 vs. 4 x 2 Subsampled Chroma 
(Bit 3) 


When this bit is set to a 1, each bit in the Input Mask 
represents a 4 x 4 rectangular area of pixels. When 
this bit is set to a 0 each bit in the Input Mask repre- 
sents a 4 x 2 rectangular area of pixels. For this bit 
to have meaning, the following IMD bit must be set 
to a O. This bit becomes effective on the following 
transition of the EVEN/ODD signal. 


IMD - Input Mask Disable (Bit 2) 


When this bit is set to a 0 the Input Mask Mode is 
enabled. This means that prefetched mask data 
from VRAM loaded into the Mask Ram will deter- 
mine which bytes, if any, of a capture burst will be 
written into VRAM. This bit becomes effective on the 
following transition of the EVEN/ODD signal. 
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EOIE - Even/Odd Interrupt Enable (Bit 1) 


When this bit is set to a 1, VSCGA will create a 
Capture Interrupt for every transition of the EVENI 
ODD input signal. This bit is effective immediately. 


E/O# - Even/Odd# 
(Bit 0) 


This read-only bit indicates the state of the EVENI 
ODD signal input pin on the gate array. It is a 1 dur- 
ing even fields and a 0 during odd fields. 


4.1.6 MEMORY COMMAND REGISTER 
(MEM_CMD) 


This write-only 32-bit register determines the timing 
parameters to allow for a wide variety of combina- 
tions of system clock and memory speeds. It also 
stores mask information for Write-per-Bit operations. 
Figure 4-6 identifies each bit of this register as well 
as its accessibility. 


WPB7..WPBO- Wrlte-per-Bit 
(Bits 23:16) 


This field of data contains the mask value that is 
loaded into VRAM during a Write-per-Bit capture cy- 
cle when the Output Mask Mode is enabled. A value 
of 1 in any bit position allows that bit to be written; a 
o prevents it. Typically, the value is set to FE to allow 
the PB to distinguish between video and graphics 
data in its "Alpha Channel" mode of operation. 


4MEG# - 1 Mb vs. 4 Mb VRAMS (Bit 12) 


When this bit is set to a 0 the controller decodes 
addresses for 4 Megabit VRAMs and addresses a 
total of 15 Megabytes of memory. The address 
ranges for which the four RAS# lines are decoded is 
shown in Table 4-2. The top Megabyte of the ad- 
dress space is reserved for DVI Bus Devices and is 
not decoded for VRAM. 


Table 4-2 4 Megablt VRAM RAS# Decodes 


Address 
RASO# RAS1# RASH 
RAS3# 
Range 


0-3FFFFFH 
Yes 
No 
No 
No 


400000-7FFFFFH 
No 
Yes 
No 
No 


BOOOOO-BFFFFFHNo 
No 
Yes 
No 


COOOOO-EFFFFFH No 
No 
No 
Yes 


When this bit is set to a 1 the controller decodes 
addresses for 1 Megabit VRAMs and addresses a 
total of 4 Megabytes of memory. The address 
ranges for which the four RAS# lines are decoded is 
shown in Table 4-3. The RAS# lines are not fully 
decoded in this mode and are asserted for higher 
address spaces as well but the CAS# lines are in- 
hibited for addresses above 400000H in this mode 
so that no memory operations will take place there. 


Table 4-3 1 Megablt VRAM RAS# Decodes 


Address 
RASO# RAS1# RAS2# RAS3# 
Range 


O-OFFFFFH 
Yes 
No 
No 
No 


100000-1FFFFFN 
No 
Yes 
No 
No 


200000-2FFFFFH 
No 
No 
Yes 
No 


300000-3FFFFFH 
No 
No 
No 
Yes 


OWL1, DWLO- DSTRB# Write Length 
(Bits 11:10) 


This field determines the number of cycles that 
DSTRB# is asserted during a Register Write Cycle. 
This is chosen as a function of the minimum write 
pulse widths required.by the DVI Bus Devices. In this 
mode DSTRB# 
is asserted for (DRl 
+ 1) x 2 
V1ClK periods. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
0 
o I 
0 
~ 
OWLl I OWLO I ORLl I ORLO I COO 
~ 
RLl 
RLO 


W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 


31 
30 
29 
28 
,.--::2:.:.7~-=26=---,,.--::2:.:.5-, 
24 
23 
22 
21 
20 
19 
18 
17 
16 
r-o-'-o-~ 
0 
0 
0 
~ 
WPB61 WPB51 WPB41 WPB31 WPB2 ~ 
w 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 
W 


Figure 4-6. Memory Command Register 
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DRL1, DRLO- DSTRB# Read Length (Bits 9:8) 


This field determines the number of cycles that 
DSTRB# is asserted during a Register Read Cycle. 
It is chosen as a function of the maximum read ac- 
cess time exhibited by any of the DVI Bus Devices. 
In this mode DSTRB# is asserted for (DRl + I) X 2 
+ 1 V1ClK periods. 


COD - CAS# Onset Delay (Bit 7) 


When this bit is set to a 1, it adds a delay cycle to 
the leading edge of all CAS# cycles. This allows for 
extra Column Address or Write Data setup time. This 
bit has no effect on Refresh Cycles. 


000 - DSTRB# Onset Delay (Bit 6) 


When this bit is set to a 1, it adds a delay cycle to 
the leading edge of all DSTRB# cycles. This allows 
for extra address or control signal setup time. 


XP - Extra Precharge (Bit 5) 


This bit adds an extra precharge wait cycle after 
RAS# is unasserted. When this bit is set to a 0, 
there are two V1ClK 
cycles from the time that 
RAS# is unasserted until the time it is reasserted. 
When this bit is set to a 1, that number is extended 
to three. 


ROD - RAS# Onset Delay (Bit 4) 


When this bit is set to a 1, it adds a delay cycle to 
the leading edge of RAS# cycles only when the nor- 
mal delay cycle between single VRAM read-write cy_· 
cles is not inserted. This allows for extra Row Ad- 
dress setup time from the assertion of MREQ#. This 
bit has no effect on Refresh Cycles. 


WL1 ,WLO- Memory Write Operation Length 
(Bits 3:2) 


This field determines the number of V1ClK cycles 
that RAS# 
and CAS# 
are simultaneously active 
during a VRAM Write Cycle. Its value is chosen as a 
function of the minimum RAS#, 
CAS# 
or WE# 
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pulse width required by VRAM. The relationship be- 
tween the Wl(1:0) field and the number of V1ClK 
cycles is shown below: 


# V1CLK 
WL1 
WLO 
Cycles 


1 
0 
0 
2 
0 
1 
3 
1 
0 
4 
1 
1 


RL1, RLO- Memory Write Operation Length 
(Bits 1:0) 


This field determines the number of V1ClK cycles 
that RAS# 
and CAS# 
are simultaneously active 
during a VRAM Read Cycle. Its value is chosen as a 
function of the maximum RAS# or CAS# access 
time. The relationship between the Rl(1 :0) field and 
the number of V1ClK cycles is shown below: 


# V1CLK 
RL1 
RLO 
Cycles 


1 
0 
0 
2 
0 
1 
3 
1 
0 
4 
1 
1 


4.1.7 CAPTURE MODULE COMMAND AND 
STATUS REGISTER (CM_CST) 


This 16-bit read-write register is decoded by the 
Capture Module logic. For information as to how to 
interpret this field, refer to the Capture Module Mi- 
crocontroller Software Specification. 


4.2 SCSIInterface 


This section describes the details of the nine SCSI 
registers addressable from the DVI Bus. 


4.2.1 CHAIN BLOCK ADDRESS REGISTER 


This 32-bit register contains a 24-bit pointer to the 
next Chain Block in VRAM. It is initially loaded using 
a DVI Bus Register access. Thereafter, whenever a 
Chain Block is fetched from memory, this register is 
incremented by four after each read cycle so that its 
pointer is always prepared for the next access. Fig- 
ure 4-7 identifies each bit of this register as well as 
its accessibility. 


1-223 


82750LV 


4 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


27 
25 
21 
111 
31 


R/W 
R/W 
R/W 
R/W 


Figure 4-7. Chain Block Address 
Register 


4.2.2 
BYTE COUNT 
REGISTER 


This 32-bit 
register 
contains 
a 24-bit 
two's 
complement 
number 
indicating 
how many 
bytes 
are left to be 
transferred 
in the current 
block. The register 
is initially 
loaded 
during a Chain Block fetch. 
Each time data is 
read from or written to memory as part of a SCSI DMA cycle, the Byte Count Register 
is incremented 
by four to 
keep it current. 
The register 
can also be read or modified 
using a DVI Bus Register 
access. 


The least significant 
bit of this register 
must always be written 
as a 0 so that the block length 
is a multiple 
of 
two. Figure 4-8 identifies 
each bit of this register 
as well as its accessibility. 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


31 
30 
211 
~----r-- 


R 
R 
R 


28 
27 
26 
25 
~ 
~ 
u 
~ 
~ 
u 
u 
u 
~ 
I BC23 I BC22 I BC2l I BC20 I BC19i 
BC18 I BC1? I BC16 I 


R 
R/W 
R/W 
R/W 
P./W 
R/W 
R/W 
R/W 
R/W 
R 
R 
R 
R 


Figure 4-8. Byte Count Register 
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4.2.3 
BYTE 
ADDRESS 
REGISTER 


This 32-bit register contains 
a 24-bit address indicating 
how many bytes are left to be transferred 
in the current 
block. 
The register 
is initially 
loaded 
during 
a Chain 
Block 
fetch. 
Each time data is read from 
or written 
to 
memory 
as part of a SCSI DMA cycle, the Byte Count Register 
is incremented 
by four to keep it current. 
The 
register 
can also be read or modified 
using a DVI Bus Register 
access. 
The least significant 
bit of this register 
must always be written as a 0 so that the block length is a multiple of two. Figure 4-9 identifies 
each bit of this 
register 
as well as its accessibility. 


r-3_1-,_30---,r-29_,-_28'--r-2_7-,_26'--r-2..c.5_ 
24 
23 
22 
21 
20 
19 
18 
17 
16 
1L-°,--~-,o~L--,o~l--,o_L--,o~--'O'--L--'O_~ 


R 
R 
R 
R 
R 
R 
R 
R 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


Figure 
4-9. Byte Address 
Register 


4.2.4 
DMA Control 
Register 


This 16-bit register 
provides 
control 
information 
for the CD-ROM 
DMA Controller. 
The least significant 
byte is 
copied 
from the corresponding 
VRAM 
location 
during the fetch of the Chain Block. The two most significant 
bits are written 
using DVI Bus Register 
writes. 
Figure 4-10 identifies 
each 
bit of this register 
as well as its 
accessibility. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
0 
~_FN_B--,-_O~,--O_-,--_o_I,--_o--,-l_o_~r--S-D1""'-S-DO-~ 
ICE 
I NULLI 


R/W 
R/W 
R 
R 
R 
R 
R 
R 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
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CDE - CD-ROM DMA Controller 
Enable (Bit 15) 


This bit is the enable for the CD-ROM DMA which 
allows requests to be generated on the DVI Bus. 
The bit is reset automatically after the last data 
block is complete (one in which the LB bit is set). 


FNB - Fetch Next Block (Bit 14) 


This bit is used to start the CD-ROM DMA Chain 
Block sequence. Initially, a VRAM address is loaded 
into the Chain Block Address Register. Then, when 
this bit is set to a 1, the Chain Block starting at that 
location will be stored in VSCGA's registers, and up- 
dated as the sequence progresses. This bit is set to 
a 0 when the first word of the Chain Block is fetched. 


504 ..500 - Software Definable Bits (Bits 7:3) 


These bits can be read from this register and may 
contain information specific to the current Chain 
Block in use (e.g. Chain Block ID). 


LB - Last Block (Bit 2) 


When this bit is set to a 1, the CD-ROM DMA Chan- 
nel will terminate at the end of the current block. CD- 
ROM DMA data transfers will not continue until re- 
enabled by software. 


IOE - Interrupt on End of Block (Bit 1) 


When this bit is set to a 1 a CD-ROM DMA Interrupt 
will be generated when the last word of the current 
block is transferred. 


NULL - Null Transfer (Bit 1) 


When this bit is set to a 1, it disables the data trans- 
fer for the current block. Other than not writing any 
data to VRAM, the DMA Channel will behave nor- 
mally. Specifically, it will continue to request DVI Bus 
cycles. This feature can be used to strip off Pad 
Characters from the CD-ROM data stream. 


4.2.5 
COMMAND REGISTER 


This 16-bit register contains the signals used.to con- 
trol the SCSI Bus, and the manner in which the gate 
array responds to SCSI Bus errors. Figure 4-11 iden- 
tifies each bit of this register as well as its accessibil- 
ity. 


15 
14 


~ENPII 


R/W 
R/W 


13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
0 
o 
I RGA I ENBC I EMSI I ENPR I ACK I 
RST I ATN I EWGRI SEL 
I EPMI I~ 


R 
R/W 
R/W 
R/W 
R/W 
R/W 
W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


Figure 4-11. Command Register 
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ELOB - Enable Loss Of Busy Interrupt (Bit 15) 


When this bit is set to a 1, VSCGA will generate an 
interrupt if the connected Target prematurely enters 
the Bus Free Phase. 


ENPI - Enable Parity Interrupts (Bit 14) 


When this bit is set to a 1 and the ENPR bit is set, an 
interrupt will be generated upon the detection of a 
parity error during a Data In transfer over the SCSI 
Bus. 


RGA - Reset Gate Array (Bit 12) 


When this unlatched bit is set to a 1, all the internal 
gate array CD-ROM logic is reset. 


ENBC - Enable Bus Drive Command (Bit 11) 


When this bit is set to a 1, VSCGA will be enabled to 
drive the SCSI Bus. When this bit is set to a 0, the 
SCSI Bus outputs will go to a high impedance state. 


EMSI - Enable Message and Status Interrupts 
(Bit 10) 


When this bit is set to a 1 and ENPR is asserted, an 
interrupt will be generated upon the detection of a 


. parity error during a Message In or Status transfer 
over the SCSI Bus. 


ENPR - Enable Parity Checking (Bit 9) 


When this bit is set to a 1 the Parity Checker is en- 
abled to examine data transferred from the Target 
over the SCSI Bus. 


Ack - Acknowledge 
(Bit 8) 


When this bit is set to a 1 and ENBC is asserted, the 
ACK signal on the SCSI Bus will be asserted. This bit 
is a handshaking signal used in conjunction with 
REO to transfer data bytes between the Initiator and 
Target. 
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RST - Reset (Bit 7) 


When this bit is set to a 1 the RST signal on the 
SCSI Bus will be asserted. This signal must be as- 
serted for a minimum of 25 us. 


ATN - Attention 
(Bit 6) 


When this bit is set to a 1, the ATN signal on the 
SCSI Bus will be asserted. The ATN signal is used to 
force the Target into the Message Out Phase. 


EWGR - Enable Write General Register (Bit 5) 


When this bit is set to a 1, the contents of the Write 
General Register are enabled onto the SCSI Data 
Bus. This bit is used to select a Target. 


SEL - Select (Bit 4) 


When this bit is set to a 1 and ENBC is asserted, the 
SEL signal on the SCSI Bus will be asserted. 


EPMI - Enable Phase Match Interrupt (Bit 3) 


When this bit is set to a 1, an interrupt is generated 
whenever the PMB(2:0) bits don't match the respec- 
tive SCSI Bus signals when REO is asserted. Upon 
interrupt, the State Machine will return to its reset 
state. 


PMB2••PMBO- Phase Match Bits (Bits 2:0) 


These three bits are used to compare to the three 
SCSI bus signals driven by the Target that determine 
the phase of the SCSI Bus. Bit 0 corresponds to I/O, 
bit 1 to MSG and bit 2 to CID. The comparison of 
these bits to the SCSI signals is performed each 
time the Target asserts the REO line, and, if a mis- 
match occurs with the EPMI bit set to a 1, causes an 
interrupt. 


4.2.6 STATUS REGISTER 


This 16-bit read-only register is used to monitor the 
signals on the SCSI Bus and interrupt sources. Fig- 
ure 4-12 identifies each bit of this register as well as 
its accessibility. 
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Figure 4-12. Status Register 
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CHRQ - DMA Channel Request (Bit 15) 


When 
this 
bit is set 
to 
a 1. it indicates 
that 
the 
CHANREO# 
signal is asserted. 
This is provided 
for 
diagnostic 
purposes. 


CINT - Capture Interrupt (Bit 14) 


When this bit is set to a 1. it indicates 
that the Video 
Capture 
logic is the source 
of the interrupt. 
This bit 
and the interrupt 
will be set to a 0 when the upper 
byte of this register 
is read. 


ENSM - Enable State Machine (Bit 13) 


When this bit is set to a 1. it indicates 
that the State 
Machine 
is enabled 
to execute 
DMA cycles. 
This bit 
is written 
in the State Register. 


10E - Interrupt on End of Block (Bit 12) 


When this bit is set to a 1. it indicates 
that interrupt 
was 
caused 
by the 
completion 
of a Chain 
Block 
DMA. This 
bit and the 
interrupt 
will be set toa 
0 
when the upper byte of this register 
is read. 


PMI - Phase Match Interrupt (Bit 11) 


When this bit is set to a 1. it indicates 
that the Inter- 
rupt was caused 
by the connected 
Target 
asserting 
REO while being in a different 
phase than is stored 


. in the Phase 
Match 
bits in the Command 
Register. 
The PMI bit and the interrupt 
will be set to a 0 when 
the upper byte of this register 
is read. 


LOBI - Loss of Busy Interrupt (Bit 10) 


When this bit is set to a 1. it indicates 
that the inter- 
rupt was caused 
by the Target 
prematurely 
enters 
the Bus Free Phase. The LOBI bit and the interrupt 
will be set to a 0 when the upper byte of this register 
is read. 


PERR - Parity Error Interrupt (Bit 9) 


When this bit is set to a 1. it indicates 
that the inter- 
rupt was caused 
by a parity error. The PERR bit and 
the interrupt 
will be set to a 0 when the upper byte of 
this register 
is read. 


IDAT - Internal Data (Bit 8) 


This bit reflects 
the state 
of the IDAT bit written 
in 
the State Register. 
When it is set to a 1. it indicates 
that the internal 
data generator 
is enabled. 
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RST - Reset (Bit 7) 


This bit reflects 
the state 
of the RST signal on the 
SCSI Bus. When it is set to a 1. it indicates 
that the 
RST signal is asserted. 


IRQ - Internal Request (Bit 6) 


This bit reflects 
the state of the IRO bit written 
in the 
State Register. 
When it is set to a 1. it indicates 
that 
the Internal 
request 
is enabled. 


REQ - Request (Bit 5) 


This bit reflects 
the state of the REO signal 
on the 
SCSI Bus. When it is set to a 1. it indicates 
that the 
REO signal is asserted. 


SEL - Select (Bit 4) 


This bit reflects 
the state 
of the SEL signal 
on the 
SCSI Bus. When it is set to a 1. it indicates 
that the 
SEL signal is asserted. 


BSY - Busy (Bit 3) 


This bit reflects 
the state of the BSY signal 
on the 
SCSI Bus. When it is set to a 1. it indicates 
that the 
BSY signal is asserted. 


CID - Command/Data 
(Bit 2) 


This bit reflects 
the state 
of the CID 
signal 
on the 
SCSI Bus. When it is set to a 1. it indicates 
that the 
CID 
signal is asserted. 


MSG - Message (Bit 1) 


This bit reflects 
the state of the MSG signal on the 
SCSI Bus. When it is set to a 1. it indicates 
that the 
MSG signal is asserted. 


I/O - Input/Output 
(Bit 0) 


This bit reflects 
the state 
of the 
I/O 
signal 
on the 
SCSI Bus. When it is set to a 1. it indicates 
that the 
I/O signal is asserted. 


4.2.7 READ SCSI DATA REGISTER 


This 8-bit read-only 
register 
is used to read Status 
and Message 
bytes from the Target. 


I 


4.2.8 WRITE GENERAL REGISTER 


This 8-bit write-only register is used to buffer Com- 
mand and Message bytes sent to the Target. It is 
also used to store the Target ID prior to selection. 
For diagnostic purposes, this register may also be 
used to send data .to the Target via programmed 
I/O. 


4.2.9 STATE REGISTER 


This 16-bit read-only register is used to monitor the 
State Machine which sequences data during DMA 
transfers. Figure 4-13 identifies each bit of this regis- 
ter as well as its accessibility. 


PIRQ - Pending Input Mask Request (Bit 15) 


When this read-only bit is set to a 1, it indicates that 
the Input Mask Request has won the arbitration and 
is the current pending DMA request. This bit is pro- 
vided primarily for diagnostic purposes. 


PCRQ - Pending Capture Request (Bit 14) 


When this read-only bit is set to a 1, it indicates that 
the Video Capture Request has won the arbitration 
and is the current pending DMA request. This bit is 
provided primarily for diagnostic purposes. 


PSRQ· Pending SCSI Request (Bit 13) 


When this read-only bit is set to a 1, it indicates that 
the SCSI Request has won the arbitration and is the 
current pending DMA request. This bit is provided 
primarily for diagnostic purposes. 


PIRQ· Pending Block Request (Bit 12) 


When this read-only bit is set to a 1, it indicates that 
the Chain Block Request has won the arbitration and 
is the current pending DMA request. This bit is pro- 
vided primarily for diagnostic purposes. 
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IMRQ • Input Mask Request (Bit 11) 


This read-only bit reflects the state of the internal 
Input Mask DMA Request signal. It is provided pri- 
marily for diagnostic purposes. 
• 
CRQ· Capture Request (Bit 10) 


This read-only bit reflects the state of the internal 
Video Capture DMA Request signal. It is provided 
primarily for diagnostic purposes. 


SRQ • SCSI Request (Bit 9) 
• 
This read-only bit reflects the state of the internal 
SCSI DMA Request signal. It is provided primarily for 
diagnostic purposes. 


BRQ • Block Request (Bit 8) 


This read-only bit reflects the state of the internal 
Chain Block DMA Request signal. It is provided pri- 
marily for diagnostic purposes. 


SB7..SBO• State Bits (Bits 7:0) 


These eight bits are for diagnostic purposes and in- 
dicate the state of the State Machine. 


ENSM • Enable State Machine (Bit 7) 


When this bit is set to a 1, the State Machine is 
enabled to sequence. This bit should only be en- 
abled for DMA transfers. The State Machine con- 
trols the handshaking signals transferring data dur- 
ing Data In and Data Out Phases of the SCSI Bus. It 
also controls the handshaking signals transferring 
data to and from VRAM during DMA operations on 
the DVI Bus. When this bit is set to a 0, the State 
Machine will hold its present value, allowing the pro- 
grammer to read its state. The value of this bit can 
be read from the Status Register. 
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Figure 4·13. State Register 
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Wltn IUA I to emulate 
Data In transfers 
in a diagnos- 
tic mode. 
The 
state 
of this 
bit can be read 
in the 
Status 
Register. 


5.0 
ELECTRICAL SPECIFICATIONS 


5.1 DCCharacteristics 


-- ..... 


(VI < OorVI > Vcc) 


Output Current Clamp 
±20mA 
(Vo < 0 orVo 
> Vccl 


Continuous 
Output Current 
20mA 
Low 


Continuous 
Output Current 
20mA 
High 


Table 5-1 contains 
stress ratings only, and function- 
. Table 
5-2. Recommended 
Operating 
Conditions 
al operation 
at the maximums 
is not guaranteed. 
Ex- 
posure 
to Maximum 
Ratings 
may affect 
device 
reli- 
ability. Furthermore, 
although 
the 82750LV 
contains 
protective 
circuitry to resist damage from static elec- 
trical discharge, 
this device 
is sensitive 
to ESO lev- 
els above 
1000V. Always 
take precautions 
to avoid 
high static voltages 
or electric 
fields. 


Recommended 


Parameter 
Condition 


Min 
Nom 
Max 


Supply Voltage 
(Ved 
4.50V 
5.00V 
5.50V 


Operating 
Temperature 
Range 
O°C 
70°C 


Vee = 5V, TeASE = 25°C 


Table 
5-3. DC Characteristics 


Symbol 
Parameter 
Mln 
Typ 
Max 
Units 
Notes 


Vll 
Input LOW Voltage 
0.8 
V 
vcc = 4.5V 


VIH 
Input HIGH Voltage 
2.0 
V 
vcc = 5.5V 


VOl 
Output LOW Voltage 
0.5 
V 
VI = 0.1 vcc. IOl = 4 mA 


VOH 
Output HIGH Voltage 
3.7 
V 
VI = 0.9 vcc. IOH = 4 mA 


III 
Input Leakage Current 
-70 
/LA 
vn. = OV 


IOl 
Output Low Current 
4 
mA 


Ice 
Power Supply Current 
30 
mA 


CIN 
Input Capacitance 
7 
pF 


COUT 
Output Capacitance 
34 
pF 


VT(1) 
Input Threshold 
Voltage 
1.3 
V 


VT(2) 
Input Threshold 
Voltage 
Vee/2 
V 


NOTES: 
1. Specified 
for all input pins except 
MD(31:0), 
VWE#, 
BE#(3:0). 


2. Specified 
for MD(31:0), 
VWE#, 
BE#(3:0). 


1-230 
I 


intel~ 
82750LV 


Table 
5-4. ClK 
DC Characteristics 


Vee = 5V. TeASE = 25°C 


Symbol 
Parameter 
Mln 
Typ 
Max 
Units 
Notes 


VIL 
Input LOW Voltage 
0.9 
V 
vcc = 4.5V 


VIH 
Input HIGH Voltage 
3.85 
V 
vcc = 5.5V 


IlL 
Input LOW Leakage 
±1 
p.A 
VIH = vcc 


IIH 
Input HIGH Leakage 
±1 
p.A 
VIL = OV 


VT 
Input Threshold 
Voltage 
2.5 
V 


CIN 
Input Capacitance 
7 
pF 


Output 
Delay 
and Rise Time 
Versus 
load 
Capacitance 


Typical 
Output 
Valid Delay 
Typical 
Output 
Rise Time 
vs L.oad Capacitance 
vs load 
Capacitance 


8 
8 


Valid 
6 
Rise :~ 


Delay 
Time 
Time 
(ns) 
(ns) 
4 


2 
2 


10 
20 
30 
40 
50 
10 
20 
30 
40 
50 


'1. (pr) 
'1. (pr) 


241345-18 
241345-19 
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5.2 
A.C. Characteristics 


NOTE: 
Industry 
standard 
gate array test methodologies 
do not include 
full AC characterization. 
The AC characteris- 
tics below 
were determined 
through 
simulation 
and are provided 
as design 
guidelines 
only. These 
parame- 
ters are not fully tested 
in production. 
As per Tl's 
standard 
gate array test methodology, 
two AC parametric 
measurements 
are made during production 
to guarantee 
the speed of the device. 
These 
measurements 
are 


indicated 
by an • in the table below. 


Table 5-1. A.C. Characteristics 
vcc = 
5V ±10% 
TA= o-c e 
+70·C 
CL = 
45pF 
, 


Symbol 
Parameter 
Mln 
Max 
Units 
Figure 
Notes 


t1 
V1ClK 
High to CHANREQ# 
low 
6 
26 
ns 
5-1 


t2 
V1ClK 
High to CHANREQ# 
High 
5 
23 
ns 
5-1 


t3 
GAVAlEN 
Setup before V1ClK 
High 
1 
ns 
5-1 


t4 
MREQ# 
Setup before V1ClK 
High 
5 
ns 
5-1 


t5 
MREQ# 
Hold after V1ClK 
High 
3 
ns 
5-1 


t6 
BUSEN # low to VA(23:0) 
Enabled 
2 
43 
ns 
5-1 


t7 
V1 ClK 
High to VA(23:0) Valid 
35 
ns 
5-1 


t8 
V1ClK 
High to VA(23:0) 
Invalid 
6 
ns 
5-1 


t9 
BUSEN # High to VA(23:0) 
Disabled 
3 
ns 
5-1 


t10 
VWE # Setup before V1 ClK 
High 
5 
ns 
5-1 


t11* 
V1ClK 
High to RAS# 
low 
5 
32 
ns 
5-1 


t12 
V1ClK 
High to RAS# 
High 
4 
27 
ns 
5-1 


t13 
V1ClK 
High to CAS# 
low 
7 
31 
ns 
5~1 


t14 
V1ClK 
High to CAS # High 
6 
27 
ns 
5-1 


t15 
V1 ClK 
High to MUX low 
4 
28 
ns 
5-1 


t16 
V1 ClK 
High to MUX High 
5 
34 
ns 
5-1 


t17 
V1 ClK 
High to MSTRB # Low 
4 
25 
ns 
5-1 


t18 
V1 ClK 
High to' MSTRB # High 
4 
25 
ns 
5-1 


t19 
MD(31 :0) Address 
Setup before V1ClK 
High 
0 
ns 
5-1 


t20 
MD(31 :0) Hold after V1ClK 
High 
13 
ns 
5-1 


t21 
V1ClK 
High to DTOE# 
low 
7 
30 
ns 
5-1 


t22 
V1 ClK 
High to DTOE # High 
6 
24 
ns 
5-1 


t23 
V1ClK 
High to MRDY # low 
6 
24 
ns 
5-1 


t24 
V1ClK 
High to MRDY # High 
5 
20 
ns 
5-1 


t25 
V1ClK 
High to WE# 
low 
7 
30 
ns 
5-2 


t26 
V1CLK High to WE# 
High 
7 
31 
ns 
5-2 


t27 
NXTFST # low 
Setup to V1ClK 
High 
0 
ns 
5-3 


t28 
NXTFST # High Setup to V1 ClK 
High 
0 
ns 
5-3 


t29 
TRNFR /I low 
Setup to V1ClK 
High 
0 
ns 
5-5 


t30 
TRNFR # High Setup to V1 ClK 
High 
0 
ns 
5-5 


t31 
RFRSH /I low 
Setup to V1 ClK 
High 
0 
ns 
5-6 


1-232 
I 


82750LV 


Table 5-1. A.C. Characteristics 
(Continued) 
vcc = 
5V ±10%, 
TA= 
O·C to +70·C, 
CL = 45 pF 


Symbol 
Parameter 
Mln 
Max 
Units 
Figure 
Notes 


t32 
RFRSH # High Setup to V1 ClK 
High 
0 
ns 
5·6 


t33 
V1ClK 
High to DSTRB# 
low 
6 
26 
ns 
5-7 


t34 
V1ClK 
High to DSTRB# 
High 
5 
24 
ns 
5-7 


t35 
WRITE DATA Setup before V1ClK 
High 
10 
ns 
5-7 


t36 
WRITE DATA Hold after V1 ClK 
High 
35 
ns 
5-7 


t37· 
V1ClK 
High to READ DATA Enabled and Valid 
65 
ns 
5-8 • 


t38 
V1ClK 
High to READ DATA Disabled 
5 
ns 
5-8 


t39 
BRSTREQ# 
Setup before V1ClK 
High 
3 
ns 
5-9 


t40 
YVUSEl 
Hold after BRSTREQ# 
low 
5 
ns 
5-9 


t41 
YVUSEl# 
Setup before BRSTREQ# 
low 
3 
ns 
5-9 


t42 
V1 ClK 
High to CHANREQ# 
low 
6 
26 
ns 
5-9 


t43 
V1ClK 
High to CHANREQ# 
High 
5 
23 
ns 
5-9 


t44 
BUSEN # Setup before V1 ClK 
High 
6 
ns 
5-9 


t45 
VSCSEl 
# Setup before V1 ClK 
High 
5 
ns 
5-9 


t46 
V1 ClK 
High to BRSTACK # Transition 
5 
34 
ns 
5-9 


t47 
V1 ClK 
High to DClK 
Transition 
5 
25 
ns 
5-9 
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Figure 5-3. VRAM NXTFST 
Read Cycle 
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Figure 5·5. VRAM Data Transfer 
Cycle 
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Figure 5-6. VRAM Refresh 
Cycle 
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NOTES; 
1. All dimensions 
are in millimeters. 


2. Co-planarity 
within 
0.10mm. 
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6.0 
MECHANICAL 
SPECIFICATIONS 


6.1 
Packaging Outlines and 
Dimensions 


6.2 
Package Thermal Specifications 


Thermal 
impedance 
is defined 
as the ability to dissi- 
pate heat generated 
by an electronic 
device 
and is 
characterized 
by (JJA and (JJC. It is measured 
in de- 
grees 
Celcius 
per Watt. 
(JJA is the thermal 
imped- 
ance from the IC chip junction 
in still air ambient with 
the package 
mounted 
in a socket 
or directly 
mount- 
ed on a PC board. 
(JJC is the thermal 
impedance 
from 
the IC junction 
to the external 
package 
case. 


Measurements 
are typically 
taken using high air flow 
to simulate 
an infinite heat sink. The thermal 
charac- 
teristics 
of the 
160-lead 
PQFP package 
are as fol- 


lows: 


()JA = 60.0·C/W 


()JC = 18.0·C/W 


r----- 
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INTRODUCTION 


The Keying and Audio Gate Array (KAGA) serves 
three functions: (1) It is the interface between the 
Audio Digital Signal Processor (ADSP) and both the 
DVI Bus and the analog conversion components; (2) 
it generates keying signals to allow various video 
sources to be combined; (3) it contains portions of 
the phaselock loop which both provides video gen- 
lock capability and generates clock signals for the 
entire DVI System. 


The following figure shows a simplified block dia- 
gram of the KAGA gate array interconnections in a 
typical DVI system. The ADSP outputs stereo digital 
audio to a dual DJA converter through a set of regis- 
ters in the gate array. These analog outputs, filtered 
to a 17 KHz bandwidth, comprise the audio output of 
the DVI system. Similarly, digital stereo audio is input 
to the ADSP from a dual AID converter through a 
set of registers in the gate array. Additional registers 
in KAGA provide the mechanism for the ADSP to 
communicate with DVI Bus components, especially 
the Host Processor for programming information and 
VAAM for audio data. 


The KAGA chip provides the control signals for ana- 
log multiplexing the Intel 82750DB DVI Display Proc- 


82750LA 


essor (DB) video with either the host's VGA video or 
an external source on a pixel-by-pixel basis. The ac- 
tual video source selection is performed in mux 
chips capable of instantaneously switching inputs. 
The control for the keying is generated in KAGA for 
the VGA video by monitoring the video feature bus 
and in a black detect circuit for the, external AGB 
video. The keying parameters are determined by 
registers in the gate array. 


In order for the DB video to be combined with other 
video sources into a unified video stream, the sourc- 
es must be synchronized or genlocked; Le.their hor- 
izontal and vertical scanning must be carefully 
aligned. This is accomplished, in part, by synchroniz- 
ing the DB horizontal scanning to either the VGA or 
external AGB horizontal sync using a phaselock 
loop. The loop is comprised of a phase detector and 
input counters in the gate array and a loop filter and 
vea located off-chip. Together, these components 
lock the horizontal syncs and provide a pixel clock 
for the DB video. Alternatively, when genlocking is 
not required, the phaselock loop can be locked to a 
10 MHz crystal to generate the timing for a wide 
variety of video formats, with pixel rates up to 
50 MHz. The selection of clock and sync sources, as 
well as phaselock loop parameters are determined 
by registers internal to KAGA. 


Manufactured 
and tested 
for Intel by Texas 
Instruments 
in accordance 
with Texas 
Instruments, 
internal 
standards. 
l 
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KAGA tnterecnneenene 


KAGA 
Audio 
Digital 
Signal 


DVI Bus 
Processor 


Audio 
DIA 


Interface 
CONVERTERS 


Circuitry 


AID 
CONVERTERS 


Loop 
Clock 
and 
Output 
Clocks 
Filter 
Timing 
and 
VCO 
Circuitry 
and 
H &: V Syncs 


Video 
Featur. 
Bus 


I+-- VGA 
DB H,V 
&: 
Composite 
Inp 


Sync 
Video 
Video 
I+-- 
DB 
Keying 
!lux 
Inpu 
Circuitry 
r--+ 
External 
I+- 


RGB 
H &: V Sync 


Capture 
Black 


H &: V Sync 
"'""- 
Detect 
I+- 


L... 


Audio 
Output 


Audio 
Input 


RGB 
Video 
Output 


Video 
ut 


Video 
t 


External 
RGB Video 
Input 
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1.0 PIN DESCRIPTION 


1.1 Pinout 


82750LA 


••• 
Rei_OfT 


110 
OB_KEY 
10. 
DOT_ClX 


111 
VF8_0 
112 
VFB_t 
101 


113 
VFB_2 
107 


11. 
VFlL3 
103 
'" 


VFB_4 
102 
III 
VrB_5 
10. 


117 
vra_, 
III 
VFB_7 


133 
U. 
132 


131 
14. 
137 
131 
120 
131 
111 
130 
127 
141 


'" 
147 
125 
142 
124 
143 
123 
12' 
•• 
121 


13 
.0 


74 
" 
75 
" 
" 
"•• 
" 
.. 
71 
53 
4l 
54 
55 
72 


""" 
70 
" 
" 
"" 
•• 
••"••".. 
W" 
11 
•• 
WOl 
l2 
•• 
WO, 
II 
•• 


WD3 
l4 
•• 
WO• 
se 


" 


WO. " 
WO, 
31 
•• 
WD7 " 
•• 
WDI 
20 


47 
WO. 
21 


101010 
27 
42 


11I011 
21 
•• 
51 


11I012 
ae 


se 


11I013 
24 


•• 


WOI .• 
13 
•• 


11I015 " 
WOII " 
15. 
W017 
11 


'" 


W01. 
17 


157 


IIIDlI 
11 


'" 


11I020 
15 


11I021 
14 


155 
.." 
U 


IIID23 
12 
154 
11I024 
10 
153 
1>2 
..,. • 
151 
WOlf 
• 


150 
11I027 
7 
NOU 
I 
" 


WOU 
e 
•• 


11I030 · 
n 
11I031 
3 


77 


J 
"" 
.0 
241346-2 
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intel~ 


Vcc 
GNO 
M031 
M030 
M029 
M028 
M027 
M026 
M025 
M024 
GNO 
M023 
M022 
M021 
M020 
M019 
M018 
M017 
M016 
GNO 
VCC 
M015 
M014 
M013 
M012 
MOll 
M010 
M09 
M08 
GNO 
MOO 
MOl 
M02 
M03 
M04 
M05 
M06 
M07 
VCC 
TEST" 


VGA_HSYNC 
VGA_VSYNC 
VFB_7 
VFB_6 
VFB_5 
VFB_4 
VFB_3 
VFB_2 
VFB_l 
VFB_O 
OBKEY 
DOLCLK 
GNO 
EXTERNAL 
BD_EN" 
AOV_VGA 
GNO 
VGA 
OVI 
VCC 
GNO 
XO 
XI 
VCC 
ICLKO 
AREQ" 
OVINT" 
DOUT 
WDCLK 
PLAYL_R 
BCLK 
DIN 
FSYNC 
SCLK 
CAPL_R 
VA2 
VA3 
VA4 
GNO 
VCC 
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82750LA - KAGA 
TOP VIEW 


-N~.~~~~~O-N~.~~~~~O-~~.~~~~~O-N~.~W~C~O 
••••••••• 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c 


Figure 
1-1_ 82750LA 
Plnout 
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intel~ 


Pin 
Location 
Name 
1OMHZ_IN 
136 


AO 
73 
A1 
74 
A2 
75 
A3 
76 
ADV_VGA 
105 
AREQ# 
95 
ASEL# 
150 
BCLK 
90 
BD_EN # 
106 
BEO# 
159 
BE1# 
158 
BE2# 
157 
BE3# 
156 
BUSEN# 
154 
CAP HSYNC 
129 
CAPVSYNC 
128 
CAPLR 
86 
CINT# 
71 
DBCLK 
146 
DBCLKIN 
147 
DBCSYNC 
125 
DB HRST# 
137 
DB HSYNC 
127 
DB_VRST# 
138 
DB_VSYNC 
126 
DBG# 
49 
DBKEY 
110 
DBR# 
70 
DBS# 
50 
DFLAG 
43 
DIN 
89 
DMS# 
51 
DOT_CLK 
109 
DOUT 
93 
DRD# 
47 
DRS# 
72 
DSTRB# 
153 
DVI 
102 
DVINT# 
94 
I 


Table 1·1. Pin Cross Reference 
by Pin Name 


Pin 
Location 
Name 
DWR# 
48 
EXT_RGB 
144 
EXTERNAL 
107 
FSYNC 
88 
GAVALEN 
155 


GND 
2 


GND 
11 


GND 
20 
GND 
30 
GND 
41 


GND 
61 


GND 
82 
GND 
100 
GND 
104 
GND 
108 
GND 
121 
GND 
135 
GND 
141 
GND 
148 
HSYNC 
131 
ICLKD 
96 
IDFLAG 
46 
MOO 
31 
MD1 
32 
MD2 
33 
MD3 
34 
MD4 
35 
MD5 
36 
MD6 
37 
MD7 
38 
MD8 
29 
MD9 
28 
MD10 
27 
MD11 
26 
MD12 
25 
MD13 
24 
MD14 
23 
MD15 
22 
MD16 
19 
MD17 
18 


Pin 
Location 
Name 
MD18 
17 
MD19 
16 


MD20 
15 


MD21 
14 
MD22 
13 


MD23 
12 
MD24 
10 


MD25 
9 
MD26 
. 
8 
MD27 
7 
MD28 
6 
MD29 
5 
MD30 
4 
MD31 
3 
MINT# 
44 
MSTRB# 
151 
NC 
80 
NC 
134 
NC 
160 
PD1 
132 
PINT# 
45 
PLAYLR 
91 
PU1 
133 
RESET# 
42 
RGB_HSYNC 
124 
RGB_VSYNC 
123 
SCLK 
87 
SYS_CLK 
149 
TOO 
52 
T01 
53 
TD2 
54 
T03 
55 
TD4 
56 
TD5 
57 
T06 
58 
TD7 
59 
TD8 
62 
TD9 
63 
TD10 
64 
TD11 
65 
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Pin 
Location 
Name 
TD12 
66 
T013 
67 


TD14 
68 
TD15 
69 
TEST# 
40 
V1CLK 
142 


V1CLKIN 
143 
V_SYNC 
130 
VA2 
85 
VA3 
84 
VA4 
83 
VA17 
77 
VA18 
78 
VA19 
79 


Vcc 
1 


Vcc 
21 


Vcc 
39 


Vcc 
60 


Vcc 
81 


Vcc 
97 


Vcc 
101 


Vcc 
122 


Vcc 
140 


Vcc 
145 
VCO_IN 
139 
VFB_O 
111 
VFB_1 
112 
VFB---2 
113 
VFB_3 
114 
VFB_4 
115 
VFB_5 
116 
VFB_6 
117 
VFB_7 
118 
VGA 
103 
VGA-HSYNC 
120 
VGA-VSYNC 
119 
VWE# 
152 
WDCLK 
92 
XI 
98 
XO 
99 
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Pin 
Location 
Name 


1 
VCC 
2 
GND 


3 
MD31 
4 
MD30 


5 
MD29 


6 
MD28 
7 
MD27 
8 
MD26 


9 
MD25 
10 
MD24 
11 
GND 
12 
MD23 
13 
MD22 
14 
MD21 
15 
MD20 
16 
MD19 
17 
MD18 
18 
MD17 
19 
MD16 
20 
GND 
21 
VCC 
22 
MD15 
23 
MD14 
24 
MD13 
25 
MD12 
26 
MD11 
27 
MD10 
28 
MD9 
29 
MD8 
30 
GND 
31 
MDO 
32 
MD1 
33 
MD2 
34 
MD3 
35 
MD4 
36 
MD5 
37 
MD6 
38 
MD7 
39 
Vcc 
40 
TEST# 
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Table 1-2. Pin Cross Reference 
by Pin Number 


Pin 
Location 
Name 
41 
GND 
42 
RESET# 


43 
DFLAG 


44 
MINT# 


45 
PINT# 


46 
IDFLAG 


47 
DRD# 


48 
DWR# 


49 
DBG# 


50 
DBS# 


51 
DMS# 


52 
TDO 
53 
TD1 
54 
TD2 
55 
TD3 
56 
TD4 
57 
TD5 
58 
TOO 
59 
TD7 
60 
Vcc 
61 
GND 
62 
TD8 
63 
TD9 
64 
TD10 
65 
TD11 
66 
TD12 
67 
TD13 
68 
TD14 
69 
TD15 
70 
DBR# 


71 
CINT# 
72 
DRST# 
73 
AO 
74 
A1 
75 
A2 
76 
A3 
77 
VA17 


78 
VA18 
79 
VA19 
80 
NC 


Pin 
Location 
Name 
81 
VCC 
82 
GND 


83 
VA4 
84 
VA3 


85 
VA2 


86 
CAPLR 
87 
SCLK 
88 
FSYNC 


89 
DIN 
90 
BCLK 
91 
PLAYLR 
92 
WDCLK 
93 
DO UT 
94 
DVINT# 


95 
AREQ# 


96 
ICLKD 
97 
VCC 
98 
XI 
99 
XO 
100 
GND 
101 
VCC 
102 
DVI 
103 
VGA 
104 
GND 
105 
ADV 
VGA 
106 
BD_EN # 


107 
EXTERNAL 
108 
GND 
109 
DOT_CLK 


110 
DBKEY 
111 
VFB_O 


112 
VFB_1 


113 
VFBJ 
114 
VFB_3 


115 
VFB_4 


116 
VFB_5 


117 
VFB_6 


118 
VFB_7 
119 
VGLVSYNC 
120 
VGLHSYNC 


Pin 
Location 
Name 


121 
GND 


122 
VCC 
123 
RGB_VSYNC 


124 
RGB_HSYNC 


125 
DB_CSYNC 


126 
DB_VSYNC 


127 
DB_HSYNC 


128 
CAP_VSYNC 


129 
CAP_HSYNC 


130 
V_SYNC 


131 
H_SYNC 


132 
PD1 
133 
PU1 
134 
NC 
135 
GND 
136 
10MHLIN 
137 
DB_HRST# 


138 
DB 
VRST# 
139 
VCO_IN 


140 
VCC 
141 
GND 
142 
V1CLK 
143 
V1CLKIN 
144 
EXT_RGB 
145 
VCC 
146 
DB_CLK 


147 
DB_CLKIN 
148 
GND 
149 
SYS_CLK 


150 
ASEL# 
151 
MSTRB# 


152 
VWE# 
153 
DSTRB# 
154 
BUSEN# 
155 
GAVALEN 
156 
BE3# 


157 
BE2# 


158 
BE1# 
159 
BEO# 
160 
NC 
I 
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1.2 Pin Descriptions 


1.2.1 KAGA 
AUDIO 
AND 
DVI BUS SIGNAL 
DEFINITIONS 


Symbol 
Type 
Name and Function 


A3-AO 
I 
A3-AO: 
DSP address output used in the decode 
of external 
data memory and boot 
memory space. 


AREQ# 
0 
AUDIO 
REQUEST 
SIGNAL: 
This signal is asserted 
indicating 
that the DSP is 
requesting 
control of the DMA Channel for VRAM accesses. 


ASEL# 
I 
AUDIO 
SELECT 
SIGNAL: 
This signal indicates 
that the Host has acknowledged 
the audio request. 


BCLK 
0\ 
DAC BIT CLOCK 
SIGNAL: 
The rising edge of this clock will shift the serial data into 
the internal serial shift register of the DAC. 


BE#3-BE#O 
BI 
BYTE ENABLES 
3-0: 
These signals are used by PB and DVI bus devices to 
indicate which bytes in the 32·bit VRAM word are being accessed. 
These signals 
should be driven by the device causing the DVI bus cycle during a VRAM access. 
These signals are driven by the host interface 
logic during a host access of 
registers 
residing in KAGA. 


BUSEN# 
I 
BUS ENABLE 
SIGNAL: 
This signal is driven by PB in response 
to the HREQ# 
signal. BUSEN# 
indicates 
that the DVI bus can now be used by KAGA provided 
the 
ASEL# 
signal is asserted. 


CAPLR 
I 
CAPTURE 
LEFT fRIGHT 
CLOCK: 
This signal is generated 
in the ADC. KAGA 
receives 
this clock from the ADC whose output frequency 
is at the word rate. When 
this clock is high, left channel 
data is output. When this signal is low, right channel 
data is output. 


CINT# 
0 
CAPTURE 
INTERRUPT: 
Generated 
internal to KAGA. This signal is connected 
to 
the highest priority interrupt 
on the DSP (IRQ2#). 
The signal CAPLR 
initiates a 
CINT #. CINT # is negated when either of the audio input registers 
are-read. 


DBG# 
I 
BUS GRANT 
INPUT: This signal originates 
from the DSP (signal BG#). 
Control of 
the DSP bus is transferred 
to the DVI Device when the DSP asserts the BG # 
signal. 


DBR# 
0 
BUS REQUEST 
OUTPUT: 
This signal is connected 
to the DSP BR # signal. When 
the DVI Device requires access to the DSP external 
bus or if the DVI Device must 
halt the DSP it will assert DBR #. If the DSP is not performing 
an external 
access, 


then it will respond to the DBR # signal in the same cycle by tri-stating 
the DSP data 
and address 
bus as well as DMS#, 
DBS#, 
DRD#, 
and DWR#. 


DBS# 
I 
BOOT MEMORY 
SELECT: 
This signal is the DSP BMS# 
signal. The DSP signal 
DBS # is used to select the boot memory interface. 


DFLAG 
0 
DSP FLAG: This signal is the DMARDY 
(DMA Ready) signal generated 
internal to 
KAGA. This signal is tied to the asynchronous 
input FI (Flag In) of the DSP. When 
DMARDY 
is asserted 
(high) the DSP is then able to read or write VRAM. 


DIN 
I 
SERIAL 
DATA INPUT: This is the serial data received from the ADC. Audio data 
bits are presented 
MSB first, in 2's complement 
format. 


DVINT# 
6 
DVI DEVICE 
INTERRUPT: 
Generated 
internal to KAGA. This signal is asserted 
when the DSP writes to the Message 
to DVI Device register (MDVI). This signal is 
negated 
when the DVI Device reads the most significant 
byte of the MDVI register. 


DMS# 
I 
DATA MEMORY 
SELECT: 
DSP strobe signal for data memory accesses. 
When the 
DMS # signal is asserted 
this indicates 
that the address bus is being driven with a 
data memory address and memory can be selected. 


DOUT 
0 
DATA OUTPUT: 
Serial data presented 
to the playback 
DAC. The DAC receives 
this 
data (with the MSB first) in Binary Two's Complement 
(BTC) form. The DAC takes 
this data as input and converts 
it into analog form. 


I 
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FSYNC 
I 
FRAME SYNC SIGNAL: 
This signal is generated 
internal to the ADC. KAGA receives 
this clock from the ADC. This is an output clock which goes high coincident 
with the 
start of the first data bit (MSB) and falls immediately 
after the last data bit (LSB). 


GAVALEN 
I 
GATE ARRAY 
VALID 
ADDRESS 
LATCH 
ENABLE: 
This signal is used to latch the 
VRAM address from the MD31-MDO) 
lines. The MD31-MDO) 
bus is latched 
at the 
falling edge of GAVALEN. 
When GAVALEN 
and BUSEN# 
are asserted, 
the MD31- 
MDO) bus is used for VRAM address information. 


ICKLD 
0 
INPUT CLOCK 
DIGITAL: 
This clock is generated 
in KAGA. This is the source clock 
for the ADC. This clock runs the digital filter internal to the ADC on the CS2 board. 
ICLKD must be 384 times the desired sample rate. 


IDFLAG 
I 
INPUT DSP FLAG: Input signal to KAGA form the DSP. This signal may be set, 
toggled, 
or cleared 
in software 
to signal events or conditions 
to the DVI Device. 


MD31-MDO 
BI 
MEMORY 
DATA: This bus is the DVt bus data path. MD(O) is the least significant 
bit. 


This bus can be used at the start of a DVI bus cycle to temporarily 
hold the VRAM 
address 
until latched by GAVALEN. 


MINT# 
0 
MESSAGE 
BIT: Generated 
in KAGA. This signal is asserted 
when the DVI Device 
completes 
a write to the Message to DSP register (MDSP). This signal is negated 
when the DSP reads the MDSP register. 


MSTRB# 
I 
MEMORY 
STROBE: 
This signal is used to latch data during a VRAM access. This 
signal is generated 
whenever 
a VRAM access falls below the 15M byte boundary. 


PINT# 
0 
PLAYBACK 
INTERRUPT: 
Generated 
in KAGA. Interrupt 
signal to the DSP. This 
signal is connected 
to the next highest priority interrupt 
on the DSP (IRQ1 #). The 
frequency 
of PINT # is equivalent 
to the audio output sample rate. This signal is 
negated when either of the audio playback 
registers 
are written. 


PLAYLR 
0 
PLAYBACK 
LEFT fRIGHT 
CLOCK: 
This signal is the playback 
left/right 
selector 
signal. PLA YLR 
has a 50% duty cycle and is equivalent 
to the playback 
sample 
rate. When this signal is high the DOUT data is right channel 
information. 


RESET# 
I 
KAGA 
RESET INPUT: This signal when asserted 
will initialize KAGA internal 
registers and counters. 
When asserted, 
KAGA will place the DSP in a reset state. 


SCLK 
I 
SERIAL 
DATA CLOCK: 
Capture bit clock generated 
internal to the ADC. Data is 
clocked 
out on the rising edge of this clock. This signal is 64 times the source clock 
(ICLKD) of the ADC. 


TD15- TDO 
BI 
DSP DATA BUS: Data communications 
for register transfers 
between 
KAGA and 
DSP. 


TEST# 
I 
TEST PIN: When this signal is asserted 
the outputs of KAGA are tri-stated. 


V1CLKIN 
I 
V1 CLOCK: 
This is the main synchronizing 
signal for the DVI bus. Although 
the DVI 
bus is asynchronous 
in nature and is meant to be event driven, there are some 
signals that must be applied to PB synchronously. 
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1.2.1 KAGA 
AUDIO 
AND 
DVI BUS SIGNAL 
DEFINITIONS 
(Continued) 


Symbol 
Type 
Name and Function 


VWEII 
BI 
VRAM WRITE 
ENABLE: 
This signal is the read/write 
status line that further defines 
the bus cycle into a read or write type of cycle. This signal is driven by KAGA when 
KAGA is causing a DVI bus cycle. VWE 11 is asserted 
for a write operation 
and 
negated for a read operation. 
When performing 
Device Register 
accesses 
this signal 
is driven by the host. 


VA4-VA2 
I 
VRAM ADDRESSES 
VA4-VA1: 
These are the bits that define the offset for a DVI 
Device register access. The audio portion of KAGA uses only 2 locations 
(OOOOb, 
0001 b). 
• 


VA19-VA17 
I 
VRAM ADDRESSES 
VA19-VA17: 
These are the bits that define the DVI Device ID 
for a DVI Device register access. 
KAGA has been assigned 
an ID of 5 (101 B). 


WDCLK 
0 
WORD CLOCK: 
This is a playback 
clock. This clock is generated 
in KAGA. Clock 
frequency 
is 2 times the playback 
sample rate. 


XI 
I 
CRYSTAL 
INPUT (16.9344 
MHz): Source clock for audio capture 
and playback 
clock generation 
circuitry. 


XO 
0 
CLOCK 
OUTPUT 
(16.9344 
MHz): Oscillator 
output signal. 


I 
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1.2.2 KAGA 
KEYING/GENLOCK 
SIGNAL 
DEFINITIONS 


Symbol 
Type 
Name ,and Function 


ADV_VGA 
0 
ADVANCE 
VGA: Not used in this design. This is a pre-VGA signal. 


BD_EN 
# 
0 
BLACK DETECT 
ENABLE: Enable signal for the Black Detect circuitry. 
Black 
Detect examines 
the R G and B external video input and generates 
video mux 
control 
signals when black is detected. 


10MHZ_IN 
I 
10 MHz INPUT CLOCK: The source of this clock is the 10 MHz oscillator 
output of 
the DSP chip. This clock can be selected 
as the source clock for the PB or DB. 


SYS_ClK 
I 
25 MHz INPUT CLOCK: The source of this clock is a 25 MHz oscillator. 
This clock 
can be selected 
as the source clock for the PB or DB. 


DB_CSYNC 
I 
DB COMPOSITE 
SYNC INPUT: The source of this signal is the Video Display 
Processor 
(DB). This signal contains 
the vertical 
serration 
and equalization 
information 
as well as horizontal 
synchronization 
pulses. 


DOT_ClK 
I 
VGA DOT CLOCK: Dot frequency 
originating 
from the Video Graphics 
Adaptor 
(VGA) circuitry. 


DVI 
0 
DVI VIDEO SELECT: Active high signal. When this signal is asserted 
the video 
mux passes the DVI video to the output video amplifiers. 


EXTERNAL 
0 
EXTERNAL 
SELECT: Software 
can force this signal. If this signal is active then 
the external video (R,G,B) will pass through the video mux on the DS2 board to the 
video output. 


EXT_RGB 
I 
EXTERNAL 
RGB: Status bit which determines 
the power on default state for video 
keying operation. 


DB_HRST# 
0 
HORIZONTAL 
RESET OUTPUT: 
KAGA generates 
this signal. Assertion 
of this 
signal will reset all of the horizontal 
timing to the start of the horizontal 
line. 


CAP_HSYNC 
I 
CAPTURE 
HORIZONTAL 
SYNC: Horizontal 
video synchronization 
signal. The 
source of this signal is the Video Capture circuitry. 


VGLHSYNC 
I 
VGA BUS HORIZONTAL 
SYNC: Horizontal 
video synchronization 
signal. The 
source of this signal is the VGA circuitry. 


DB_HSYNC 
I 
DB HORIZONTAL 
SYNC: The source of this signal is the Video Display Processor 


(DB). This is a video synchronization 
signal which is asserted 
at the beginning 
of 


every line and ends a programmed 
time later. 


RGB_HSYNC 
I 
EXTERNAL 
HORIZONTAL 
SYNC: Horizontal 
video synchronization 
signal. The 
source of this signal is the External RGB circuitry. 


PD1 
0 
PUMP DOWN OUTPUT: 
Phase Detector 
output. This signal together 
with the PU1 
signal are inputs to the charge pump to the Pll 
circuit. 


PU1 
0 
PUMP UP OUTPUT: 
Phase Detector 
output. This signal together 
with the PD1 


.signal are inputs to the charge pump to the Pll 
circuit. 


V1ClK 
0 
PB CLOCK SOURCE: This signal provides the fundamental 
timing for the 
82750PB. 


V1ClKIN 
I 
PB CLOCK INPUT TO KAGA: This is the PB clock input signal. The V1 ClK 
signal 
is fed back into KAGA so that timing referenced 
to V1 ClK 
internal to KAGA is 
identical 
to timing referenced 
to all other devices. 


DB_ClK 
0 
DB CLOCK SOURCE: This signal provides the fundamental 
timing for the 
82750DB. 


DB_ClKIN 
I 
DB CLOCK INPUT TO KAGA: The DB_ClK 
signal is fed back into KAGA so that 
timing referenced 
to DB_ClK 
internal to KAGA is identical 
to timing referenced 
to 
all other devices. 
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1.2.2 KAGA 
KEYING/GENLOCK 
SIGNAL 
DEFINITIONS 
(Continued) 


Symbol 
Type 
Name and Function 


DB_KEY 
I 
DB KEY: The 827500B 
Alpha Bit-7. This signal is used with the VGA signal in 
order to define a valid area on the active display for VGA Keying. 


VGA 
0 
VGA VIDEO SELECT: Active high signal. When this signal is asserted 
the video 
mux passes the VGA video to the output video amplifiers. 


VFB7-VFBO 
I 
VIDEO FEATURE 
BUS: Pixel Select Inputs originating 
from the Video Feature 
Connector. 
These signals are compared 
to the Chroma Register value to generate 
the OVIIVGA 
keying signals. 


OB_VRST# 
0 
VERTICAL 
RESET OUTPUT: 
Assertion 
of this signal by KAGA will reset all vertical 
timing in the DB. 


CAP_VSYNC 
I 
CAPTURE 
VERTICAL 
SYNC: Vertical Video Sync signal originating 
from the 
capture circuitry. 


OB_VSYNC 
I 
DB VERTICAL 
SYNC: Video Sync signal which can be programmed 
to start and 
end in each field. DB is the source of this signal. 


VGA-VSYNC 
I 
VGA VERTICAL 
SYNC: Vertical Video Sync signal generated 
by the VGA circuitry. 


HSYNC 
0 
HORIZONTAL 
SYNC: System Output Horizontal 
Sync to monitors 
etc. 


VSYNC 
0 
VERTICAL 
SYNC: System Output Vertical Sync to monitors 
etc. 


VCO_IN 
I 
VCO INPUT: The source of this clock is the VCO. This clock can be selected 
as 
the source for the PB or DB. 


RGB_VSYNC 
I 
EXTERNAL 
VERTICAL 
SYNC: Vertical Video Sync signal generated 
by the 
External RGB circuitry. 


I 
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2.0 INTERNAL ARCHITECTURE 


2.1 Audio 


2.1.1 OVERVIEW 


The audio portion 
of KAGA consists 
of 18 memory- 
mapped 
registers 
addressable 
from the DVI Bus, the 
. 


ADSP Bus or both, plus logic to support 
their func- 


tions. The ADSP typically 
accesses 
these 
registers 
to communicate 
with the Host, to input data from the 
AID 
Converters, 
to output data to the 0/A Convert- 
ers, and to exchange 
audio and program 
data with 
VRAM. 
Similarly, 
the Host (or other 
DVI device) 
will 
access 
these 
registers 
to load audio software 
or to 
check the status of the audio subsystem. 
A detailed 
block diagram 
of the audio portion 
of KAGA and its 
interconnections 
to the ADSP, the DVI Bus and the 
AID 
and 0/A Converters 
is shown 
in Figure 
2-1. 
Each of the audio gate array functional 
components 
is described 
in greater 
detail 
in the following 
sec- 
tions of this chapter. 
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2.1.2 REGISTER 
CONFIGURATION 


The ADSP registers 
are implemented 
in low external 


data 
memory 
space 
(OOOO-OOOBH). Table 
2-1 


shows 
a listing 
of their 
names, 
addresses 
and ac- 


cessibility 
from each bus. All are 16 bits except 
for 


the 
BOOT 
Register, 
which 
is determined 
by the 
ADSP to be 8 bits. Each 
16-bit register 
can be ac- 


cessed 
with either 
byte or word 
operations. 
Some 
pairs of registers 
share the same address, 
with one 
read-only 
and 
the 
other 
write-only 
with 
respect 
to 
the ADSP Bus. 


The 
DVI 
registers 
are 
located 
in 
the 
range 
of 
FAOOOOHto FA0007H 
on the DVI Bus, which 
con- 
forms to the Audio 
subsystem 
having 
a DVI Device 
ID of 5. Each register 
can be accessed 
with either 


byte, word or long word operations. 


Table 
2-1. Audio 
Registers 


Register 
DVI 
DVI 
ADSP 
ADSP 
Register 
Description 
Address 
R/W 
Address 
R/W 


ACS 
OOOOH 
R/W 
Audio Command 
and Status 


MDVI 
FAOOO2H 
R 
0OO1H 
R/W 
Message 
to DVI Device 


MDSP 
FAOOO4H 
R/W 
0OO2H 
R 
Message to DSP Device 


SRCS 
0OO3H 
R/W 
Sample Rate Command 
and Status 


RVRO/WVRO 
0OO4H 
R/W 
Least Significant 
VRAM Read/Write 
Data' 


RVR1/WVR1 
0OO5H 
R/W 
Most Significant 
VRAM Read/Write 
Data' 


CPAR 
0OO6H 
R/W 
Capture/Playback 
Audio Right' 


CPAL 
0OO7H 
R/W 
Capture/Playback 
Audio Left' 


WVRADDO 
0OO8H 
R/W 
Least Significant 
VRAM Write Address 


WVRADD1 
0OO9H 
R/W 
Most Significant 
VRAM Write Address 


RVRADDO 
OOOAH 
R/W 
Least Significant 
VRAM Read Address 


RVRADD1 
OOOBH 
R/W 
Most Significant 
VRAM Read Address 


DCS 
FAOOO6H 
R/W 
DVI Device Command 
and Status 


BOOT 
FAOOOOH 
R/W 
BOOT 


NOTE: 
'Two 
registers 
sharing 
one address. 
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r--------------.,.._ .•..••..•.•. 
_.•....,. 


AUDIO 
CONTROL + 
-+1 
AND 
STATUS 
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Figure 2·1. Audio Gate Array Architecture 
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2.1.3 DMA REGISTERS 


The audio subsystem's interface to the DVI Bus con- 
sists of four 16-bit DMA Data Registers (RVRO, 
RVR1, WVRO,WVR1) which are used to hold VRAM 
data, two 
16-bit DMA Write Address Registers 
(WVRADDO, WVRADD1) and two 16-bit Read Ad- 
dress Registers (RVRADDO,RVRADD1). 


The ADSP writes data to VRAM by first loading the 
VRAM destination into the Write Address Registers 
and then loading the data into the WVR Data Regis- 
ters, least significant word first. The VRAM write op- 
eration begins when the most significant Data Regis- 
ter is loaded. 


The ADSP reads data from VRAM by loading the 
VRAM access location into the Read Address Reg- 
isters, least significant word first. The data can then 
be read from the RVR Data Registers. The VRAM 
read operation begins when the most significant 
Read Address Register is loaded. 


After a DMA read or write operation is initiated, the 
hardware requests control of the DVI Bus for one 
transfer. Once handshaking signals are exchanged, 
the data is transferred and another DMA operation 
may then be requested. 


If blocks of contiguous VRAM data are to be trans- 
ferred, the autoincrement mode should be used. 
When this mode is selected for writing VRAM data, 
the Write Address Register is post-incremented by a 
long or double word after the completion of each 
audio DMA cycle. Additionally, a new cycle is initiat- 
ed each time the most significant word of data is 
written. When this mode is selected for reading 
VRAM data, the read address register is post-incre- 
mented by a double word after the completion of 
each audio DMA cycle. Additionally, a new cycle is 
initiated each time the most significant word of data 
is read. 


2.1.4 BOOT 
REGISTER 


The Boot Register is an a-bit register that is used to 
initially load the ADSP internal program RAM. The 
gate array allows a DVI device to emulate a Boot 
ROM, which is typically used for this purpose. Hand- 
shaking signals are used with the ADSP to operate it 
in single-step mode to allow for the long access time 
required to obtain boot data from the DVI Bus. 


2.1.5 MESSAGE 
REGISTERS 


The KAGA gate array has two 16-bit registers that 
are used to exchange data between the ADSP and 
DVI Buses. The first is the Message to DVI Register 
(MDVI). This is a read/write register on the ADSP 
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Bus which can be read by DVI Devices. The DVI 
Command and Status Register (DCS) indicates 
when there are unread messages in this register and 
an output pin on the array can be used as a mes- 
sage interrupt. The second is the Message to DSP 
Register (MDSP).Similarly, this is a read/write regis- 
ter on the DVI Bus which can be read by the ADSP. 
The Audio Command and Status Register (ACS) in- 
dicates when there are unread messages in this reg- 
ister and an output pin on the array can be used as a 
message interrupt. The MDSP Register is typically 
used to load program data to the ADSP once the 
boot loading process is complete. 


2.1.6 PLAYBACK 
AND CAPTURE 
REGISTERS 


Two write-only 
16-bit Playback Audio 
Registers 
(PAL, PAR) are available to the ADSP to output 
stereo audio data to the dual 0/ A Converter. These 
contain the left and right channels of audio, respec- 
tively. Data placed in these registers is combined to 
form a 32-bit serial data word which is then shifted 
out to the 0/ A Converter. 


Two read-only 16-bit Capture Audio Registers (CAL, 
CAR) are available to the ADSP to acquire stereo 
audio data from the dual AID Converter. These con- 
tain the left and right channels of audio, respectively. 
Data contained in these registers has been convert- 
ed from a 32-bit serial data word which was previ- 
ously shifted in from the AID Converter. 


The rate at which data is played back and captured 
is determined by two independent six-bit numbers 
stored in the Sample Rate Command and Status 
Register (SRCS). 


2.1.7 COMMAND 
AND STATUS 
REGISTERS 


There are two 16-bit read-write KAGA registers, ACS 
and DCS,which are accessible to the ADSP and DVI 
Buses respectively. They are used to control config- 
urations and modes of operation of the audio sub- 
system and to monitor the status of various opera- 
tions taking place within the system. 
' 


2.2 
Keying and Genlock 


2.2.1 OVERVIEW 


The keying and genlocking portion of the KAGA gate 
array performs several functions: It provides the tim- 
ing and control signals to allow DVI-generated video 
to be synchronized and mixed with either digital VGA 
or analog RGB sources. It also permits the DVI vid- 
eo to be locked to an external synchronization 
source, such as a video capture board or studio 
sync. Additionally, it contains components for a crys- 
I 


tal-based frequency synthesizer which can produce 
pixel clocks up to 50 MHz to satisfy a wide .rangeof 
video format options. Finally, it allows an Indepen- 
dent selection of external clock frequencies up to 
50 MHz to drive both the DVI Display Processor (DB) 
and the DVI Pixel Processor (PB). 


The gate array contains several subsystems to per- 
form the above tasks: (1) four multiplexers to select 
horizontal and vertical sync signals for the system's 
video output and DB; (2) two multiplexers to select 
the clocks used for DB and PB; (3) two 12-bit coun- 
ters and a phase detector for use in the phaselock 
loop generating and synchronizing clock freq~en- 
cies; (4) keying logic to select the mode of the v~deo 
keying and to control the keying of the DVI video 
with the VGA video; and (5) seven memory-mapped 
registers addressable from the DVI Bus to control 
and monitor the operation of the other subsystems 
in this portion of the gate array. A detailed block 
diagram of the keying and genlock portion of KAGA 
and its interconnections to the DVI Bus, PB, DB, var- 
ious video and clock sources, the phaselock loop 


I 
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and the keying circuit is shown in Figure 2-2. Each of 
the keying and genlock functional component~ is de- 
scribed in greater detail in the following sections of 
this chapter. 


2.2.2 VIDEO 
SYNC 
MUXES 


The Video Sync Muxes select the horizontal andver- 
tical sync pulses which accompany the output video 
of the DVI system. The Horizontal Sync Mux can 
select from DB, VGA and RGB Horizontal Syncs and 
DB Composite Sync. The Vertical Sync Mux can se- 
lect from DB, VGA, RGB and Capture Vertical 
Syncs. If VGA or RGB video is the only output of the 
system, choosing the respective syncs is in order. If 
the Display Processor is to be genlocked to those or 
other video sources, the appropriate combination of 
DB syncs should be chosen. Although the Capture 
Vertical Sync can be selected for Video Sync Out- 
put, it is advisable to synchronize the DB to it by 
selecting it for the DB Vertical Reset and then 
choosing the DB Vertical Sync in the Video Sync 
Mux. 
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Figure 2·2. Keying and Genlock 
Gate Array Architecture 
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2.2.3 DISPLAY 
PROCESSOR 
VERTICAL 
RESET 
MUX 


The Display Processor Vertical Reset Mux selects 
the sync pulse which is used to reset the vertical 
display counter in the DB. The Vertical Reset Mux 
can select from VGA, RGB and Capture Vertical 
Syncs and is used for genlocking purposes. 


2.2.4 PHASELOCK 
LOOP COMPONENTS 


The block 
diagram of 
a frequency 
synthesizer 
phaselock loop is shown in Figure 2·3. A stable input 
frequency (FREF)is used as a reference to generate 
the desired output frequency. Its frequency is first 
divided in a +- N Counter after which it is used as 
one input to a Phase Detector. The other input 
comes from a Voltage Controlled Oscillator (VCO) 
whose output frequency (Foscl is divided in a -i- M 
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Counter. The Phase Detector output represents the 
difference in phase between these two divided down 
frequencies. This output goes to an external loop 
filter which controls many of the performance pa- 
rameters of the loop. The filtered control voltage 
then goes to an external VCO whose output is driven 
toward a frequency and phase that minimizes the 
phase error out of the detector. When this error ap- 
proaches zero, the loop will achieve lock (i.e., the 
two phase detector inputs will be the same). In order 
for this to occur, the frequency FREF -i- N must be 
equal Fosc 
-i- M. Rearranging terms: 


FOSC= FREFX 
(~) 


By using a crystal source for FREFand appropriately 
choosing N and M, a wide range of stable frequen- 
cies can be achieved at the output. 
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Figure 2-3. Frequency 
Synthesizer 
Phaselock Loop Block Diagram 
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The main use for this synthesized 
frequency 
is as a 
programmable 
pixel clock 
for the DVI Display 
Proc- 
essor. 
Accurate 
clock 
rates 
up to 50 MHz can be 
achieved 
using 
a reference 
frequency 
of 10 MHz. 
Alternatively, 
the 
phase lock 
loop 
can 
be used 
to 
genlock 
the DVI video to an external 
video 
source. 
In this mode the reference 
input to the Phase Detec- 
tor is the external 
input video horizontal 
sync select- 
ed by the Phaselock 
Loop Reference 
Mux. The + N 
Counter 
is not used (see Figure 2-2). The modulus of 
the + M Counter 
is set equal to the desired 
number 
of pixels per line. When the loop locks, the DVI Dis- 
play Processor 
will then be producing 
pixels in syn- 
chronism 
with the external 
source 
video. 


Within the KAGA gate array, the + M and + N Coun- 
ters are implemented 
as 12-bit counters 
and each 
can be set for any even 
modulus 
between 
10 and 
4096. Each can also be selected 
to change 
state on 
the positive 
or negative 
transition 
of its input. 
The 
Phase Detector 
is also located 
in the gate array. 
It 
compares 
the negative-going 
edges of the + M and 
+ N waveforms 
and produces 
timing pulses 
propor- 
tional to the phase error between 
the two. (The + M 
output 
can optionally 
be inverted 
before 
being 
ap- 
plied to the Phase Detector.) 
This operation 
can be 
seen more clearly by referring 
to the Phase Detector 
logic diagram in Figure 2-4 and the timing diagram of 
Figure 2-5. 
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In Figure 2-4 Inputs 1 and 2 are the outputs of the 


-i- M and +- N Counters. Assume that initially all in- 
puts and outputs are high. This determines the state 
of all the internal locations (A through E) in the dia- 
gram. If Input 1 goes low first, this causes Output 1 
to go low and stay low until Input 2 goes low, at 
which time signal E goes low briefly resetting both 
outputs to their high state. When Inputs 1 and 2 go 
high, the circuit returns to its initial state. Similarly, in 
the timing diagram, when Input 2 goes low first, Out- 
put 2 goes low and stays low until Input 1 goes low. 
In summary, the appropriate output will be asserted 
(Iow) depending on which input goes low first and 
the pulse width will be proportional to the time differ- 
ence in the input negative-going edges. In the last 
section of the timing diagrarr/, Input 1·is at a higher 
frequency than Input 2. In this case Output 1 stays 
asserted most of the time, showing that this Phase 
Detector has a high sensitivity for input frequency 
offsets (in contrast with an exclusive-or type phase 
detector). 


In order to provide some flexibility in the use of the 
Phase Detector, the output configuration can be se- 
lected under software control. One option is the one 
shown in Figure 2-4. Another provides positive-going 
output pulses (non-inverted). A third uses each as- 
serted output as the enable for a tri-stated gate 
whose output is driven to a high level. In the final 
configuration the two Phase Detector inputs appear 
at the output, permitting the use of an external 
Phase Detector. 
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2.2.5 PB AND DB CLOCK MUXES 


The clocks used to operate the PB and DB Proces- 
sors are individually selectable from the choice 
shown in Figure 2-2. The 25 MHz system clock is a 
typical choice for PB while the VCO output is usually 
used for DB. The VCO+- 2 output allows the use of a 
lower clock frequency than the VCO can provide. 


2.2.6 KEYING LOGIC 


KAGA determines the source of the system output 
video using connections to the Video Feature Bus, 
DB, the Black Detect Circuit and the Video Mux. 
There are several ways in which this choice can be 
made. The gate array can externally select DVI, 
VGA or External RGB as the sole output video. A 
second option is to make VGA video the default se- 
lection, replacing it with DVI Video only when a digi- 
tal VGA pixel value equal to a previously stored val- 
ue is detected. A third option is the same as the 
second but additionally requires a DB Keying Signal 
to be asserted for DVI to be selected. The final op- 
tion is to make External RGB Video the default se- 
lection, replacing it with DVI Video only when the 
Black Detect Circuit determines that the RGB value 
is near black. In this last case the gate array controls 
the mode of operation and the keying signal is pro- 
vided externally. 
. 


2.2.7 REGISTER CONFIGURATION 


The Keying and Genlock Registers are located in 
the range from FA0008H to FA0015H on the DVI 
Bus which conforms to the subsystem having a DVI 
Device ID of 5. Each register can be addressed with 
either byte, word or long word operations. 


Table 2-2. Keying and Genlock Registers 


The remaining loop components, consisting of a 
loop filter and a VCO, are located external to the 
chip. 


Register 
DVI 
DVI 
Register 
Register Description 
Address 
R/W 
Length 


NMOD 
FAOOO8H 
R/W 
16 Bits 
N Modulus 
MMOD 
FAOOOAH 
R/W 
16 Bits 
M Modulus 
NCTR 
FAOOOCH 
R 
16 Bits 
N Counter 
MCTR 
FAOOOEH 
R 
16 Bits 
M Counter 
GCSR 
FAOO10H 
R/W 
32 Bits 
Genlock Command and Status 
CKVAL 
FAOO14H 
R/W 
8 Bits 
Chroma Keying Value 
KMSEL 
FAOO15H 
R/W 
8 Bits 
Keying Mode Select 


I 
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2.2.8 MODULUS 
AND 
COUNTER 
REGISTERS 


The 
16·bit 
read-write 
N Modulus 
(NMOD) 
and 
M 
Modulus 
(MMOD) 
Registers 
are used to determine 
the values 
to which 
the 
12-bit 
-;-Nand 
-i- M Coun- 
ters respectively 
divide 
their input clock 
frequency. 
The 
Counters 
will 
count 
to a number 
two 
greater 
than 
the 
values 
in NMOD 
and 
MMOD. 
When 
the 
chip is reset, 
these 
registers 
are each 
loaded 
with 
the value 200H. 


The 
16-bit 
read-only 
N 
Counter 
(NCTR) 
and 
M 
Counter 
(MCTR) 
Registers 
permit 
a DVI Device 
to 
instantaneously 
read the respective 
Counter 
values. 
This feature 
is primarily 
used for diagnostic 
purpos- 
es. 


2.2.9 GENLOCK 
COMMAND 
AND STATUS 
REGISTER 


This 32-bit read-write 
register, 
GCSR is used to con- 
trol and monitor the following 
parameters 
involved 
in 
genlocking 
the DVI Video to external 
video sources: 
(1) The selection 
of the PB and DB clocks; 
(2) the 
mode of the phase 
detector; 
and (3) the choice 
of 
horizontal 
and vertical 
syncs for the various 
muxes. 


/ 


2;2.10 CHROMA 
KEYING 
VALUE 
REGISTER 


This a-bit read-write 
register, 
CKVAL, 
is used select 
and monitor the stored digital VGA pixel value which 
causes 
VGA keying to occur when that mode is se- 
lected. 


I 


2.2.11 
KEYING 
MODE 
SELECT 
REGISTER 


This a-bit read-write 
register, 
KMSEL, 
is used to se- 
lect and monitor 
the choice 
of keying 
mode as de- 
scribed 
in Section 
2.2.6. 


3.0 HARDWARE INTERFACE 


3.1 DVI Bus Register Access 


The KAGA gate array recognizes 
a DVI Bus access 
to one 
of its registers 
whenever 
the 
upper 
VRAM 
Address 
bits match the KAGA DVI Device ID (VA19- 
VA17) = 5 and the Device Strobe line (DSTRB#) 
is 
asserted 
indicating 
that 
(VA23-VA20) 
= 
F Hex. 
Therefore, 
combining 
the two conditions, 
an access 
is made whenever 
the address 
lies anywhere 
in the 
FAOOOOHto FBFFFFH 
range. Within this range only 
the four locations 
shown 
in Table 
2-1 are used for 
audio. This selection 
is made using (VA4-VA2) 
and 
the 
Byte 
Enables 
(BE#3-BE#O). 
Read 
and write 
cycles 
are differentiated 
by the state 
of the VRAM 
Write 
Enable 
signal 
(VWE#). 
This is demonstrated 
in Figure 3-1. 


VA(19:0) ZZX 
---JxXXXXXXX'- 
---JXXXXXX 


BE#(3:0) ZZX 
--JXXXXXXXX'- 
...JXXXXXX 


DSTRB# 
\-- 
1 


VWE# \ 
1 


MD(31:0) --<XXXXXXX 
) 


WRITE CYCLE 
I 


\~ 
---J;--- 


\\\\\ 


<XXX~ 


READ CYCLE 
I 
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Figure 
3·1. DVI Bus Register 
Access 
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3.2 Audio 
3.2.1 ADSP BUS REGISTER ACCESS 


A diagram of the KAGA gate array interconnections 
relating to the audio subsystem is shown in Figure 
3-2. These signals will be described in conjunction 
with various operations performed by the audio sub- 
system. In order to fully appreciate the operation of 
the audio system hardware, refer to the Analog De- 
vices ADSP-2105 data sheet and ADSP-2101 User's 
Manual. 


The KAGA gate array recognizes an ADSP Bus Ac- 
cess to anyone of its registers whenever the Data 
Memory Strobe (DMS#) and either the Data Read 
(DRD#) or the Data Write (DWR#) signals are as- 
serted. The ADSP Address Bus (A3-AO) is decoded 
to determine which register is being addressed. As 
shown in Table 2-1 byte addresses from OH to BH 
correspond to the audio registers. Read and Write 
Cycles are differentiated by whether the DWR# or 
the DRD# signal is asserted as shown in Figure 3-3. 


ADSP- 
KAGA 
DVIBUS 
2105 
Re,.t- 
ADSP 
Reset 
Reset 
ORST" 
RESET" 
RESET" 


Rd" 
Read 
Device Strobe 
ORO" 
OSTRB# 
OSTRB" 
W 
•• 
Memory 
Strobe 
Wr" 
OWR" 
t.4STRB# 
t.4STRB" 


Ot.4S" 
Data Memory 
Strobe 
Ot.4S# 
GAVALEN 
Gat. Array Valid Latch Enable 
GAVALEN 


BNS" 
Boot ••••mory 
Strobe 
OBS" 
VWE" 


VRAl.t 
Writ. 
Enable 
VWE" 


Addr 
Addr.ss 
A(3:0) 
VA(19:17) 
VRAN Addre •• ( 19: 17) 
VA(19:17) 


Data 
Data 
ro(15:0) 
VA(4:2) 
VRAN Address(4:2) 
VA(4:2) 


BR# 
Bus Reauest 
OBR# 
B£#(3 ••0) 
Byte Enable(3:0) 
BE#(3 ••0) 


BG" 
Bus Grant 
NO(31..0) 
t.4emory Dalo(31:0) 
OBG" 
NO(31 ••0) 


FI 
DNA Ready 
Device 
Interruot 
DFLAG 
OVINT" 
DlNT.o 


FO 
Flo. 
Out 
Audio 
Reauest 
10FLAG 
AREO" 
AREO" 


IR01" 
Interrupt 
Request 
1 
PINT" 
ASEL" 
Audio 
Select 
ASEL" 


IR02" 
Interrupt 
ROQuest 2 
CINT# 
Bus Enoble 
BUSEN# 
BUSEN# 


IROO# 
Int.rruJ;t 
Request 0 
_ 
AINT" 
V1CLK 
DVl Bus Clock 
V1CLK 


XI """""J", 


Disploy Proc.sso~ 
_ 
~ 
Vertical 
Sync 
XO 


TEST# - 
D/A 
Bit Clock 
BCLK 


Word Clock 
WCLK 
Play Left/Riaht 
PLAYL_R 


Serial 
Ooto Out 
DOUT 


AID 


Input Clock Digital Filter 
ICLKO 
Serial Input Clock 
SCLK 


frome 
Sync 
FSYNC 


Serial 
Data 
In 
DIN 
Capture- Left/Rioht 
CAPLR 


Figure 3·2. KAGA Audio Interconnections 
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WRITE CYCLE 
READ 
CYCLE 


0"5· 
"""""\\.. 
--J/ 


TO(15,O) --~~X:=:X8Z)~------<l~C==)(ZZm: 
\\... 
--Jr- 
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3.2.2 RESET 
AND BOOT 
LOAD 


Figure 3-3. ADSP Bus Register 
Access 


Upon power-up the audio digital signal processor 
has random data in its program memory and is held 
in a reset state. The reset is caused by the DVI Bus 
RESET# input pin on KAGA being asserted, in turn 
asserting the DRST# output to the ADSP, halting 
that processor. The output remains asserted until 
the DRST bit in the DeS register is set to a 0 by a 
DVI Device. Since this action will immediately initiate 
a boot load of program memory in the ADSP, this 
should not be done before some preparation is 
made. 


During the booting process the DVI Bus acts as a 
Boot ROM for the ADSP. Since the interaction with 
the DVI BUS is slow due to the low priority of the 
audio system, it is not possible to allow the ADSP to 
run at its normal speed. KAGA contains circuitry 
which allows the DVI Device to provide one byte of 
data at a time to the ADSP program boot by inter- 
rupting the processor after each data transfer until 
the next byte can be supplied. The procedure, de- 
scribed in Figure 3-4, is as follows: 


While the ADSP is still reset, the 8-bit BOOT Regis- 
ter is loaded from the DVI Bus. Writing of this regis- 
ter sets the Bus Ready (BRDY) status bit in the DeS 
Register to a O.The hardware reset is then removed 


by setting the DRST# bit in the DeS register to a O. 
The ADSP then reads the BOOT Register and after 
the leading edge of the DRD# signal, the gate array 
automatically 
asserts 
the 
Bus 
Request 
signal 
(DBR#) to the ADSP, halting the processor after the 
completion of the read cycle. At the trailing edge of 
the Boot Memory Strobe (DBS#) the gate array re- 
asserts BRDY, at which time the DVI Device can 
load the succeeding byte. The trailing edge of the 
DSTRB# signal from that load qualified with (DVI 
Device = 5) indicates that new data is in the Boot 
Register and automatically releases the DBR# line 
to the ADSP, allowing that processor access to this 
data. This process continues one byte at a time, until 
the entire boot program is loaded. After the last byte 
is loaded, the DVI device must set the DBR# bit to 0 
to allow the ADSP to begin executing the boot soft- 
ware. Note that the ADSP Address lines, (A3-AO), 
are not used since the BOOT Register responds to 
all DBS# cycles and the Host "knows" the order in 
which to supply the boot program. Refer to the 
ADSP User's Manual for a description of the se- 
quence of bytes which must be loaded in the boot 
process. 


In practice, the boot program contains only the soft- 
ware to allow the loading of the remaining program 
code using the MDSP Register. This second loading 
mechanism permits transfers of 16 bits and is there- 
fore more efficient. 


DSTRBM 


TO(15,8) 
_ 
•••••••'""""""'" 
__ 
---''''''.l<.l<.l'''"- 
+-_....f.:.L~~'____ 


OBS. 


ORO· 


OBR_ 


Read Boot 
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Figure 3-4. Boot Process 
I 


3.2.3 PROGRAM 
LOAD 


Once the boot program is loaded it is possible to 
load the remainder of internal ADSP program memo- 
ry with 16-bit transfers by using the Message to DSP 
Register (MDSP) and the Message to DVI Register 
(MDVI). ADSP data requests are made by writing to 
the MDVI Register. This creates an audio interrupt 
by asserting DVINT# on the DVI Bus if the DVI De- 
vice Interrupt Enable (DVIE) bit is set to a 1 in the 
DVI Device Command and Status Register (DCS). If 
the interrupt is not enabled, the DVINT bit in the 
DCS register can be polled by the DVI Device. In 
either case the DVI Device sets the DVINT bit to 0 
(and resets the interrupt) by reading the MDVI Reg- 
ister and returns program data by writing to the 
MDSP Register. This causes the AINT bit in the Au- 
dio Command and Status Register to be set to a 1. It 
also causes the MINT# pin of the gate array to be 
asserted, but this pin is not presently used in the 
system. When the ADSP polls the ACS Register and 
finds the AINT bit set to a 1, it can set that bit to a 0 
by reading the MDSP register and the whole process 
is repeated until the program transfer is complete. 


3.2.4 VRAM 
DMA TRANSFERS 


Whenever audio capture or playback takes place, 
data is transferred between the ADSP and VRAM 
using the KAGA DVI DMA interface. The timing for 
this transaction appears in Figure 3-5. As described 
in Section 2.1.3, the ADSP initiates a write access by 
first loading the VRAM destination into the write ad- 
dress registers and then loading the data into the 
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WVR data registers, least significant word first. A 
read access is initiated by loading the VRAM access 
location into the read address registers, least signifi- 
cant word first. In either case, the gate array negates 
the DMA Ready (DFLAG) output and sets the 
DMARDY status bit in the DVI Control and Status 
Register (DCS) to a 0 at the trailing edge of the reg- 
ister load pulse to prevent further requests until the 
DMA cycle is complete. The trailing edge is used to 
insure that the data or address is stable before it 
appears on the DVI Bus. The gate array also asserts 
the Audio Request (AREQ#) signal to request con- 
trol of the DVI Bus. When the assertion of both the 
Audio 
Select 
(ASEl#) 
and 
the 
Bus 
Enable 
(BUSEN#) signals are detected by KAGA, this indi- 
cates that the audio bus cycle is beginning and that 
AREQ# may be negated. It also enables KAGA to 
drive both the VRAM Write Enable (VWE#) line to 
indicate the type of cycle taking place and the DVI 
Data Bus (MD31-MDO) where address information 
is placed. Once the Gate Array Valid latch Enable 
(GAVAlEN) signal is -detected and qualified by a 
positive transition on the V1ClK, the address is re- 
placed by the data to be transferred on the (MD31- 
MDO)lines. During VRAM read cycles, the bus driv- 
ers are then tri-stated and, subsequently, the Memo- 
ry Strobe (MSTRB#) signal, qualified by the V1ClK, 
is usedto latch the read data into the RVR registers. 
For VRAM write cycles, the negation of the BU- 
SEN# signal causes the bus drivers to be tri-stated 
and the DVI Data Bus relinquished. For both read 
and write cycles the negation of the BUSEN# signal 
causes the DFLAG pin and the DMARDY status bit 
in DCS to be asserted to allow new requests. 


\~ 
-JI 
\~ 
~ 
Jr- 
\~ 
Jr- 
\~ 
~r-- 


VWE- 
.l.XXXXX=.l~ 
)@ 


DMARDYII 


AS EL" 


WSTRBII 


\'- 
....Jr- 
GAVALEN 


V1CLK 


WD(31,O) 
---------~---~(~A~DD~RE~S~SJX\._......;;D;.;.AT;.;.A_~>- 
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Figure 3-5. DMA Transfer 
Timing 
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3.2.5 AUDIO 
DATA 
CAPTURE 


Data captured by the audio system in two's comple- 
ment format is transferred serially to KAGA from a 
model CS5338 dual AID Converter manufactured by 
Crystal Semiconductor. The source of all capture 
timing is a 16.9344 MHz crystal attached to pins XI 
and XO of the gate array. This oscillator is divided 
internally based on the (C5-CO) bits of the Sample 
Rate Command and Status Register (SRCS) and 
output to the converter as an input digital clock 
(ICLKD) which runs at 384 times the software-se- 
lected sample rate, Fs, of each channel. The Con- 
verter uses this frequency to operate a sharp cutoff 
digital filter which eliminates aliasing and to generate 
clocks, timing signals and data. Figure 3-2 shows 
the AID 
interface signals. A timing diagram of these 
signals is shown in Figure 3-6. The Shift Clock 
(SCLK) is used to transfer serial data and runs at a 
frequency 64 x Fs. The Capture LefLRight 
signal 
is a 1 when right channel audio is being transferred. 
The Frame Sync (FSYNC) is a 1 when data on the 
Data In (DIN) line is active. The Capture Interrupt 
(CINT#) signal is asserted each time new data is 
available in the Capture Audio Registers (CALl 
CAR). It is negated when the ADSP reads either reg- 
ister. 


intel~ 


3.2.6 AUDIO 
DATA 
PLAYBACK 


Data output by the audio system in two's comple- 
ment format is transferred serially from KAGA to a 
model PCM66 dual DI A Converter manufactured by 
Burr Brown. The source of all Playback timing is the 
same 16.9344 MHz crystal used for audio capture. 
This oscillator is divided internally based on the (P5- 
PO)bits of the Sample Rate Command and Status 
Register (SRCS). It is then output to the Converter 
as a Bit Rate Clock (BCLK) which runs at 32 times 
the software-selected sample rate, Fs, of each 
channel and is used to transfer serial data. Figure 
3-2 shows DI A interface signals. A timing diagram of 
these signals is shown in Figure 3-7. The gate array 
also generates Word Clock (WDCLK) and Play 
LefLRight 
(PLAYLR) 
signals for the Converter in 
addition to the output data (DOUT). WDCLK is a 
square wave at frequency 2 x Fs, which is a 0 at the 
beginning of each output word. PLAYLR 
is a 
square wave at frequency Fs and is a 1 when right 
channel audio is being transferred. A Playback Inter- 
rupt signal (PINT#) is asserted by KAGA each time 
new data may be transferred to the Playback Audio 
Registers (CPAR/CPAL). It is negated when the 
ADSP writes to either of these registers. 


SCLK 


1 
--'1 
____ ---'I 
r 
FSYNC -.J 


DIN 


CINT# 1 
.....1 
L 


Figure 3·6. Audio Capture 
Timing 
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BCLK 


PLAYLR I~ 
_ 


WCLK IL.. 
_ 


DIN 


L 
L 


PINT# IL.. 
...• 
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Figure 3·7. Audio Playback 
Timing 
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3.3 
Keying and Genlock 
(DB_VSYNC), VGA Vertical Sync (VG~VSYNC) 
and RGB Vertical Sync (RGB_VSYNC). The out- 
puts may be logically inverted within the array to sim- 
plify display device compatibility. 
A diagram of the KAGA interconnections relating to 
the Keying and Genlock ,Subsystems is shown in 
Figure 3-7. These signals will be described in con- 
junction with various operations performed by these 
subsystems. 
3.3.2 DISPLAY 
PROCESSOR 
SYNCHRONIZATION 


3.3.1 VIDEO 
SYNC SELECTION 
The gate array produces a Horizontal (DB_HRST) 
and Vertical (DB_VRST) 
Reset signal to synchro- 


nize the Display Processor to a choice of input 
sources. DB_ VRST is selected by a software-con- 
trolled multiplexer whose inputs are VG~ 
VSYNC, 
RGB_VSYNC, and Capture Vertical Sync (CAP_ 
VSYNC). The DB_HRST output is the same as the 
Reference input to the Phase Detector in the Phase- 
lock Loop. When the Loop is genlocked, this output 
is synchronous with the Horizontal Reference se- 
lected by the Phaselock Loop Reference Mux. 


The gate array incorporates two Video Sync Muxes 
to 
develop 
a Horizontal (HSYNC)' and Vertical 
(VSYNC) output which synchronize the output video 
to a monitor. The HSYNC output is selected under 
software control from DB Horizontal Sync (DB_ 
HSYNC), DB Composite Sync (DB_CSYNC), VGA 
Horizontal Sync (VG~HSYNC) 
and RGB Horizon- 
tal Sync (RGB_HSYNC). The VSYNC output is se- 
lected under software control from DB Vertical Sync 


DVI 
BUS 
Reset 
KAGA 
RESETAI 
RESETAI 
Detect 
Enable 
BLACK 
Device 
Strobe 
BO_ENAI 
DETECT 
OSTRBAI 
OSTRBAI 
EXTERNAL 
External 


VWEAI 
VRAM Write 
Enable 
VWEAI 
CIRCUIT 


VA(19: 
17) 
VRAM Addr.s.( 
19: 17) 
VAC 19: 17) 


VA(4:2) 
VRAM Addro.s( 
4:2) 
VA(4:2) 
RC:! 
-!rVI 


Bvto 
Enablo(3:0) 
Enable 
Ena 
BEAI(3 ••0) 
Memor. 
Data(31 
:0) 
BEAI(3 ••0) 


1010(31..0) 
1010(31..0) 


VCA 
VCA Enable 
VIDEO 
DB Clock 
OVI Enablo 


DB Horizontal 
S~ 


OB_ClK 
OVI 
t.lUX 


OB_ClKIN 
Advance 
VCA 
DVI 
DB ComDOsite 
Svnc 
DB_HSYNC 
AOV_ VGA 


DISPLAY 
DB_CSYNC 


PROCESSOR 
DB Vertical 
SYnc 
OB_VSYNC 
Pull 
Ue 
PU 
] 
LOOP 


DB 
DB ALPHA 
Bit-7 
DB_KEY 
Pull 
Dawn 
PO 
°1 
fiLTER 
DB Horizontal 
Re.et 
OB_HRST 
DB Vortical 
Re.ot 
OB_VRST 
1 


DVI 
PIXEL 


PROCESSOR 
VI 
Clock 
V1ClK 
VCO_IN 
Voltage 
Controlled 
VCO 
L.... 
~illator 
PB 
V1ClKIN 
10MHLIN 
10MHz 


SYS_ClK 
!+--25MHz 


VCA Dat Clock 
DOLCLK 
VIDEO 
Video 
Data 
Bu. 
VFB(7:0) 
fEATURE 
VCA Horizontal 


BUS 
VCA_HSYNC 
VCA Vertical 
VCA_VSYNC 
Capture 
Horizontal 
VIDEO 
CAP_HSYNC 
CAPTURE 
Capture 
Vertical 
CAP_VSYNC 
CIRCUIT 
RCB Horizontal 


EXTERNAL 
RGB_HSYNC 
RCB Vortical 
Syst~m 
Horizontal 


RGB 
RGB_VSYNC 
HSYNC 
Systom 
Vortical 
RGBDef~ 
RGB_OFT 
VSYNC 


ble 
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Figure 3-8. KAGA Keying and Genlock 
Interconnections 
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3.3.3 SYNCHRONIZING 
THE DISPLAY 
PROCESSOR 


In order to genlock the DVI Video to another video 
source or to synchronize it to a crystal clock. the 
internal Phaselock Loop is used. Its Reference Fre- 
quency is selected by the Phaselock Loop Refer- 
ence Mux from the following signals: VGA.....HSYNC. 
RGB_HSYNC. 
Capture 
Horizontal 
Sync 
(CAP_HSYNC) 
and a divided down version of an 
input clock (10MHZ_IN). 
This is compared in the 
Phase Detector with a divided down version of the 
Voltage Controlled Oscillator Frequency (VCO_IN) 
to produce the phase error signals. Pull Up (PU1) 
and Pull Down (PD1). These outputs have several 
configurations described in Section 4.2.5. 


3.3.4 PB AND DB CLOCK 
SELECTION 


There are six clocks which can be independently se- 
lected to generate clocks for PB (PBCLK) and DB 
(DBCLK). These are VCO_IN. an internal clock at 
half the frequency of VCO_IN. 
10MHLIN. 
the 
VGA Dot Clock (DOT_CLK). 
the internal audio 
clock at 16.9344 MHz and the 25 MHz System Clock 
(SYS_CLK). PBCLK and DBCLK are input back into 
the array on pins PBCLKIN and DBCLKIN for syn- 
chronization of DVI System and Display System sig- 
nals. 
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3.3.5 VIDEO 
KEYING 


The KAGA gate array interfaces with an external 
Video Multiplexer. DB. the Video Feature Bus and a 
Black Detect Circuit in order to implement Keying. 
The interconnections are shown in Figure 3-8. The 
Multiplexer is capable of instantaneously switching 
anyone of its three inputs. DVI. VGA or RGB Video. 
to its output. depending on state of the control sig- 
nals from KAGA and the Black Detect Circuit. The 
three control signals DVI. VGA and EXTERNAL are 
mutually exclusive and. when asserted. cause the 
Multiplexer to pass the DVI. VGA or External RGB 
Video respectively. In Keying Modes O.4 and 5 (see 
Section 4.2.7) only one of these signals is enabled 
and no video keying takes place. In Keying Modes 2 
and 3 it is KAGA's responsibility to switch the DVI 
and VGA outputs to effect the video keying. The in- 
formation used to make this decision is derived from 
the data lines of the Video Feature Bus. (VFB7- 
VFBO).the keying signal from the Display Processor 
(DB_Key) and the contents of the Chroma Key Val- 
ue Register. In Keying Mode 6. the Black Detect En- 
able (BD_EN#<) signal is asserted low by the gate 
array and the Black Detect Circuit has the responsi- 
bility of switching the DVI and RGB outputs to effect 
video keying. BD_EN #<is asserted in Keying Mode 
5. as well. simplifying the external logic. The control 
signal information is summarized in Figure 3-9. 


Keying 
EXTERNAL 
BD_EN#< 
VGA 
DVI 
Mode 


0 
0 
1 
0 
1 


1 
Reserved for Future Use 


2 
0 
1 
o When VGA KEY = 1 
1 When VGA KEY = 1 
1 When VGA KEY = 0 
o When VGA KEY = 0 


3 
0 
1 
o When VGA KEY & DB_KEY = 1 1 When VGA KEY & DB_KEY = 1 
1 When VGA KEY & DB_KEY = 0 o When VGA KEY & DB 
KEY = 0 


4 
0 
1 
0 
1 


5 
1 
0 
0 
0 


6 
0 
0 
0 
0 


7 
Reserved for Future Use 


Figure 3-9. Keying Control Signals 
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Two other signals are involved in the video keying. 
The first, Advance VGA (ADV_VGA), 
is an input 
which when grounded causes Keying Mode 0 to be 
the power-on default state. If the pin is tied to Vcc or 
left floating, then Keying Mode 5 is the power-on 
default 
state. 
The 
second, 
RGB 
Default 


(RGB_DFT), is an output which is a combination of 
the information found on the VGA and DVI pins, but 
slightly earlier in time. Using it as the 0 input to a flip- 


flop clocked by the rising edge of DOT_ClK 
will 


make the timing the same as the other two outputs. 
However, if DVI and VGA are used, this signal is 
unnecessary. 


One further piece of information about the use of 
this chip is that the DVI and VGA control signals 
appear one cycle of DOT_ClK 
too early at the out- 


put pins to properly key the video. This can be cor- 
rected with two 0 flip-flops as shown in Figure 3-10. 


DVI 
D 
Q 


DOLCLK 
C 


Q 


R 


974HCT74 
KAGA 


VGA 
D 
Q 


C 


BD_EN_L 
Q 


R 
V 74HCT74 


DELAYED 
DVI 


DELAYED 
VGA 
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Figure 3-10. DVI and VGA Keying Delay Circuit 


4.0 
PROGRAMMING 
INFORMATION 
4.1.1 AUDIO COMMAND AND STATUS 
REGISTER (ACS) 


4.1 Audio 
The Audio Command and Status Register is a .16-bit 
ADSP read-write register. Figure 4-1 identifies each 
bit of this register as well as its accessibility. Writing 
data to read-only locations has no effect. 


This section describes the details of the 18 audio 
registers addressable from the DVI and the ADSP 
Buses. 
15 
14 
13 
12 
11 
10 
I 
8 
7 
6 
5 
4 
2 
I AINT I DVINT I DMARDY I TEST I CINT I PINT I 
0 I VSYNC I 0 I 
0/1 I RAUTO I WAUTO I 
BE3 
BE2 
BE1 


R 
R 
R 
R/W 
R 
R 
R 
R 
R 
R/W 
R/W 
R/W 
Figure 4-1. Audio Command and Status Register 


R/W 
R/W 
R/W 
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o 


BEO 


R/W 


Device reading the MDVI Register. This interrupt 
mechanism is primarily intended for use by the Host 
Computer but could be configured to be used by oth- 
er DVI devices. 


DMARDY-DMA Ready (Bit 13) 


This read-only bit, when set to a 1, indicates that the 
ADSP may execute an audio DMA read or write op- 
eration. 


TEST-Loop 
Test Mode (Bit 12) 


This read-write bit, when set to a 1, indicates that the 
audio input! output circuitry is in the loop back mode. 
In this mode, serially shifted data intended for the 
output D/A converter is additionally routed within 
KAGA to the AID input serial shift register. In the 
process, left and right channel data are transposed. 


CINT-Audio 
Capture Interrupt (Bit 11) 


This read-only bit, when set to a 1, indicates that 
audio 'data from the Audio AID converter resides in 
the 
Capture/Playback 
Audio 
Registers 
(CPALI 
CPAR) and the CINT11 output pin of the gate array is 
asserted (generating an ADSP interrupt if enabled 
within the ADSP). The bit is cleared when the ADSP 
reads either the CPAL or CPAR. 


PINT-Audio 
Playback Interrupt (Bit 10) 


This read-only bit, when set to a 1, indicates that 
audio data to the Audio D/ A converter can be sent 
to the Capture/Playback Audio Registers (CPAL/ 
CPAR) and the PINT11 output pin of the gate array is 
asserted (generating an ADSP interrupt if enabled 
within the ADSP). The bit is cleared when the ADSP 
writes to either the CPAL or CPAR. 
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word) at the end of each audio DMA read cycle. An- 
other read cycle will be initiated when the Most Sig- 
nificant VRAM Data Register (VRDAT1) is read. If 
RAUTO is set to a 0 the pointer will be unaffected by 
data reads and a DMA read cycle will consist of an 
address write followed by a data read. 


WAUTo-Wrlte 
Address Auto-Increment Mode 
(Bit 4) 


This read-write bit, when set to a 1, will increment 
the address pointer in the VRAM Write Address 
Register (WVRADDOand WVRADD1) by 4 (a double 
word) at the end of each audio DMA write cycle. 
Another write cycle will be initiated when the Most 
Significant VRAM Data Register (WVR1) is written. If 
WAUTO is set to a 0 the pointer will be unaffected 
by address writes and a DMA write cycle will consist 
of a data write followed by an address write. 


BE3-BEo-Byte 
Enables for VRAM and DVI 
Device Accesses (Bits 3:0) 


These read-write bits determine which of the four 
bytes on the DVI bus are enabled during a bus trans- 
action. Thus, they control whether the access in- 
volves a byte, word or long word. 


4.1.2 MESSAGETO DVI REGISTER(MDVI) 


The MDVI Register is a 16-bit read-write ADSP reg- 
ister but is read-only by a DVI Device. When the reg- 
ister is written, the DVINT bit is set to a 1 in the 
Audio Command and Status Register (ACS) and the 
DVI Device Command and Status Register (DCS). 
This creates an audio interrupt if the DVI Device In- 
terrupt Enable (DVIE) bit is set to a 1 in the DVI 
Device Command and Status Register (DCS). The 
DVINT bit is cleared and the audio interrupt disabled 
by a DW Device reading the MDVI Register. 


I 


4.1.3 MESSAGE TO DSP REGISTER (MDSP) 


The MDSP Register is a 16-bit read-write DVI regis- 
ter but is read-only by the ADSP. When the register 
is written, the AINT bit is set to a 1 in both the Audio 
Command and Status (ACS) Register and the DVI 
Device Command and Status Register (DCS). The 
AINT bit is cleared by the ADSP reading the MDSP 
Register. 


4.1.4 SAMPLE RATE COMMAND AND STATUS 
REGISTER (SRCS) 


This Sample Rate Command and Status Register is 
a 16-bit read-write ADSP register. Figure 4-2 identi- 
fies each bit of this register as well as its accessibili- 
ty. Writing data to read-only locations has no effect. 


C5-Co-capture 
Sample Rate (Bits 13:8) 


These bits control the rate at which audio samples 
are taken by the A/D converter. The sample rate 
(FS) is determined by the formula: 


44.1 KHz 
FS = -C=--+-1- 


where: 


0:::;; C:::;; 63 


C is the value C5:CO 


P5-PO-Capture 
Sample Rate (Bits 5:0) 


These bits control the rate at which audio samples 
are output to the D/ A converter. The sample rate 
(Fs) is determined by the formula: 


529.2 KHz 
Fs = --=P-+-1- 


where: 


1 :::;;P :::;;63 


P is the value P5:PO 
P= 0 turns off the playback sample rate clock. 


82750LA 


4.1.5 VRAM DATA REGISTERS (VRDATO, 
VRDAT1) 


The Least and Most Significant VRAM Data Regis- 
ters are each 16-bit ADSP read-write registers used 
together to transfer up to 32-bit data words between 
the ADSP and VRAM on the DVI Bus. Data is trans- 
ferred to the DVI Bus at the address stored in the 
VRAM 
Write 
Address 
Registers 
(WVRADDO, 
WVRADD1) whenever data is written to VRDAT1. 


4.1.6 CAPTURE/PLAYBACK AUDIO REGISTERS 
(CPAR, CPAL) 


The Right and Left Capture/Playback Registers are 
each 16-bit ADSP read-write registers used to trans- 
fer data between the ADSP and the D/ A or AID 
converters. Each time the Audio Capture Rate Coun- 
ter counts to completion, 32 bits of data are trans- 
ferred from the Audio Capture Serial-to-Parallel 
Converters to the Left and Right Capture Audio 
.Registers and the Capture Interrupt (CINT) bit is set 
to a 1 in the Audio Command and Status Register 
(ACS). Similarly, each time the Audio Playback Rate 
Counter counts to completion, 32 bits of data are 
transferred from the Left and Right Playback Audio 
Registers to the Audio Playback Parallel-to-Serial 
Converters and the Playback Interrupt (PINT) is set 
to a 1 in the ACS Register. Figure 4-3 shows a hard- 
ware block diagram of the AID and D/ A interface. 
Even though the Audio Capture and Playback Regis- 
ters are distinct hardware entities, the first is read- 
only and the second write-only by the ADSP and it is 
convenient to group them together and assign them 
one bus address. 


4.1.7 VRAM WRITE ADDRESS REGISTERS 
(WVRADDO, WVRADD1) 


The Least and Most Significant VRAM Write Ad- 
dress Registers are each 16-bit ADSP read-write 
registers used together to create a 32-bit write ad- 
dress pointer into the DVI Device Bus space. Up to 
32 
bits of 
data 
in the VRAM 
Data Registers 
(VRDATO,VRDAT1) is written to this pointer location 
when the ADSP writes to the VRDAT1 Register. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
0 
I 0 I 0 I cs I C4 I C3 I C2 I Cl I co I 0 I 0 I PS I P4 I P3 I P2 I Pl I PO I 


R 
R 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R 
R 
R/W 
R/W 
R/W 
R1W 
R/W 
R/W 
Figure 4·2. Sample Rate Command and Status Register 
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Left Serial-Porollel 
+-- 
Right Serial-Parallel 
Serial Data 
Register 
Register 
From Input 
1>./0 


~6 
~6 


Left Capture 
Right Capture 
Regiater 
R~ist.r 


~ ~6 
16 


I>.DSPDota Bus 


16 


, 
16 
16 
~,. 


Right 
Playback 
Loft 
Playback 
Register 
Register 


~6 
~6 


Right 
Parallel-Serial 
r----. 
Left Parallel-Serial 
Serial 
Data 
Converter 
Converter 
to Output 
011>. 
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Figure 4-3. Audio AID and DI A Interface Registers 


4.1.8 VRAM READ ADDRESS REGISTERS 
(RVRADDO, RVRADD1) 


The Least and Most Significant VRAM Read Ad- 
dress Registers are each 16-bit ADSP read-write 
registers used together to create a 32-bit read ad- 
dress pointer into the DVI Device Bus space. Up to 
32 bits of data is loaded into the VRAM Data Regis- 
ters (VRDATO,VRDAT1) from this pointer location 
when the ADSP writes to the RVADD1 Register. 


4.1.9 DVI COMMAND AND STATUS REGISTER 
(DCS) 


The DVI Command and Status Register is a 16-bit 
DVI Bus read-write register. Figure 4-4 identifies 
each bit of this register as well as its accessibility. 
Writing data to read- only locations has no effect. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
8 
5 
4 
3 
2 
1 
0 


IFOI 
HIE I OVINTI AINTI AREa 
I BR 
IBGIARSTlolololololololBRovl 
R 
R/W 
R 
R 
R 
R/W 
R 
R/W 
R 
R 
R 
R 
R 
R 
R 
R 
Figure 4-4. DVI Command and Status Register 


1-276 
I 


Fo-Flag 
Out (Bit 15) 


This read-only bit indicates the status of the Flag 
Out pin of the ADSP. It is controllable in software by 
the audio processor and may be used to signal 
events to the DVI Device. 


DVIE-Host 
Interrupt Enable (Bit 14) 


This read-write bit controls whether the assertion of 
the DVI Device Interrupt (DVINT) bit will create an 
audio interrupt on the DVI Bus. DVIE = 1 enables 
the interrupt; DVIE = 0 disables it. The interrupt en- 
able bit powers up in the 0 state. 


DVINT-DVI 
Device Interrupt (Bit 13) 


This read-only bit, when set to a 1, indicates that the 
ADSP has written a message to the MDVI Register 
(causing an audio interrupt if the DVI Device Inter- 
rupt Enable (DVIE) bit is set to a 1). The bit is 
cleared and the audio interrupt disabled by a DVI 
Device reading the MDVI Register. This interrupt 
mechanism is primarily intended for use by the Host 
Computer but could be configured to be used by oth- 
er DVI devices. 


AINT-Audlo 
Device Interrupt (Bit 12) 


This read-only bit, when set to a 1, indicates that a 
message has been sent to the MDSP Register from 
a DVI Device. The ADSP clears this bit by reading 
that register. 


AREa-Audio 
Request (Bit 11) 


This read-only bit, when set to a 1, indicates that the 
AREQ# output pin on the gate array is asserted and 
there is a pending request by KAGA to perform an 
audio data transfer on the DVI Bus. When the trans- 
fer is completed, the bit is negated. 


BR-Bus 
Request (Bit 10) 


This read-write bit, when set to a 1 by any DVI De- 
vice, asserts the BR# pin of the gate array, request- 
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ing the ADSP to relinquish control of its bus. This is 
used primarily during the ADSP boot process where 
the Host serves as the boot ROM and loads ADSP 
program memory in single-step mode using this sig- 
nal. 


BG-Bus 
Grant (Bit 9) 


This read-only bit, when set to a 1, indicates that the 
ADSP has asserted the DBG# pin of the gate array 
and has relinquished control of the audio bus. 


ARST-ADSP 
Reset (Bit 8) 


This read-write bit, when set to a 1, indicates that 
DRST# pin of the gate array has been asserted, 
resetting the ADSP. 


BRDY-Bus 
Ready (Bit 0) 


This read-only bit, when set to a 1, indicates that the 
DBS# pin of the gate array has been negated, signi- 
fying the end of a boot memory load cycle. At this 
time a new byte of data may be loaded into the 
BOOT Register by the DVI Device. 


4.2 Keying and Genlock 


This section describes the details of the 7 Keying 
and Genlock registers addressable from the DVI Bus 
and how they are used in the system. 


4.2.1 N MODULUSREGISTER(NMOD) 


The N Modulus Register is a 16-bit DVI Bus read- 
write register. Figure 4-5 identifies each bit of this 
register as well as its accessibility. The -i- N Counter 
divides the 10 MHz reference frequency by the 
quantity (NMOD + 2) so long as the register con- 
tents are 8 or greater. (The NObit is always interpret- 
ed by the counting circuitry as a 0 regardless of the 
state of stored value. This results in the modulus 
always being even.) If the contents of NMOD are 
less than 8, the output of the N Counter will be non- 
oscillatory. Writing to read-only locations has no ef- 
fect. This register is set to 200H upon reset. 


15 14 13 12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
0 
I 0 I 0 I 0 I 0 I Nll I Nl0 I N9 I N8 I N7 I N6 I N5 I N4 I N3 I N2 I NI I NO I 


R 
R 
R 
R 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
Figure 4-5. N Modulus Register 
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4.2.2 M MODULUS 
REGISTER 
(MMOD) 


The M Modulus 
Register 
is a 16-bit DVI Bus read-write 
register. 
Figure 4-6 identifies 
each bit of this register 
as 
well as its accessibility. 
The 
-i- M Counter 
divides 
the Voltage 
Controlled 
Oscillator 
(VCO) frequency 
by the 
quantity 
(MMOD + 2) so long as the register contents 
are 8 or greater. (The MO bit is always interpreted 
by the 
counting 
circuitry as a 0 regardless 
of the state of stored value. This results in the modulus always being even.) 
If the contents 
of MMOD are less than 8, the output of the M Counter 
will be non-oscillatory. 
Writing to read- 
only locations 
has no effect. 
This register 
is set to 200H upon reset. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
0 
I 0 I 0 I 0 I 0 I Mll I Ml0 I 
M9 I 
Me I 
M7 I 
M6 I 
MS I 
M4 I 
M3 
M2 
Ml 
MO 


R 
R 
R 
R 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
Figure 
4-6. M Modulus 
Register 


4.2.3 N COUNTER 
REGISTER 
(NCTR) 


The N Counter 
Register 
is a 16-bit DVI Bus read-only 
register which contains 
the instantaneous 
value of the N 
Counter. 
Figure 4-7 identifies 
each bit of this register as well as its accessibility. 
Writing to this register 
has no 
effect. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
2 
0 


0 
0 
0 
0 
Nll 
Nl0 I N9 I Ne 
N7 I Ne I N5 I N4 
N3 
N2 
Nl 
NO 


R 
R 
R 
R 
R 
R 
R 
R 
R 
R 
R 
R 
R 
R 
R 
R 
Figure 
4-7. N Counter 
Register 


4.2.4 M Counter 
Register 
(MCTR) 


The M Counter 
Register 
is a 16-bit DVI Bus read-only 
register which contains 
the instantaneous 
value of the M 
Counter. 
Figure 4-8 identifies 
each bit of this register as well as its accessibility. 
Writing to this register 
has no 
effect. 


15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
0 


0 
0 
0 
0 
Mll 
Ml0 I M9 I Me I M7 I M6 I MS I M4 I M3 I M2 I Ml I MO I 


R 
R 
R 
R 
R 
R 
R 
R 
A 
R 
R 
R 
R 
R 
R 
R 
Figure 
4-8. M Counter 
Register 


4.2.5 GENLOCK 
COMMAND 
AND STATUS 
REGISTER 
(GCSR) 


The Genlock 
Command 
and Status Register 
is a 32-bit DVI Bus read-write 
register 
use to control 
and monitor 
the parameters 
involved 
in synchronizing 
the DVI Display 
Processor 
(DB) to various 
sources 
of video timing. 


Figure 4-9 identifies 
each bit of these 
registers 
and its accessibility. 


15 
14 
13 
12 
11 
10 
, 
8 
7 
6 
5 
4 
3 
2 
1 
0 


I GSEL I 
IVS I 
IHS I SCMP I SOB I HREN I VREN I NEVS I UPBC I PBC2 I PBCl 
PBCO I UOBC I OBC2 I OBCl 
OBCO I 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 


31 
30 
29 
28 
27 
29 
25 
24 
23 
22 
21 
20 
19 
18 
17 
16 


0 
0 
0 
0 
0 
I 
0 
I 
0 
I 
0 
I 
0 
I 
IOMC I 
IONC I 
POMl I 
POMO 
SCAP 
SEXT 
IVRS 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
Figure 
4-9. Genlock 
Command 
and Status 
Register 
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IDMe-lnvert 
Divider M Counter (Bit 22) 


This read-write bit selects the edge of the Voltage 
Controlled Oscillator waveform that is used to incre- 
ment the 
-i- M Counter. The default for this bit is 0 
which selects the rising edge. If the bit is set to a 1, 
the falling edge is selected. 


IDNe-lnvert 
Divider N Counter (Bit 21) 


This read-write bit selects the edge-of the 10 MHz 
Reference Oscillator Waveform that is used to incre- 
ment the 
-i- N Counter. The default for this bit is 0 
which selects the rising edge. If the bit is set to a 1, 
the falling edge is selected. 
. 


PDM1-PDMG-Phase Detector Mode (Bits 20:19) 


These read-write bits control the Phase Detector 
mode of operation. There are four modes of opera- 
tion: 


Mode 0 is the default mode. In this mode, if the two 
Phase Detector inputs are coincident, the output 
pins are tri-stated. If the VCO input lags the Refer- 
ence, the PU1 output pin will go active high during 
the lag time. If the VCO input leads the Reference, 
the PD1 output pin will go active high during the lead 
time. 


Mode 1 bypasses the internal Phase Detector, bring- 
ing the VCO input to the PD1 output pin and the 
Reference to the PU1 output pin. This allows the use 
of an external phase detector. 


In Modes 2 and 3 the outputs of the internal Phase 
Detector always drive the output pins. In Mode 2 if 
the two Phase Detector inputs are coincident, the 
output pins are both low. If the VCO input lags the 
Reference, the PU1 output pin will go high during the 
lag time. If the VCO input leads the Reference, the 
PD1 output pin will go high during the lead time. In 
Mode 3 the outputs are inverted from Mode 2. 


SCAP-5elect 
Capture (Bit 18) 


This read-write bit together with bits SEXT and 
GSEL determines the Reference input to the Phase 
Detector and the signal selected for Vertical Reset 
of the DB as shown in Figure 4-10. The default for 
this bit is O. 


I 
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SEXT-5elect 
External RGB (Bit 17) 


This read-write bit together with bits SCAP and 
GSEL determines the Reference input to the Phase 
Detector and the signal selected for Vertical Reset 
of the DB as shown in Figure 4-10. Additionally, to- 
gether with bits SCMP and SOB, it determines the 
horizontal and vertical inputs selected for the Video 
Sync Outputs as shown in Figure 4-11. The default 
for this bit is O. 


IVRS-Invert 
Reference Source (Bit 16) 


When this read-write bit is set to a 1 the Reference 
Input to the Phase Detector is inverted. When the bit 
is set to a 0 no inversion takes place. The default for 
this bit is O. 


GSEL-Genlock Select (Bit 15) 


This read-write bit together with bits SCAP and 
SEXT determines the Reference input to the Phase 
Detector and the signal selected for Vertical Reset 
of the DB as shown in Figure 4-10. 


IVS-Invert 
Vertical Sync (Bit 14) 


When this read-write bit is set to a 1 the Vertical 
Sync output (VSYNC)is inverted. When the bit is set 
to a 0 no inversion takes place. The default for this 
bit is O. 


IHS-Invert 
Horizontal Sync (Bit 13) 


When this read-write bit is set to a 1 the Horizontal 
Sync output (HSYNC) is inverted. When the bit is set 
to a 0 no inversion takes place. The default for this 
bit is O. 


SCMP-5elect 
Composite (Bit 12) 


This read-write bit together with bits SEXT and SOB 
determines the horizontal and vertical inputs select- 
ed for the Video Sync Outputs as shown in Figure 
4-11. The default for this bit is O. 


SDB-5elect 
Display Processor (DB) (Bit 11) 


This read-write bit together with bits SEXT and 
SCMP determines the horizontal and vertical inputs 
selected for the Video Sync Outputs as shown in 
Figure 4-11. The default for this bit is O. 
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SCAP (18) 
SEXT (17) 
GSEL (16) 
Reference 
DB_VRST 


0 
0 
0 
10MHZ_N 
VGA-VSYNC 


0 
0 
1 
VGA-HSYNC 
VGA-VSYNC 


0 
1 
X 
RGB_HSYNC 
RGB_VSYNC 


1 
X 
X 
CAP_HSYNC 
CAP_VSYNC 


Figure 4-10. Phase Detector Reference and Vertical Reset Select 


SCMP (12) 
SEXT (17) 
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VGA-VSYNC 


0 
X 
1 
DB_HSYNC 
DB_VSYNC 


0 
1 
0 
RGB_HSYNC 
RGB_VSYNC 


1 
0 
0 
DB_CSYNC 
VGA-VSYNC 


1 
X 
1 
DB_CSYNC 
DB_VSYNC 


1 
1 
0 
DB_CSYNC 
RGB_VSYNC 


HREN-Horizontal 
Reset Enable (Bit 10) 


Figure 4-11. Horizontal and Vertical System Sync Select 


PBC2-PBC~B 
Clock Select (Bits 6:4) 


When this read-write bit is set to a 1 the Horizontal 
Reset to the DB is enabled. The reset is used to 
align the lines of video produced by the DB with vari- 
ous video sources. When the bit is set to a 0 the 
output is disabled. The default for this bit is O. 


VREN-Vertlcal 
Reset Enable (Bit 9) 


When this read-write bit is set to a 1 the Vertical 
Reset to the DB is enabled. The reset is used to 
align the fields of video produced by the DB with 
various video sources. When the bit is set to a 0 the 
output is disabled. The default for this bit is O. 


NEV5-Negatlve 
Edge of Vertical Sync (Bit 8) 


This read-write bit selects the edge of the selected 
Vertical Sync input waveform that is used to assert 
the Vertical Sync Reset to the DB. The default for 
this bit is 0, which selects the rising edge. If the bit is 
set to a 1, the falling edge is selected. 


UPBC-Update 
PB Clock (Bit 7) 


When a 1 is written to this read-write bit, the selec- 
tion of the PB clock source (PBC2-PBCO) is updat- 
ed. The bit immediately returns to the 0 state. 
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This three-bit field selects the PB clock source when 
the UPBCinput makes a positive transition as shown 
in Figure 4-12. The default for this field is O. 


PBC2(6) 
PBC1 (5) 
PBCO(4) 
PB Clock Source 


0 
0 
0 
10MHLIN 


0 
0 
1 
VCO_IN 


0 
1 
0 
10MHLIN 


0 
1 
1 
16.9344 MHz 


1 
0 
0 
VCO_IN + 2 


1 
0 
1 
10MHLIN 


1 
1 
0 
DOT_ClK 


I 
1 
1 
SYS_ClK 


Figure 4-12. PB Clock Source Selection 


UDBC-Update 
DB Clock (Bit 3) 


When a 1 is written to this read-write bit, the selec- 
tion of the DB clock source (DBC2-DBCO) is updat- 
ed. The bit immediately returns to the 0 state. 


I 


intet~ 


DBC2-DBCo-DB 
Clock Select (Bits 2:0) 


This three-bit field selects the DB clock source when 
the UDBC input makes a positive transition as 
shown in Figure 4-13. The default for this field is O. 


DBC2 (2) DBC1 (1) DBCO(0) DB Clock Source 


0 
0 
0 
10MHLIN 


0 
0 
1 
VCO_IN 


0 
1 
0 
10MH~IN 


0 
1 
1 
16.9344 MHz 


1 
0 
0 
VCO_IN + 2 


1 
0 
1 
10MHLIN 


1 
1 
0 
DOT_ClK 


1 
1 
1 
SYS_ClK 


Figure 4-13. DB Clock Source Selection 


4.2.6 CHROMA KEYING VALUE REGISTER 
(CKVAL) 


The Chroma Keying Value Register is an 8-bit DVI 
Bus read-write register which contains a value which 
is continuously compared with the pixel values on 
the Video Feature Bus. If VGA Keying is selected 
(see Section 4.2.7) and the values match, the output 
from the Display Processor is substituted for the 
VGA video for that pixel. 
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4.2.7 KEYING MODE SELECT REGISTER 
(KMSEL) 


The Keying Mode Select Register is an 8-bit DVI Bus 
read-write register which determines whether a sin- 
gle video source or the keying of RGB or VGA with 
DVI video will make up the system video output. Fig- 
ure 4-14 identifies each bit of this register as well as 
its accessibility. 


7654321°1 
I 
0 
I 
0 
I 
0 
I 
0 
I XDFT I 
MS2 I 
MS1 I 
MSC 


R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
R/W 
Figure 4-14. Keying Mode Select Register 


XDFT-External 
RGB Default Mode (Bit 3) 


This bit reflects the state of the RGB_DFT pin on 
the gate array. If this pin is tied to ground, the bit will 
display as a 0 and Keying Mode 0 (see below) is the 
Power-ondefault state. If the pin is tied to vcc or left 
floating, then Keying Mode 5 is the power-on default 
state. 


MS2•.MSo-Mode 
Selects (Bits 2:0) 


This 3-bit field selects one of eight keying modes as 
shown in Figure 4-15 and described below. 


Keying Mode 
MS2 
MS1 
MSO 
Video with Key = 0 
Video with Key = 1 
Keying Signal 


0 
0 
0 
0 
VGA 
- 
N/A 


1 
0 
0 
1 
Reserved for Future Use 


2 
0 
1 
0 
VGA 
DVI 
VGAKey 


3 
0 
1 
1 
VGA 
DVI 
VGA Key and DB_Key 


4 
1 
0 
0 
DVI 
- 
N/A 


5 
1 
0 
1 
RGB 
- 
N/A 


6 
1 
1 
0 
RGB 
DVI 
Black Detect 


7 
1 
1 
1 
Reserved for Future Use 


Figure 4-15. Keying Modes 
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rieqrster l~I\Vf\L} 
ana me ut:l_t\ey 
pin on me array 
is asserted 
at which time DVI video is selected. 


In Mode 
4, DVI is the unconditional 
source 
of the 
video output. 


In Mode 
5, RGB is the unconditional 
source 
of the 
video output. 
It is the default 
mode when the RGB_ 
DFT pin on the gate array is tied to vcc or left float- 
ing. 


In Mode 6, RGB video is selected 
unless the exter- 
nal analog 
Black 
Detect 
circuit 
determines 
that the 
RGB value is near black, at which time DVI video is 
selected. 


Mode 7 is reserved 
for future use and is not present- 
ly defined. 


4_2.8 COMMON 
SYSTEM 
CONFIGURATIONS 


Although 
the Genlock 
Command 
and Status 
Regis- 
ter provides 
many 
possibilities 
for signal 
selection, 
the 
following 
eight 
combinations 
shown 
in Figure 
4-16 (ignoring 
some signal inversions) 
are the most 
logical 
alternatives: 


Case B 


This is a mode where the Display 
Processor 
gener- 
ates an image that is overlaid with VGA video. In this 
mode the DB clock 
source 
must originate 
from the 
vca which 
is phaselocked 
to the VGA 
Horizontal 
Sync. 
The VGA 
Horizontal 
and Vertical 
Syncs 
are 
selected 
as the 
DB Resets 
as well as the system 
output 
synchronization. 
The 
HREN 
and VREN 
bits 
are set to 1 in order to allow those resets to synchro- 
nize the DB to the VGA frame. The IVS and IHS bits 
are set to 0 since both the Video 
Feature 
Bus input 
and the output 
monitors 
most likely use active 
low 
syncs. 


CaseC 


This mode is identical 
to case B except that the out- 
put Horizontal 
and Vertical 
Syncs originate 
from the 
DB. Since the DB outputs 
positive-going 
sync puls- 
es, it is necessary 
to set the IVS and IHS bits to a 1. 


GCSR 
HSYNC 
Reference 
OB_VRST 
OB-HRST 
SCAP 
SEXT 
GSEL 
SCMP 
SOB 
VSYNC 
(15:0) 
Source 


68 
0 
0 
0 
0 
1 
DB 
VSYNC 
DB 
HSYNC 
+N 
OFF 
OFF 


86 
0 
0 
1 
0 
0 
VGA-VSYNC 
VGA-HSYNC 
VGA-HSYNC 
VGA-VSYNC 
:M 


EE 
0 
0 
1 
0 
1 
DB 
VSYNC 
OB_HSYNC 
VGA-HSYNC 
VGA-VSYNC 
:M 


206 
0 
1 
0 
0 
0 
RGB 
VSYNC 
RGB_HSYNC 
RGB_HSYNC 
RGB_VSYNC 
:M 


26E 
0 
1 
0 
0 
1 
DB 
VSYNC 
DB_HSYNC 
OB_HSYNC 
DB_VSYNC 
:M 


27E 
0 
1 
0 
1 
1 
DB 
VSYNC 
OB_CSYNC 
RGB_HSYNC 
RGB_VSYNC 
:M 


46E 
1 
0 
0 
1 
1 
DB 
VSYNC 
DB_HSYNC 
CAP_HSYNC 
CAP_VSYNC 
+M 


47E 
1 
0 
0 
1 
1 
DB 
VSYNC 
OB_CSYNC 
CAP_HSYNC 
CAP_VSYNC 
+M 


A 


B 


C 
o 


E 


F 


G 


H 


Figure 
4-16. Common 
System 
Configurations 
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CaseD 


This is a mode where the Display Processor gener- 
ates an image that is overlaid with RGB video. In this 
mode the DB clock source must originate from the 
VCO which is phaselocked to the RGB Horizontal 
Sync. The RGB Horizontal and Vertical Syncs are 
selected as the DB Resets as well as the system 
output synchronization. The HREN and VREN bits 
are set to 1 in order to allow those resets to synchro- 
nize the DB to the RGB frame. The IVS and IHS bits 
would probably be set to 0 since RGB syncs are 
likely to be the correct polarity. 


Case E 


This mode is identical to case 0 except that the out- 
put Horizontal and Vertical Syncs originate from the 
DB. Since the DB outputs positive-going sync puls- 
es, it is necessary to set the IVS and IHS bits to a 1. 


I 
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CaseF 


This mode is identical to case E except that Com- 
posite Horizontal Sync is selected from the DB. 


CaseG 


This is a mode in which the DB generates the entire 
image but is still genlocked to an external so~rce, 
Capture Horizontal and Video. The Capture HOrizon- 
tal is the genlock reference and the Vertical is used 
to reset the DB. However, the DB Horizontal and 
Vertical Syncs are used as the system output. 


Case H 


This mode is identical to case G except that Com- 
posite Horizontal Sync is selected from the DB. 
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5.0 
ELECTRICAL 
SPECIFICATIONS 


5.1 DCCharacteristics 


Table 5-1 contains 
stress ratings only, and function- 
al operation 
at the maximums 
is not guaranteed. 
Ex- 
posure 
to Maximum 
Ratings 
may affect 
device 
reli- 
ability. Furthermore, 
although 
the 82750LA 
contains 
protective 
circuitry to resist damage from static elec- 
trical 
discharge, 
this device 
is sensitive 
to ESO lev- 
els above 
1000V. Always 
take precautions 
to avoid 
high static voltages 
or electric 
fields. 


Table 
5-1. Maximum 
Ratings 


Condition 
Maximum 
Requirement 


Maximum 
Operating 
Junction 
Temperature 
100·C 


Storage Temperature 
- 65·C to + 150·C 


Voltage on Any Pin with Respect to Ground 
-0.5Vt07V 


Supply Voltage with Respect to Vss 
-0.5Vt07V 


Input Current Clamp (VI <0 or VI>VCC> 
±20mA 


Output Current Clamp (VO < Q or VO> VcC> 
±20mA 


Continuous 
Output Current Low 
20mA 


Continuous 
Output Current High 
20mA 


Table 5-2. Recommended 
Operating 
Conditions 


Recommended 
Condition 
Parameter 
Mln 
Nom 
Max 


Supply Voltage 
(VcC> 
4.50V 
5.00V 
5.50V 


Operating 
Temperature 
Range 
O·C 
70·C 


Table 5-3. DC Characteristics 
vcc = 5V, TCASE = 25·C 


Symbol 
Parameter 
Mln 
Typ 
Max 
Unit 
Test Conditions 


VIL 
Input LOW Voltage 
0.8 
V 
vcc = 4.5V 


VIH 
Input HIGH Voltage 
2.0 
V 
vcc = 5.5V 


VOL 
Output LOW Voltage 
0.5 
V 
VI = 0.1 VCC,IOL = 4 mA 


VOH 
Output HIGH Voltage 
3.7 
V 
VI = 0.9 vcc. IOH = 4 mA 


IlL 
Input Leakage Current 
-70 
p.A 
VIL = OV 


IOL 
Output Low Current 
4 
mA 


Icc 
Power Supply Current 
35 
mA 


CIN 
Input Capacitance 
7 
pF 


COUT 
Output Capacitance 
34 
pF 


VT(1) 
Input Threshold 
Voltage 
1.3 
V 


VT(2) 
Input Threshold 
Voltage 
Vcc/2 
V 
• 


NOTES: 
1. Specified for all input pins except MD31-MDO, VWEII, BEII3-BEIIO. 
2. Specified for MD31-MDO, VWEII, BEll-BEllO. 
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Table 
5-4. ClK 
DC Characteristics 
vcc = 5V, TCASE = 25°G 


Symbol 
Parameter 
Mln 
Typ 
Max 
Unit 
Notes 


VIL 
Input LOW Voltage 
0.9 
V 
vcc = 4.5V 


VIH 
Input HIGH Voltage 
3.85 
V 
vcc = 5.5V 


IlL 
Input LOW Leakage 
±1 
p.A 
VIH = vcc 


IIH 
Input HIGH Leakage 
±1 
p.A 
VIL = OV 


VT 
Input Threshold 
Voltage 
2.5 
V 


GIN 
Input Capacitance 
7 
pF 


Output 
Delay and Rise Time vs load 
Capacitance 


Typical Output Valid Delay 
vs load 
Capacitance 
Typical 
Output 
Rise Time 
vs load 
Capacitance 


8 


Valid 
Delay 
6 
Time 


, (no) 
4 


8 


Rise 
Time:1 
»> 
(ns) 
.• ~ 


2'--------' 
10 
20 
30 
40 
50 
2'-- 
-' 
10 
20 
30 
40 
50 


G.. (pr) 
G.. (pF) 


241346-19 
241346-20 
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5.2 AC Characteristics 


NOTE: 
Industry 
standard 
gate array test methodologies 
do 
not include 
full AC characterization. 
The 
AC char- 
acteristics 
below 
were 
determined 
through 
slmula- 


tion 
and 
are 
provided 
as design 
guidelines 
only. 
These parameters 
are not fully tested 
in production. 
As per Tl's 
standard 
gate 
array test 
methodology, 


two AC parametric 
measurements 
are made during 
production 
to guarantee 
the 
speed 
of the 
device. 
These 
measurements 
are indicated 
by an • in the 
table below. 


Table 5-5. AC Characteristics 
vcc = 5V ± 10%, TA ;", O°C to + 70°C, CL = 45 pF 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t1 
VA19-VAO 
Setup before DSTRB# 
Low 
10 
ns 
5·1 


t2 
VA19-VAO 
Hold after DSTRB# 
Low 
2 
ns 
5·1 


t3 
BE#3-BE#0 
Setup before DSTRB# 
Low 
0 
ns 
5·1 


t4 
BE#3-BE#0 
Hold after DSTRB# 
Low 
10 
ns 
5·1 


t5 
DSTRB # Low Pulse Width 
20 
ns 
5·1 


t6 
VWE# 
Setup before DSTRB# 
Low 
7 
ns 
5·1 


t7 
VWE # Hold after DSTRB # Low 
5 
ns 
5·1 


t8 
MD31-MDO 
Write Data Valid before VWE# 
High 
5 
ns 
5·1 


t9 
MD31-MDO 
Write Data Valid after VWE# 
High 
20 
ns 
5·1 


t10 
DSTRB# 
Low to MD31-MDO 
Read Data Enabled 
5 
ns 
5·1 
2 


t11 
DSTRB# 
Low to MD31-MDO 
Read Data,Valid 
50 
ns 
5·1 
2 


t12 
DSTRB# 
High to MD31-MDO 
Read Data Invalid 
4 
ns 
5·1 
2 


t13 
DSTRB# 
High to MD31-MDO 
Read Data Disabled 
20 
ns 
5·1 
2 


NOTES: 
1. All timing measured at the 1.3V TTL threshold. 
2. For MD31-MDO lines, CL = 100 pF. 


--+!--12 


VA(19:17)~~,-- 
__ 
~~_ 
•••••""~~~~>I\ 
...J'\Ci~~~l.Cl.. 
_ 


BE#(3:0)~:Z~C:jt:::~~~~~O<::::::J<l~~X:::::: 


OSTRB# 


16--1--- 


VWE# 


110 
~ 


110(31:0) 
\,,::::::_1-9_~.....JI------~xa 
112 
~ 
I 
WRITE 
CYCLE 
READ 
CYCLE 


241346-21 


Figure 5-1. DVI Device 
Read and Write 
Cycle 
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Table 5-6. AC Characteristics 
vcc = 5V ± 10%, TA = O°Cto + 70°C,CL = 45 pF 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t14 
DMS#, DRD#, and DWR# low to AREQ# low 
42 
ns 
5-2 


t15 
AREQ# low to DMARDY# low 
6 
ns 
5-2 


t16 
ASEl# 
low to AREQ# High 
32 
ns 
5-2 


t17 
BUSEN# lowtoAREQ# 
High 
31 
ns . 
5-2 


t18 
BUSEN High to DMARDY# High 
36 
ns 
5-2 


t19 
ASEl # High to DMARDY# High 
37 
ns 
5-2 


t20 
ASEl# 
High to VWE# Disabled 
28 
ns 
5-2 


t21 
ASEl# 
low to VWE# Enabled and Valid 
28 
ns 
5-2 


t22 
BUSEN# low to VWE# Enabled and Valid 
27 
ns 
5-2 


t23 
BUSEN# High to VWE# Disabled 
27 
ns 
5-2 


t24 
MSTRB# Setup to V1ClK 
10 
ns 
5-2 


t25 
GAVAlEN Setup to V1ClK 
8 
ns 
5-2 


t26 
ASEl # low to MD31-MDO 
46 
ns 
5-2 
2 
Address Enabled and Valid 


t27 
BUSEN# lowtoMD31-MDO 
27 
ns 
5-2 
2 
Address Enabled and Valid 


t28 
V1ClK High to MD31-MDO Address Disabled 
50 
ns 
5-2 


t29 
WRITE: Vl ClK High to MD31-MDO 
50 
ns 
5-2 
2 
Data Enabled and Valid 


t30 
WRITE: V1ClK to MD31-MDO Data Disabled 
30 
ns 
5-2 
2 


t31 
WRITE: ASEl# 
High to MD31-MDO Data Disabled 
3 
43 
ns 
5-2 


t32 
READ: MD31-MDO Setup to Vl ClK 
10 
ns 
5-2 


t33 
READ: MD31-MDO Hold after Vl ClK 
10 
ns 
5-2 


NOTES: 
1. All timing 
measured 
at the 1.3V TTLthreshold. 


2. For MD31-MDO 
lines, CL = 100 pF. 
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MSTRB# 


GAVALEN 


1010(31:0) 


123 
121 


VWE# --------------------t------t--1C::::::::::::::::::::~:)~------- 


V1CLK 
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Figure 5-2. DVI DMA Transfer 
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Table 5-7. AC Characteristics 
vcc = 5V ± 10%, TA = O·C to + 70·C, CL = 45 pF 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t34 
A(3:0) Delay before TD15- TOO Read Data Valid 
48 
ns 
5-3 


t35 
A(3:0) Delay before TD15- TDO Read Data Invalid 
3 
ns 
5-3 


t36 
DRD# 
High to TD15- TDO Read Data Invalid 
3 
ns 
5-3 


t37 
DMS# 
High to TD15- 
TDO Read Data Invalid 
3 
ns 
5-3 


t38 
DMS # Low to TD15,.. TDO Read Data Valid 
57 
ns 
5-3 


t39 
DMS # Low to TD15- 
TDO Read Data Enabled 
3 
ns 
5-3 


t40' 
DRD # Low to TD15- TDO Read Data Valid 
57 
ns 
5-3 


t41 
DMS # High to TD15- TDO Read Data Disabled 
3 
ns 
5-3 


t42 
DRD# 
Low to TD15- TDO Read Data Enabled 
3 
ns 
5-3 


t43 
DRD# 
High to TD15- 
TDO Read Data Disabled 
3 
ns 
5-3 


t44 
A3-AO 
Setup before DWR # Low 
0 
ns 
5-3 


t45 
A3-AO 
Hold after DWR # High 
5 
ns 
5-3 


t46 
A3-AO 
Hold after DMS# 
High 
5 
ns 
5-3 


t47 
A3-AO 
Setup before DMS# 
Low 
0 
ns 
5-3 


t48 
TD15- TDO Write Data Setup before DMS# 
High 
5 
ns 
5-3 


t49 
TD15- TDO Write Data Hold after DMS# 
High 
15 
ns 
5-3 


t50 
TD15- 
TDO Write Data Setup before DWR # High 
5 
ns 
5-3 


t51 
TD15- TDO Write Data Hold after DWR# 
High 
15 
ns 
5-3 


t52 
DRD# 
Low to DBR# 
Low 
5 
32 
ns 
5-4 


t53 
DSTRB# 
Low to DBR# 
High 
5 
28 
ns 
5-4 


t54 
DIN Setup before SCLOCK 
High 
0 
ns 
5-5 


t55 
DIN Hold after SCLOCK High 
9 
ns 
5-5 


t56 
CAPLR 
Setup before SCLOCK High 
5 
ns 
5-5 


t57 
CAPLR 
High to CINT# 
Low 
16 
ns 
5-5 


t58 
BCLK Low to WCLK Delay 
4 
ns 
5-6 


t59 
BCLK Low to DOUT Delay 
13 
ns 
5-6 


t60 
BCLK Low to PLA YLR 
Delay 
3 
ns 
5-6 


NOTE: 
1. All timing measured at the 1.3V TTL threshold. 
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A(3:0) "~---+-:-.IJ"~~------'.:t.:;I..::I,;jJ'I'------:----'1I"-l'~~ 


TO(15:0) ----~X~l!~X~-------_t-..Jr_(::ll::)r_--- 


ORO- 


t50_+-,t",,5.:..1-i 


OWR- 


READCYCLE 
WRITECYCLE 


Figure 5-3. ADSP External 
Memory 
Read and Write Cycle 
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VA(19:17) 
r" 
---------7~-t1--t-2~·------ 


TD(15:0) ---""~X(IIt~~-----+----- 


OSTRB- 


OBS- 


ORO- 


OBR- 


READBOOT 
LOADBOOT 


Figure 5-4. ADSP Boot Cycle 
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SCLK 
I+--t54-- -t55- 


DIN -------"'" 


[::1 
_ 
241346-25 


CAPLR 


241346-26 


I 


CINTIJ 


Figure 5-5. ADSP Capture 
Waveforms 


BCLK 


WCLK 


DOUT 


__________t6_0--:r~----------------t6-'~ 


Figure 5-6. ADSP Playback 
Waveforms 
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Table 5-8. AC Characteristics 
vcc = 5V ± 10%, TA = O·Cto + 70·C, CL = 45 pF 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t61 
RGB_HSYNC to HSYNC Delay 
23 
ns 
5·7 


t61 
DB_HSYNC to HSYNC Delay 
23 
ns 
5-7 


t61 
VGA.....HSYNCto HSYNC Delay 
23 
ns 
5-7 


t61 
DB_CSYNC to HSYNC Delay 
23 
ns 
5-7 


t61 
RGB_VSYNC to VSYNC Delay 
22 
ns 
5-7 


t61 
DB_ VSYNC to VSYNC Delay 
22 
ns 
5-7 


t61 
VGA.....VSYNC to VSYNC Delay 
22 
ns 
5-7 


t62 
VCO_IN to PB_ClK, 
DB_ClK 
Delay 
20 
ns 
5-8 


t62 
DOT_ClK 
to PB_ClK, 
DB_ClK 
Delay 
20 
ns 
5-8 


t62* 
SYS_ClK 
to PB_ClK, 
DB_ClK 
Delay 
20 
ns 
5-8 


t62 
1OMHLIN 
to PB_ClK, 
DB_ClK 
Delay 
20 
ns 
5-8 


t62 
XI to PB_ClK, 
DB_ClK 
Delay 
20 
ns 
5-8 


t62 
VCO_IN 
-i- 2 to PB_ClK, 
DB_ClK 
Delay 
26 
ns 
5-8 


t63 
VFB7-VFBO Setup before DOT_ClK 
5 
ns 
5-9 


t63 
DB_KEY Setup before DOT_ClK 
6 
ns 
5-9 


t64 
VFB7- VFBOHold after DOT_ClK 
3 
ns 
5-9 


t64 
DB_KEY Hold after DOT_ClK 
3 
ns 
5-9 


t65 
DOT_ClK 
to DVI,VGA Delay 
8 
ns 
5-9 


t65 
DB_ClK 
to DB_ VRST Delay 
25 
ns 
5-9 


t65 
VCO_IN to DB_HRST Delay 
29 
ns 
5-9 


NOTE: 
1. All timing 
measured 
at the 1.3V TTLthreshold. 
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SYNC_IN -t,,'1 
b.~ 
SYNC_OUT 


241346-27 


Figure 5-7. Sync Delays 


-t,,'1 
b.~ 


CLOCK_IN 


CLOCK_OUT 


241346-28 


Figure 5-8. Clock Delays 


CLOCK=r4i 
\ 


INPUT 


CLOCK---s 


\ 


\ 
OUTPUT 


241346-29 


Figure 5-9. Miscellaneous 
Timing 


I 
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NOTES: 
1. All dimensions 
in millimeters. 
2. Co-planarity 
within 
0.10 mm. 


6.0 
PACKAGE 
THERMAL 
SPECIFICATIONS 


from the 
IC junction 
to the external 
package 
case. 


Measurements 
are typically 
taken using high air flow 
to simulate 
an infinite heat sink. The thermal 
charac- 
teristics 
of the 160-lead 
PQFP package 
are as fol- 
lows: 


Thermal 
impedance 
is defined 
as the ability to dissi- 
pate heat generated 
by an electronic 
device 
and is 
characterized 
by (JJA and 
(JJC. It is measured 
in de- 
grees 
Celsius 
per Watt. 
(JJA 
is the thermal 
imped- 
ance from the IC chip junction 
in still air ambient with 
the package 
mounted 
in a socket 
or directly 
mount- 
ed on a PC Board. 
(JJC 
is the thermal 
impedance 


IJJA = 6O.O"C/W 


IJJC = 18.0·C/W 
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VIDEO PROCESSOR 


• 
High Performance 
Video Processor 
Based on the 82750PB 


• 
Supports 
the Shared 
Frame Buffer 
Architecture 
-Integration 
of Graphics 
and Video 
Into a Single Subsystem 
- 
Simple, Low Cost, High Performance 
Solution 


• 
High Speed Shared 
Frame Buffer 
Interconnect 
(SFBI) 
- 
32/64-blt 
Memory 
Interface 
- 
Supports 
up to 8 MB of VRAM and 
DRAM 


• 
Eyent Synchronization 
via the 
SynchroLlnk* 
Bus 


• 
Universal 
Host Bus Interface 
-ISA, 
EISA, Micro Channel, 
PCI, • 


VL-bus 


• 
82750PD 
Core Features 
Include: 
- 
25 MHz Operation 
with Single Cycle 
Execution 
- 
Programmable 
512 x 48 Instruction 
RAM 
- 
Flexible 
16-Blt ALU 
- 
Two Internal. 16-Blt Buses Providing 
Parallel Transfers 
- 
Plxel Interpolator 
- 
Variable 
Length Sequence 
Decoder 


The 82750PD 
is a programmable 
video 
processor 
that 
supports 
a wide 
range 
of video 
compression 
algo- 


rithms. The 82750PD 
operates 
in conjunction 
with a graphics 
processor 
and an optional 
capture 
processor 
to 
bring real-time 
video compression 
and decompression 
acceleration 
to the graphics 
subsystem. 
The shared 
frame 
buffer 
architecture 
is enabled 
through 
the implementation 
of the Shared 
Frame 
Buffer 
Interconnect 
(SFBI). This allows the integration 
of the video and graphics 
subsystem 
and results in a simple, low cost, and 
high performance 
solution. 
Event synchronization 
is achieved 
through 
the SynchroLink· 
serial bus, providing 


the synchronization 
of graphics, 
video, 
and audio 
events 
without 
the use of host interrupts. 
The 82750PD 
supports 
a Universal 
Host Bus Interface, 
which includes 
ISA, EISA, Micro Channel, 
PCI, and VL-bus. 


The 82750PD 
is implemented 
using 
Intel's 
low power 
CHMOS 
IV technology 
and is packaged 
in either 
a 
196-pin 
Plastic Quad Flat Package 
(PQFP) or a 208-pin 
ceramic 
Pin Grid Array (PGA). 


Figure 1_82750PD 
Block Diagram 


272341-2 


·SynchroLink 
is a trademark 
of ATI Technologies, 
Inc. 


October 1993 
Order Number: 272341-001 
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1.0 
PURPOSE 
2.0 
82750PD OVERVIEW 


This document provides electrical characteristics for 
the 82750PD. For a detailed description of any 
82750PD functional topic, other than parametric per- 
formance, consult the 82750PD Programmer's Ref- 
erence 
Manual 
(Order 
No. 
272352) 
and 
the 


82750PD Video Processor Universal Host Bus Inter- 
face Application Note (Order No. 272378). 


The 82750PD is an i750~ Video Processor that op- 
erates in conjunction with a graphics processor and 
a video capture processor to bring real-time video 
compression and decompression to the graphics 
subsystem. The 82750PD has been designed to op- 
erate in a shared frame buffer architecture where it 
provides video compression/decompression. 
The 


shared frame buffer system is shown in Figure 2. 


Host 
Bus (ISA, 
EISA, lAicro Channel, 
PCI, VL +bus) 
I 
I 
I 


Graphics 
82750PO 
Audio 
Video 
Video I/O 
Controller 
Processor 
Chip 


I 


1 
1 
,I 
SynchroLink 


Shored 
Frame 
Buffer 
Interconnect 


- 


Shared 
Audio 
Frame 
DAC 
A/D 
D/A 
I/O 
Buffer 


Display 
Ccmero, TV, 
VCR 
Sound 


Figure 2. Shared 
Frame Buffer System 


82750PD 


272341-1 
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The 82750PD 
core is a microcode 
compatible 
deriv- 
ative 
of 
the 
82750PB 
and 
executes 
instructions 
stored in the on-chip 
microcode 
RAM. The 82750PD 
adds a flexible 
host interface, 
Shared 
Frame 
Buffer 
InterconneCt 
support, 
and SynchroLink 
interface 
to 
provide 
a highly 
integrated 
solution. 
The 82750PD 
block diagram 
is shown 
in Figure 1 on page 1. 


2.1 
82750PD Core 


The 82750PD 
core includes 
a wide instruction 
proc- 
essor that is optimized 
for implementing 
algorithms 
such 
as 
compression/decompression 
of 
video 
frames. 
The core is comprised 
of a number 
of pro- 
cessing, 
storage, 
and 
input/output 
elements 
as 
shown 
in Figure 3. 


The 
various 
elements 
are 
connected 
via the 
two 
16-bit buses, the A bus and the B bus. During each 
instruction 
execution 
cycle, data can be transferred 
from 
a bus 
source 
to 
a bus 
destination 
on 
both 
buses, 


.. 
A BUS 
.. 


Sequencer 
~ 
B BUS 
~ 
Data 


RAM 
Microcode 
RAM 
I" 
1 
Microcode 
16 V 
16 


Instruction 
.. 


~[31: 
AlU 
I" 
Core 
L-- 
11. 
FlFOs 


Register 
.A 
r 
I" 
~ 


File 


... 
11. 


Barrel 
I" 
Statistical 
~ 
Shifter 


~ 
.. 
Decoder 
'-- 


Counters 
.-- 
I" 


11.[31: 
~ 
BEIt[3 
Pixel 
---... 


Interpolator 
['f- 
,.. 
.. 
Internal 
Bus Interface 


.••••VBUS 
I".. 
...• 


r 


0] 


2] 
:0] 


[3:0] 


272341-3 


Figure 3. 82750PD 
Core Diagram 
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2.2 
Universal Host Bus Interface 


The 82750PD has a Universal Host Bus Interface 
(UHBI) that supports ISA, PCI, VL-bus, EISA, and 
MicroChannel system buses. The bus is selected 
through hardwiring the BUSTYP[2:0] pins in the 
proper configuration upon the rising edge of RESET. 
For configuration details, see Section 5.0, Reset 
Configuration. The pin assignments for each of the 
host interfaces can be found in Section 3.2.1, Uni- 
versal Host Bus Interface. Detailed bus timings for 
each of the host interfaces can be found in Section 
4.5.1. 


2.3 
Shared Frame Buffer Interconnect 
(SFBI) 


The Shared Frame Buffer Interconnect (SFBI) is a 
multi-master interface to the shared frame buffer. 
The SFBI provides a high performance solution by 
freeing the host bus of large video and graphics- 
related data transfers. The SFBI is a 32/64-bit wide 
memory bus that can support up to 200 MByte/sec 
peak throughput to VRAM and DRAM. The pin as- 
signments for the SFBI can be found in Section 
3.2.2, Shared Frame Buffer Interconnect. Detailed 
bus timings can be found in Section 4.5.2. 


2.4 
SynchroLink Interface 


The SynchroLink interface provides a local method 
of synchronizing graphic, video, and audio events 
without relying on the use of host interrupts. The 
SynchroLink interface connects components to a 
time multiplexed serial bus where each device on 
the bus has an opportunity to transmit messages to 
other devices. The pin assignments for the Synchro- 
Link interface can be found in Section 3.2.3, Event 
Interfaces. Detailed bus timings can be found in Sec- 
tion 4.5.3. 


3.0 
PACKAGE 


3.1 
Package Introduction 


This section describes the pins, pinouts and thermal 
characteristics for the' 82750PD in the 208-pin Ce- 


82750PD 


ramic Pin Grid Array (PGA) package and the 196-pin 
Plastic Quad Flat Package (PQFP). For complete 
package specifications and information, see the 
Packaging 
Handbook 
(Order No. 240800). 


3.2 
Pin Descriptions 


The 82750PD pins are described in this section. De- 
scriptions of host interface bus cycles refer to data 
transferred into the 82750PD as write cycles and 
data transfered out of the 82750PD as read cycles. 
Table 1 presents the legend for interpreting the pin 1 
type descriptions in the following tables. The numer- 
ic pin type is preceded by an alpha designator of I 
for input, 0 for output, 1/0 for input/output, or OC for 
open-collector. 


Table 1 presents the legend for interpreting the pin 
type descriptions in the following tables. The "Type" 
column identifies the general electrical characteris- 
tics of the pin. Each type is composed of two com- 
ponents: a function and an electrical description. 
The function abbreviations are given in Table 1. A 
brief electrical desciption of the pin type is given in 
Table 2. See Section 4 for complete electrical char- 
acteristics of the pins. 


Table 
1. Pin Type Descriptions 


Pin Type 
Function 


I 
Input Only 
0 
Output Only 


1/0 
Both Input and Output 


OC 
Open Collector Output 


The pins associated with the Host Interface are de- 
scribed in Tables 3-7. 
Pins associated with the 
Shared Frame Buffer Interconnect are described in 
Table 8. Pins associated with the Event Interfaces 
are described in Table 9. Power, Ground, and Re- 
served Pins are described in Table 10. 
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Type 
Description 


1 
TTL Input Buffer 
2 
TTL Input Buffer, Schmitt Triggered 


3 
TTL Bi-Directional 
Buffer, IOL = IOH = 4 mA 
4 
TTL Bi-Directional 
Buffer, IOL = IOH = 6 mA 
5 
TTL Bi-Directional 
Buffer, IOL = IOH = 18 mA 
6 
TTL Bi-Directional 
Buffer, IOL = IOH = 24 mA 
7 
TTL Bi-Directional 
Buffer, Schmitt Triggered, 
IOL = 24 mA 
8 
TTL Bi-Directional, 
IOL = 6 mA, IOH = 2 mA, PCI Class 11 
9" 
TTL Three-State 
Output Buffer, IOL = IOH = 4 mA 
10" 
TTL Three-State 
Output Buffer, IOL = IOH = 6 mA 
11" 
TTL Three-State 
Output Buffer, IOL = IOH = 12 mA 
12" 
TTL Three-State 
Output Buffer, IOL = IOH = 18 mA 
13" 
TTL Three-State 
Output Buffer,loL 
= IOH = 24 mA 
14" 
TTL Three-State 
Output Buffer, IOl = 6 mA, IOH = 2 mA, PCI Class 11 


Table 
2. Pin Type 
Description 


NOTES: 
For complete 
electrical 
characteristics 
01 the pins, see Section 
4. 


'Some 
01 these 
pins may be internally 
conligured 
to lunction 
as open-collector 
outputs. 


3.2.1 
UNIVERSAL 
HOST BUS INTERFACE 


Table 
3. ISA Bus Interface 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


A[23:16) 
P04, R04, S03, 
101,102, 
1-8 
ADDRESS 
[23:16): 
Non-multiplexed 
address 
N04, P03, R03, 
104-109 
lines. 


S02, M04 


AD[15:0) 
R02, N03, L04, 
110,111, 
1/0-8 
ADDRESS-DATA 
[15:0): Multiplexed 
between 
002, 
R01, M03, 
113-120, 
System Address 
[15 to 0) and Data [15 to 0) 


L03, P02, K04, 
122-127 
N02, 001, 
K03, 
J04, E01, F02, 
J03 


AEN 
A02 
156 
1-1 
ADDRESS 
ENABLE: 
This active high signal is 
used to degate 
1/0 devices from the ISA bus to 
allow DMA transfers 
to take place. 


BALE 
C04 
147 
1-6 
BUFFERED 
ADDRESS 
LATCH 
ENABLE: 
Active 
high signal indicating 
a valid address on the ISA 
bus LA[23:17) 
lines. Addresses 
are latched on 
falling edge of this signal. 


DIR 
C01 
138 
OC-3 
DATA DIRECTION: 
Direction 
control 
line for data 
transceivers. 
Low = Read bus cycles. High = 
Write bus cycles. 


EN#[1:0) 
H04, H03 
130,129 
OC-14 
DATA OUTPUT 
ENABLE 
[1:0): Active low signal 
for the output enable of the external transceiver. 
(EN # (0) controls 
the low byte.) 


10CHRDY 
C06 
155 
OC-7 
1/0 CHANNEL 
READY: 
This signal is pulled low 
(not ready) by the device to lengthen 
ISA bus 
cycles. 
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Table 
3. ISA Bus Interface 
(Continued) 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


IOCSI6# 
803 
151 
OC-6 
I/O 
16-BIT CHIP SELECT: 
Active low signal indicating 
the present data transfer 
is a 16-bit, 1 wait-state, 
I/O 
cycle. 


10R# 
A01 
143 
1-1 
I/O READ: Active low input indicating 
the present 
bus 
cycle is an I/O read data transfer. 


LOW # 
C03 
144 
1·1 
I/O WRITE: Active low input indicating 
the present 
bus cycle is an I/O write data transfer. 


IRQ 
A17 
13 
0·13 
INTERRUPT 
REQUEST: 
This output is used to 
indicate that the 82750PD 
needs attention. 


MEMCS16# 
006 
152 
OC-13 
MEMORY 
16-BIT CHIP SELECT: 
Active low output 


indicating 
the present data transfer 
is a 1 wait-state, 


16-bit, memory cycle. 


MEMR# 
004 
146 
1-1 
MEMORY 
READ: Active low input indicating 
the 
present bus cycle is a memory read data transfer. 


MEMW# 
802 
145 
1-1 
MEMORY 
WRITE: Active low input indicating 
the 
present bus cycle is a memory write data transfer. 


NOWS# 
007 
153 
OC-6 
NO (ZERO) WAIT STATE 
REQUEST: 
This active low 
signal indicates 
the device can complete 
the present 
bus cycle without 
inserting any additional 
wait cycles. 


REFRESH # 
C05 
157 
1-2 
REFRESH: 
Active low input used to indicate 
a refresh 
cycle. 


RENA 
CO2 
139 
OC-3 
RECEIVER 
ADDRESS 
ENABLE: 
Active high output is 
used to enable external 
receiver to gate address 
bits 
15:0 onto AD [15:0]. 


RESET 
805 
159 
1-2 
RESET DRIVE: The Reset Drive is an active high input 
used to reset or initialize the 82750PD. 


S8HE# 
E04 
142 
1-3 
SYSTEM 
BUS HIGH BYTE ENABLE: 
An active low 
input signifying 
data transfer 
on the upper byte of the 
data bus (AD[15:81). 


Table 
4. EISA Bus Interface 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


AD[31:0] 
See Table 12 
See Table 16 
1/0-8 
ADDRESS-DATA 
[31:0]: 
Multiplexed 
between 
for Pin 
for Pin 
System Address 
[31 :2] and Data [31 :0] 


Numbers 
Numbers 


AENx 
A02 
156 
1·1 
ADDRESS 
ENABLE: 
Slot-specific 
Address 
Enable where x identifies 
the slot number 


8CLK 
F14 
11 
1·1 
BUS CLOCK: 
EISA system clock 


8E#[3] 
801 
141 
1·1 
BYTE ENABLE 
[3:0]: These active low inputs 
8E# [2:0] 
003, C02, C01 
140·138 
1·3 
define which byte of the data bus is used for the 
bus cycle. 


CMD# 
004 
146 
1-1 
COMMAND: 
Command 
is an active low input 
indicating 
that a channel 
cycle is in progress. 
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Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


DIR 
H03 
129 
OC-14 
DATA DIRECTION: 
Direction 
control 
for data 
transceivers; 
Low = Read, High = Write. 


EN#[23] 
E02 
133 
OC-3 
DATA LATCH ENABLE 
[23]: A single active low 
output to contol the data transceivers 
for the high 
two bytes AD[32:16]. 


EN#[1,0] 
F03, G03 
132,131 
OC-14 
DATA LATCH ENABLE 
[1,0]: Active low signal for 
the output enable of the external transceiver. 
(EN # [0] controls 
the low byte.) 


EX32# 
006 
152 
OC-12 
MEMORY 
32-BIT CHIP SELECT: Active low signal 
indicating 
the present data transfer 
is a 32-bit 
memory cycle. 


EXRDY 
C06 
155 
1/0-7 
CHANNEL 
READY: This signal is pulled low (not 
ready) by the device to lengthen 
the bus cycle. 


1016# 
803 
151 
OC-6 
I/O 
16-BIT CHIP SELECT: Active low signal 
indicating 
the present data transfer 
is a 16-bit, 
1 wait-state, 
110 cycle. 


IRQ 
A17 
13 
0-13 
INTERRUPT 
REQUEST: 
This output is used to 
indicate that the 82750PD 
needs attention. 


MIIO# 
802 
145 
1-1 
MEMORY-I/O: 
This signal distinguishes 
the bus 
cycle from being either a memory or an I/O 
(High = Memory cycle, Low = I/O cycle). 


MS8URST# 
005 
149 
1-1 
MASTER 
BURST: This active low input signal is 
used to indicate that the current 
EISA bus master is 
capable of supporting 
the next cycle as a burst cycle. 


NOWS# 
007 
153 
OC-6 
NO (ZERO) WAIT STATE: This active low output 
indicates 
that the device can complete 
the present 
bus cycle without 
inserting any additional 
wait cycles. 


REFRESH# 
C05 
157 
1-2 
REFRESH: Active low input used to indicate a 
refresh cycle. 


RENA 
H04 
- 
130 
OC-14 
RECEIVER 
ADDRESS 
ENABLE: Active high output 
is used to enable external receiver to gate address 
bits 31:0 onto AD[31 :0]. 


RESET 
805 
159 
1-2 
SYSTEM 
RESET: Reset is an active high input used 
to reset or initialize the 82750PD. 


SL8URST# 
C04 
147 
OC-6 
SLAVE BURST: This active low output signal is 
driven by the 82750PD 
to indicate that it is capable 
of 
accepting 
burst cycles. 


START # 
A01 
143 
1-1 
START CYCLE: An active low input signal used to 
indicate the start of an EISA cycle. 


W/R# 
C03 
144 
1-1 
WRITE/READ: 
This signal distinguishes 
the bus 
cycle as either a read or write transfer 
(High = Write, 
Low = Read). 


Table 4. EISA Bus Interface 
(Continued) 
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Table 
5. Micro 
Channellnterface-32-Blt 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


AO[31:0) 
See Table 12 
See Table 16 
110-8 
ADDRESS-DATA 
[31:0): Multiplexed 


for Pin 
for Pin 
between 
System Address 
[31 :0) and 


Numbers 
Numbers 
Data [31:0). 


BE#(3) 
B01 
141 
1·1 
BYTE ENABLE[3:0): 
These active low 
BE#[2:0) 
003, C02, C01 
140-138 
1-3 
input signals identify which byte of the 
data lines is valid for the cycle. 


CHROY 
E02 
133 
0-3 
CHANNEL 
READY: This active high 
output is driven low by the 82750PO to 
extend a command 
cycle. 


CMO# 
004 
146 
1-1 
COMMAND: 
Command 
is an active low 
input indicating 
that a channel 
cycle is in 


progress. 


OIR 
H03 
129 
OC-14 
DATA 
DIRECTION: 
Oirection control 
line 


for data transceivers. 
Low = Read bus 
cycles. High = Write bus cycles. 


OS16#10S32# 
E03 
135 
0-3 
DATA SIZE 16 OR 32: Active low output 
signal used for the OS16# 
and OS32# 
signal of an EISA bus. In a 32-bit EISA 
system, the 82750PO will drive both the 
OS16# 
and OS32# 
signals, with this pin 


indicating 
a 32-bit transfer. 
In a 16-bit 
EISA system, this signal will drive only 
the 0816# 
signal of the EISA bus, 


indicating 
a 16-bit transfer. 


EN#[1:0) 
F03,G03 
132 
OC-14 
DATA OUTPUT 
ENABLE 
(1): Active low 
signal for the output enable of the 
external transceiver. 
(EN # (0) controls 
the low byte.) 


IRQ 
A17 
13 
OC-13 
INTERRUPT 
REQUEST: 
This output is 
used to indicate that the 82750PO 
requires attention. 


M/IO# 
B02 
145 
1-1 
MEMORY 
- 1/0: This signal distinguishes 


the bus cycle from being either a memory 
or an 1/0 cycle (High = Memory cycle, 
Low = 1/0 cycle). 


MAOE24 
B03 
151 
1-6 
MEMORY 
ADDRESS 
ENABLE 
24: This 
active high input signal is active when the 
memory address is less than 16M. 


REFRESH# 
C05 
157 
1-2 
REFRESH: 
Active low input signal used 
to indicate a refresh cycle. 


RENA 
H04 
130 
OC-14 
RECEIVER 
ADDRESS 
ENABLE: 
Active 
high output is used to enable external 
receiver to gate address 
bits 31:0 onto 
AO[31 :0). 


RESET 
B05 
159 
1-2 
SYSTEM 
RESET: The reset is an active 
high input used to reset or initialize the 
82750PO 


III 
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Table 
5. Micro 
Channellnterface-32-Blt 
(Continued) 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


S1# 
C03 
144 
1·1 
STATUS 
BITS [1,0]: These bits indicate the start and type 
SO# 
A01 
143 
of cycle. The commands 
are: 


SO# 
S1# 
Function 
0 
0 
Reserved 
0 
1 
Write 
1 
0 
Read 
1 
1 
Reserved 


SBHE# 
E04 
142 
1-3 
SYSTEM 
BYTE HIGH ENABLE: 
Active low input to 
enable transfer 
of data on the high data byte (D[15:8]), 
and is.used with A[O] to distinguish 
between 
high and low 
byte transfers. 


SETUP# 
A02 
156 
1-1 
CARD SETUP: An active low input used to place the 
82750PD 
in Setup Mode to enable programming 
and read 
operations 
of the chip POS functions. 


SFDBK# 
G04 
134 
0-3 
CARD SELECT 
FEEDBACK: 
An active low output 
indicating 
that an adapter is present at the address 
specified 
during a channel 
cycle. 


Table 
6. PCI Interface 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


AD[31:0] 
See Table 12 
See Table 16 
1/0-8 
ADDRESS-DATA 
[31:0]: 
Multiplexed 
address 
for Pin 
for Pin 
and data lines. 


Numbers 
Numbers 


C/BE#[3:0] 
F04, D02, 
137-134 
1-3 
BUS COMMAND-BYTE 
ENABLE 
[3:0]: 
E03, G04 
'These 
lines are multiplexed 
between 
the bus 
command 
and the byte enables. 
During the 
, 
address phase of a transaction 
these lines are 
used to define the bus command. 
During the 
data phase these lines are used as byte 
enables. 


ClK 
F14 
11 
1-1 
CLOCK: 
Input signal which provides timing for 
all transactions 
on PC!. All signals are 
sampled 
on the rising edge of ClK. 


DEVSEl# 
G03 
131 
1/0-14 
DEVICE SELECT: 
Active low output indicating 
the 82750PD 
has decoded 
its address as the 
target of the current PCI access. 


FRAME# 
E02 
133 
1-3 
CYCLE 
FRAME: Active low input indicating 
the beginning 
and duration 
of an access. 
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Table 
6. PCllnterface 
(Continued) 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


IDSEL 
A02 
156 
1-1 
INITIALIZATION 
DEVICE SELECT: 
This input is used as 


a chip select during configuration 
read and write 
transactions. 


IRDY# 
007 
153 
1-6 
INITIATOR 
READY: Active low input to signify that data is 


present 
on AD[31 :0] during write cycles, and to signify the 


PCI Master is prepared 
to accept data during read cycles. 


IRQ 
A17 
13 
0·13 
INTERRUPT 
REQUEST: 
This output is used to indicate 


that the 82750PD 
requires attention . 


PAR 
H03 
129 
1/0-14 
PARITY: 
This input/output 
pin completes 
even parity 
across AD [31:0] and C/8E# 
[3:0]. 


RESET 
805 
159 
1-2 
RESET: Reset is an active high input used to reset or 
initialize the 82750PD. 


STOP # 
H04 
130 
1/0-14 
STOP: Active low output from the 82750PD 
that requests 


the PCI master to stop the current transaction. 


TRDY# 
F03 
132 
1/0-14 
TARGET 
READY: 
Active low output to signify data is 


present on AD[31 :0] during read cycles, and to signify the 
82750PD 
is prepared to accept data during write cycles. 


Table 
7. VL-Bualnterface 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


AD[31:0] 
See Table 12 
See Table 16 
1/0-8 
ADDRESS-DATA 
[31:0]: 
Multiplexed 
address 


for Pin 
for Pin 
[31:2] and data [31:0] lines. 


Numbers 
Numbers 


8E#[3:0] 
F04,002, 
137-134 
1-3 
BYTE ENABLE 
[3:0]: These active low input 


E03,G04 
signals describe 
which bytes of the 32 bits of 
data are involved with the current VL-Bus 
transfer. 


BLAST # 
005 
149 
1-1 
BURST LAST: Active low input indicating 
that 


the next time the BRDY # signal is asserted 
the 


burst cycle will complete. 


8RDY# 
007 
153 
OC-6 
BURST READY: Active low output to terminate 
the current cycle of a burst transfer. 


D/C# 
CO2 
139 
1-1 
DATA-CODE 
STATUS: 
This signal indicates 


whether 
the current cycle is transferring 
data or 


code. 


DIR 
H03 
129 
OC-3 
DATA DIRECTION: 
Direction 
control 
line for 
data transceivers. 
Low = Read bus cycles. High 
= Write bus cycles. 


EN# 
G03 
131 
OC-14 
DATA OUTPUT 
ENABLE: 
Active low signal for 


the output enable of external data transceivers. 
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Table 7. VL-Bus Interface 
(Continued) 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


ID[4:0] 
D04, B02, 
146-142 
1-1 
IDENTIFIER 
[4:0]: These input pins are used to 
C03, A01, E04 
1-3 
identify the type and speed of the host CPU. 


IRQ 
A17 
13 
0-13 
INTERRUPT 
REQUEST: This output is used to 


indicate that the 82750PD 
requires attention. 


LADS # 
C04 
147 
1-6 
LOCAL ADDRESS 
DATA STROBE: This active 


low input indicates 
the start of a cycle. 


LCLK 
F14 
11 
1-1 
LOCAL CPU CLOCK: Input signal which is a 1X 
clock that follows the same phase as an i486TM 
CPU. 


LDEV# 
F03 
132 
OC-14 
LOCAL DEVICE: This output is used to signal 
that the current cycle is a VL-Bus cycle. 


LRDY# 
C06 
155 
110-7 
LOCAL READY: Active low output to terminate 
the current bus cycle. 


M/IO# 
C01 
138 
1-3 
MEMORY 
OR I/O STATUS: This input indicates 
the type of access currently 
executing 
on the VL- 
Bus. A high signifies a memory cycle. A low on 
this signal signifies an 110 cycle. 


RDYRTN# 
A02 
156 
1-1 
READY RETURN: Active low input signal which 
indicates 
the end of the current cycle. 


RESET 
B05 
159 
1-2 
SYSTEM 
RESET: This active high input forces 
the 82750PD 
into a known state. 


RENA 
H04 
130 
OC-14 
RECEIVER 
ADDRESS 
ENABLE: Active high 
output is used to enable external 
receiver to gate 
address bits 31:0 onto AD[31:0]. 


W/R# 
E02 
133 
1-3 
WRITE OR READ STATUS: This input indicates 
the type of access currently 
executing 
on the VL- 


Bus. A write access is indicated 
by a high on this 
pin, while a read access is indicated 
by a low on 


this pin. 
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3.2.2 
SHARED 
FRAME 
BUFFER 
INTERCONNECT 
(SFBI) 


Table 8. Shared Frame Buffer Interconnect 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


CAS # [3:0] 
P17, Ll4, L15, 
40-37 
OC·5 
COLUMN 
ADDRESS 
STROBE 
[3:0]: These 
N16 
active low outputs are the column address 
strobes to each RAM bank, CAS# 
[0] to the 
first bank, CAS # [1] to the second, 
etc. 


DSF 
H15 
27 
OC·10 
SPECIAL 
FUNCTION 
INPUT FLAG: Active 
high output used when new functions 
such as 
flash write cycles are used. 


GRANT 
H17 
26 
1·2 
GRANT: Active high input signal from the 
SFBI arbiter signifying 
the 82750PD 
has 
mastership 
of the SFBI. 


MA[9:0] 
H14, E17, D17, 
23·14 
OC·11 
MEMORY 
ADDRESS 
[9:0]: These ten bits of 
F15, E16, G14, 
address are multiplexed 
with the row and 


C17, D16, 817, 
column address. 
Supports 
either syrnmetnc- 


C16 
type RAMs (9 row/9 
column) 
or asymmetric- 
type (10 row/8 
column). 


MCLK 
E15 
12 
1·1 
MEMORY 
CLOCK: Clock input for SFBI 
timing. 


MD[63:0] 
See Tables 
12 
See Tables 
16 
1/0·4 
MEMORY 
DATA [63:0]: SFBI 64 bits of data. 
and 13 for Pin 
and 17 for Pin 
MD[63:32] 
are not used in the 1 MB 


Numbers 
Numbers 
configuration. 


MPRQ[1,O] 
F17,G15 
25,24 
1/0·4 
MEMORY 
PRIORITY 
REQUEST 
[1,0]: Two- 


bit priority output from the 82750PD 
to be 
used in SFBI arbitration. 


MPRQ[1] 
MPRQ[O] 
Function 
0 
0 
Highest Priority 
0 
1 
Medium Priority 
1 
0 
Lowest Priority 
1 
1 
No Priority 
(idle) 


RAS# 
M15 
41 
OC·12 
ROW ADDRESS 
STROBE: This active low 
output is the row address strobe to all RAM 
banks. 


TRG/OE# 
J17 
28 
OC·10 
TRANSFER-oUTPUT. 
ENABLE: This 
function 
pin is used to select serial transfers 
or output enable of VRAM. 


WE#[7:0] 
N17, M16, K14 
36-29 
OC-10 
WRITE 
ENABLES 
[7:0]: These active low 
K15,L17,K17 
outputs are the write enables for each data 


J15, J14 
byte (WE#[O] 
to MD[7:0], 
WE#[1] 
to 
MD[15:81, etc.). 
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3.2.3 
EVENT 
INTERFACES 


Table 
9. Event 
Interfaces 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


CLKOUT 
(for ISA) 
F03 
132 
0-3 
CLOCK 
OUT: This output is the 
internal clock synchronized 
to MCLK. 


This clock is half the frequency 
of 
MCLK and can be used to follow the 
internal state of the 82750PO. 


CLKOUT 
(for PCI) 
CO2 
139 
0-3 
CLOCK 
OUT: This output is the 
internal clock synchronized 
to MCLK. 


.This clock is half the frequency 
of 
MCLK and can be used to follow the 
internal state of the 82750PO. 


PMON '*' (for ISA) 
G03 
131 
0-14 
PROGRAM 
MONITOR: 
Active low 
output available 
for software 
debug. 


PMON '*' (for PCI) 
C01 
138 
0-3 
PROGRAM 
MONITOR: 
Active low 
output available 
for software 
debug. 


SLOATA,*,' 
N15 
42 
1/0-4 
SynchroLlnk 
OAT A: Active low signal 
for serial transfer 
of all information 
across the SynchroLink. 


SLOATA' 
M14 
43 
1/0-4 
SynchroLlnk 
CLOCK: 
The rising edge 
of this signal is used to latch incoming 
data on the SLOAT A '*' line. The falling 
edge is used by the 82750PO to 
enable output data on the SLOAT A'*' 
line. 


VBUS[3:0], 
F04, 002, E03, 
137-134 
1-3 
VIDEO COMMUNICATION 
BUS [3:0]: 
(for ISA) 
G04 
Asynchronous 
inputs to communicate 
events to the 82750PO. 


VBUS[3,2,O] 
C03, A01, 
144,143,141 
1-1 
VIDEO COMMUNICATION 
BUS [3:0]: 
VBUS[1]' 
(for PCI) 
B01,E04 
142 
1-3 
Asynchronous 
inputs to communicate 
events to the 82750PO. 


'Either the SynchroUnk interface or the V8US Input IS operational as the event Interface, but not both. Refer to the 
82750PD 
Programmer's 
ReferefICB 
Manual 
(Order No. 272352) for additional information on configuring which interface is 
used for event communication. 
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3.2.4 
POWER, 
GROUND, 
AND 
RESERVED 
PINS 


Table 
10. Power, 
Ground, 
and Reserved 
Pins 


Name 
PGA-Pin 
PQFP-Pln 
Type 
Description 


Reserved 
001 
128 
- 
RESERVED: 
This pin is not used and should 
not be connected. 


Vcc 
N01, 508, A09, 
54, 60, 72, 84, 
- 
POWER: Power pins provide the + 5V D.C. 


M01, A11, F01, 
96,150, 
158 
supply input. 


A05, G01, 505, 
174, 190, 196 
L01, K01, A06, 
J01, P01, H01, 
S11,M17,S12, 
G17 


Vss 
808,L16,806 
1,3,8,46,50, 
- 
GROUND: 
Ground pins provide the OV 
811, G02, K16, 
52, 56, 64, 65, 
connection 
to which all inputs and outputs are 


812, F16, M02, 
68,80,81,91, 
referenced. 


J16, H16, R07, 
92,98,103, 
R12, G16, R08, 
112,121,148, 


K02, H02, R10, 
154,162,167, 
J02, R11, A16, 
168,170,178, 
807, C11, C14, 
182,185,186, 
L02, P07, P13, 
194 
003,017, 
R09, 
R15,S01 
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3.3 Pinout 
Table 12- 
Universal Host Bus Interface A/O 
Sig- 
nals (Pin Order) 
Table 13- 
Other Signals (Pin Order) 
Table 14- 
Host Interface AlO Signals 
(Name Order) 
Table 15- 
Other Signals (Name Order) 


3.3.1 PGA 


This group of tables presents the host interface sig- 
nals for the Pin Grid Array (PGA) package. The ta- 
bles are structured as follows: 
Table 11- 
Universal Host Bus Interface Control 
Signals (Pin Order) 


Table 11. Universal Host Bus Interface Control Slgnals-PGA 
Package (Pin Order) 


Pin # 
ISA 
EISA 
Micro Channel 
PCI 
VL-Bus 


A01 
10R# 
START# 
SO# 
VBUS[2)(1) 
10[1) 


A02 
AEN 
AEN 
SETUP# 
10SEL 
ROYR# 


A17 
IRQ 
IRQ 
IRQ 
IRQ 
IRQ 


B01 
(3) 
BE#[3) 
BE#[3) 
VBUS[O)(1) 
(3) 


B02 
MEMW# 
M/IO# 
M/IO# 
(3) 
10[3) 


B03 
IOCS16# 
1016# 
MAOE24 
(2) 
(3) 


B05 
RESET 
RESET 
RESET 
RESET 
RESET 


C01 
OIR 
BE#[O) 
BE#[O) 
PMON#(1) 
M/IO# 


CO2 
RENA 
BE#[1) 
BE#[1) 
CLKOUT(1) 
O/C# 


C03 
10W# 
W/R# 
S1# 
VBUS[3)(1) 
10[2) 


C04 
BALE 
SLBURST# 
(3) 
(2) 
LAOS# 


C05 
REFRESH# 
REfRESH# 
REFRESH# 
(3) 
(3) 


C06 
10CHROY 
EXROY 
(3) 
(2) 
LROY# 


002 
VBUS[2)(1) 
(2) 
(3) 
C/BE#[2) 
BE#[2) 


003 
(2) 
BE#[2) 
BE#[2) 
(2) 
(2) 


004 
MEMR# 
CMO# 
CMO# 
(3) 
10[4) 


005 
(3) 
MSBURST# 
(3) 
(3) 
BLAST# 
006 
MEMCS16# 
EX32# 
(3) 
(2) 
(3) 


007 
NOWS# 
NOWS# 
(3) 
IROY# 
BROY# 


E02 
(3) 
EN#[23) 
CHROY 
FRAME# 
W/R# 
E03 
VBUS[1)1 
(2) 
OS16#/OS32# 
C/BE#[1) 
BE#[1) 


E04 
SBHE# 
(2) 
SBHE# 
VBUS[1)(1) 
10[0) 


F03 
CLKOUT(1) 
EN#[1) 
EN#[1) 
TROY# 
LDEV# 
F04 
VBUS[3)(1) 
(3) 
(3) 
C/BE#[3) 
BE#[3) 


F14 
(3) 
BCLK 
(3) 
CLK 
LCLK 
G03 
PMON#(1) 
EN#[O) 
EN#[O) 
OEVSEL# 
EN# 
G04 
VBUS[O)(1) 
(2) 
SFOBK# 
C/BE#[O) 
BE#[O) 
H03 
EN#[O) 
OIR 
OIR 
PAR 
OIR 
H04 
EN#[1) 
RENA 
RENA 
STOP# 
RENA 


NOTES: 
1. These 
signals 
share 
pins used for the Host Interface 
but are described 
in the Event 
Interface. 


2. These 
pins are no-connects. 
3. These 
pins must be tied to Vss- 
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Table 
12. Universal 
Host Bus Interface 
Address/Data 
Signals-PGA 
Package 
(Pin Order) 


Pin # 
ISA 
EISA 
Micro Channel 
PCI 
VL-BUS 


E01 
AD[2] 
AD[2] 
AD[2] 
AD[2] 
AD[2] 


F02 
AD[1] 
AD[1] 
AD[1] 
AD[1] 
AD[1] 


J03 
AD[O] 
AD[O] 
AD[O] 
AD[O] 
AD[O] 


J04 
AD[3] 
AD[3] 
AD[3] 
AD[3] 
AD[3] 


K03 
AD[4] 
AD [4] 
AD[4] 
AD[4] 
AD [4] 


K04 
AD[7] 
AD[7] 
AD[7] 
AD[7] 
AD[7] 


L03 
AD[9] 
AD[9] 
AD[9] 
AD[9] 
AD[9] 


L04 
AD[13] 
AD[13] 
AD[13] 
AD[13] 
AD[13] 


M03 
AD[10] 
AD[10] 
AD[10] 
AD[10] 
AD[10] 


M04 
A[16] 
AD[16] 
AD[1S] 
AD[16] 
AD[1S] 


N02 
AD[S] 
AD[S] 
AD[S] 
AD[S] 
AD[S] 


N03 
AD[14] 
AD[14] 
AD[14] 
AD[14] 
AD[14] 


N04 
A[20] 
AD[20] 
AD [20] 
AD[20] 
AD[20] 


P02 
AD[S] 
AD[S] 
AD[S] 
AD[S] 
AD[S] 


P03 
A[19] 
AD[19] 
AD[19] 
AD[19] 
AD[19] 


P04 
A[23] 
AD[23] 
AD [23] 
AD[23] 
AD[23] 


P05 
AD [27] 
AD[27] 
AD [27] 
AD[27] 


POS 
AD [2S] 
AD[2S] 
AD[2S] 
AD[2S] 


001 
AD[5] 
AD[5] 
AD[5] 
AD[5] 
AD[5] 


002 
AD[12] 
AD[12] 
AD[12] 
AD[12] 
AD[12] 


004 
AD[25] 
AD[25] 
AD [25] 
AD[25] 


005 
AD[2S] 
AD[2S] 
AD[2S] 
AD[2S] 


OOS 
AD[31] 
AD[31] 
AD[31] 
AD[31] 


R01 
AD[11] 
AD[11] 
AD[11] 
AD[11] 
AD[11] 


R02 
AD[15] 
AD[15] 
AD[15] 
AD[15] 
AD[15] 


R03 
A[1S] 
AD[1S] 
AD[1S] 
AD[1S] 
AD[1S] 


R04 
A[22] 
AD [22] 
AD[22] 
AD [22] 
AD[22] 


R05 
AD[29] 
AD [29] 
AD[29] 
AD[29] 


ROS 
AD[30] 
AD [30] 
AD[30] 
AD[30] 


502 
A[17] 
AD[17] 
AD[17] 
AD[17l 
AD[17l 


503 
A[21] 
AD[21) 
AD[21] 
AD[21] 
AD[21] 


504 
AD [24] 
AD[24] 
AD [24] 
AD[24] 


NOTE: 
Blank pins must be tied to Vss. 
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Table 13. Other Slgnals-PGA 
Package (Pin Order) 


Pin 
Name 


A03 
MO[61) 


A04 
MO[58) 


A05 
Vcc 


A06 
Vcc 


A07 
MO[57) 


A08· 
MO[55) 


A09 
Vcc 


A10 
MD [52) 


A11 
Vcc 


A12 
MO[51) 


A13 
MO[48) 


A14 
MO[45) 


A15 
MO[36) 


A16 
Vss 


B04 
MO[63) 


B06 
Vss 


B07 
Vss 


B08 
Vss 


B09 
MD [53) 


B10 
MD [50) 


B11 
Vss 


B12 
Vss 


B13 
MO[46) 


B14 
MO[41) 


B15 
MO[35) 


NOTE: 
"The 
reserved 
pin must be left unconnected. 
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Pin 
Name 


B16 
MO[32) 


B17 
MA[1) 


C07 
MO[60) 


C08 
MO[59) 


C09 
MO[54) 


C10 
MO[49) 


C11 
Vss 


C12 
MO[44) 


C13 
MO[43) 


C14 
Vss 


C15 
MO[37) 


C16 
MA[O) 


C17 
MA[3) 


001 
reserved" 


008 
MO[62) 


009 
MO[56) 


010 
MO[47) 


011 
MO[40) 


012 
MO[42) 


013 
MO[39) 


014 
MO[38) 


015 
MO[33) 


016 
MA[2) 


017 
MA[7) 


E14 
MO[34) 


Pin 
Name 


E15 
MCLK 


E16 
MA[5) 


E17 
MA[8) 


F01 
Vcc 


F15 
MA[6) 


F16 
Vss 


F17 
MPRQ[1) 


G01 
Vcc 


G02 
Vss 


G14 
MA[4) 


G15 
MPRQ[O) 


G16 
Vss 


G17 
Vcc 


H01 
Vcc 


H02 
Vss 


H14 
MA[9) 


H15 
OSF 


H16 
Vss 


H17 
GRANT 


J01 
'Vcc 


J02 
Vss 


J14 
WE#[O) 


J15 
WE#[1) 


J16 
Vss 
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Table 13. Other Slgnals-PGA 
Package 
(Pin Order) (Continued) 


Pin 
Name 


J17 
TRG/OE 


K01 
Vcc 


K02 
Vss 


K14 
WEfI[5] 


K15 
WEfI[4] 


K16 
Vss 


K17 
WEfI[2] 


L01 
Vcc 


L02 
Vss 


L14 
CASfI[2] 


L15 
CASfI[1] 


L16 
Vss 


L17 
WEfI[3] 


M01 
Vcc 


M02 
Vss 


M14 
SLCLK 


M15 
RASfI 


M16 
WEfI[6] 


M17 
Vcc 


N01 
Vcc 


N14 
MD[2] 


N15 
SLDATAfI 


N16 
CASfI[O] 


N17 
WEfI[7] 


P01 
Vcc 


Pin 
Name 


P07 
Vss 


P08 
MD[29] 


P09 
MD[23] 


P10 
MD[16] 


P11 
MD[13] 


P12 
MD[7] 


P13 
Vss 


P14 
MD[5] 


P15 
MD[1] 


P16 
MD[O] 


P17 
CASfI[3] 


003 
Vss 


007 
MD[30] 


008 
MD[26] 


009 
MD[25] 


010 
MD[20] 


011 
MD[19] 


012 
MD[14] 


013 
MD[9] 


014 
MD[11] 


015 
MD[6] 


016 
MD[3] 


017 
Vss 


R07 
Vss 


Pin 
Name 


R08 
Vss 


R09 
Vss 


R10 
Vss 


R11 
Vss 


R12 
Vss 


R13 
MD[17l 


R14 
MD[10] 


R15 
Vss 


R16 
MD[8] 


R17 
MD[4] 


S01 
Vss 


S05 
Vcc 


S06 
MD[31] 


S07 
MD[28] 


S08 
Vcc 


S09 
MD[27] 


S10 
MD[24] 


S11 
Vcc 


S12 
Vcc 


S13 
MD[22] 


S14 
MD[21] 


S15 
MD[18] 


S16 
MD[15] 


S17 
MD[12] 
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Table 14. Universal 
Host Bus Interface 
Address/Data 
Signals-PGA 
Package 
(Name Order) 


Pin 
#< 
ISA 
EISA 
Micro Channel 
PCI 
VL-Bus 


J03 
AD[O] 
AD[O] 
AD[O] 
AD[O] 
AD[O] 


F02 
AD[1] 
AD[1] 
AD[1] 
AD[l] 
AD[1] 


E01 
AD[2] 
AD[2] 
AD[2] 
AD[2] 
AD[2] 


J04 
AD[3] 
AD[3] 
AD[3] 
AD[3] 
AD[3] 


K03 
AD[4] 
AD[4] 
AD[4] 
AD[4] 
AD [4] 


001 
AD[5] 
AD[5] 
AD[5] 
AD[5] 
AD[5] 


N02 
AD[6] 
AD[6] 
AD[6] 
AD[6] 
AD[6] 


K04 
AD[7] 
AD[7] 
AD[7] 
AD[7] 
AD[7] 


P02 
AD[8] 
AD[8] 
AD[8] 
AD[8] 
AD[8] 


L03 
AD[9] 
AD[9] 
AD[9] 
AD[9] 
AD[9] 


M03 
AD[10] 
AD[10] 
AD[10] 
AD[10] 
AD[10] 


R01 
AD[11] 
AD[11] 
AD[11] 
, AD[11] 
AD[11] 


002 
AD[12] 
AD[12] 
AD[12] 
AD[12] 
AD[12] 


L04 
AD[13] 
AD[13] 
AD[13] 
AD[13] 
AD[13] 


N03 
AD[14] 
AD[14] 
AD[14] 
AD[14] 
AD[14] 


R02 
AD[15] 
AD[15] 
AD[15] 
AD[15] 
AD[15] 


M04 
A[16] 
AD[16] 
AD[16] 
AD[16] 
AD[16] 


802 
A[17] 
AD [17] 
AD[17] 
AD[17] 
AD[17] 


R03 
A[18] 
AD[18] 
AD[18] 
AD[18] 
AD[18] 


P03 
A[19] 
AD[19] 
AD[19] 
AD[19] 
AD[19] 


N04 
A[20] 
AD[20] 
AD[20] 
AD[20] 
AD[20] 


803 
A[21] 
AD[21] 
AD[21] 
AD[21] 
AD[21] 


R04 
A[22] 
AD [22] 
AD[22] 
AD[22] 
AD[22] 


P04 
A[23] 
AD[23] 
AD[23] 
AD[23] 
AD[23] 


804 
AD [24] 
AD[24] 
AD[24] 
AD [24] 


004 
AD[25] 
AD[25] 
AD[25] 
AD [25] 


P06 
AD[26] 
AD[26] 
AD [26] 
AD [26] 


P05 
AD[27] 
AD[27] 
AD [27] 
AD[27] 


005 
AD[28] 
AD [28] 
AD[28] 
AD[28] 


R05 
AD[29] 
AD [29] 
AD[29] 
AD[29] 


R06 
AD[30] 
AD[30] 
AD[30] 
AD[30] 


006 
AD[31] 
AD[31] 
AD[31] 
AD[31] 


NOTE: 
Blank pins are no-connect 
for the ISA. 
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Table 
15. Other 
Signals-PGA 
Package 
(Name Order) 


Name 
Pin 


CA8#[3] 
P17 


CA8#[2] 
L14 


CA8#[1] 
L15 


CA8#[0] 
N16 


D8F 
H15 


GRANT 
H17 


MA[9] 
H14 


MA[8] 
E17 


MA[7] 
017 


MA[6] 
F15 


MA[5] 
E16 


MA[4] 
G14 


MA[3] 
C17 


MA[2] 
016 


MA[1] 
817 


MA[O] 
C16 


MCLK 
E15 


MD[63] 
804 


MD[62] 
008 


MD[61] 
A03 


MD[60] 
C07 


MD[59] 
C08 


MD[58] 
A04 


MD[57] 
A07 


MD[56] 
009 


MD[55] 
A08 


MD [54] 
C09 


MD[53] 
809 


MD[52] 
A10 


MD[51] 
A12 


MD[50] 
810 


MD[49] 
C10 


MD[48] 
A13 


MD[47] 
010 


NOTE: 
·The 
reserved 
pin must be tied to Vss. 


Name 
Pin 


MD[46] 
813 


MD[45] 
A14 


MD[44] 
C12 


MD[43] 
C13 


MD[42] 
012 


MD[41] 
814 


MD[40] 
011 


MD[39] 
013 


MD[38] 
014 


MD[37] 
C15 


MD[36] 
A15 


MD[35] 
815 


MD[34] 
E14 


MD[33] 
015 


MD[32] 
816 


MD[31] 
806 


MD[30] 
007 


MD[29] 
P08 


MD[28] 
807 


MD[27] 
809 


MD[26] 
008 


MD[25] 
009 


MD[24] 
810 


MD[23] 
P09 


MD[22] 
813 


MD[21] 
814 


MD[20] 
010 


MD[19] 
011 


MD[18] 
815 


MD[17] 
R13 


MD[16] 
P10 


MD[15] 
816 


MD[14] 
012 


MD[13] 
P11 


Name 
Pin 


MD[12] 
817 


MD[11] 
014 


MD[10] 
R14 


MD[9] 
013 


MD[8] 
R16 


MD[7] 
P12 


MD[6] 
015 


MD[5] 
P14 


MD[4] 
R17 


MD[3] 
016 


MD[2] 
N14 


MD[1] 
P15 


MD[O] 
P16 


MPRO[1] 
F17 


MPRO[O] 
G15 


RA8# 
M15 


reserved" 
001 


8LDATA# 
N15 


8LCLK 
M14 


TRG/OE 
J17 


Vcc 
N01 


Vcc 
808 


Vcc 
A09 


Vcc 
M01 


Vcc 
A11 


Vcc 
F01 


Vcc 
A05 


Vcc 
G01 


Vcc 
805 


Vcc 
L01 


Vcc 
K01 


Vcc 
A06 


Vcc 
J01 


Vcc 
P01 
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Table 
15. Other Slgnars-PGA 
Package 
(Name Order) 
(Continued) 


Name 
Pin 


Vcc 
H01 


Vcc 
511 


Vcc 
M17 


Vcc 
512 


Vcc 
G17 


Vss 
B08 


Vss 
L16 


Vss 
B06 


Vss 
B11 


Vss 
G02 


Vss 
K16 


Vss 
B12 


Vss 
F16 


Vss 
M02 


Vss 
J16 
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Name 
Pin 


Vss 
H16 


Vss 
R07 


Vss 
R12 


Vss 
G16 


Vss 
R08 


Vss 
K02 


Vss 
H02 


Vss 
R10 


Vss 
J02 


Vss 
R11 


Vss 
A16 


Vss 
B07 


Vss 
C11 


Vss 
C14 


Vss 
L02 


Name 
Pin 


Vss 
P07 


Vss 
P13 


Vss 
003 


Vss 
017 


Vss 
R09 


Vss 
R15 


Vss 
501 


WE#[7] 
N17 


WE#[6] 
M16 


WE#[5] 
K14 


WE#[4] 
K15 


WE#[3] 
L17 


WE#[2] 
K17 


WE#[1] 
J15 


WE#[O] 
J14 


Pin # 
ISA 
EISA 
Micro Channel 
PCI 
VL-Bus 


11 
(3) 
BCLK 
(3) 
CLK 
LCLK 


13 
IRQ 
IRQ 
IRQ 
IRQ 
IRQ 


129 
EN#[O] 
DIR 
DIR 
PAR 
DIR 


130 
EN#[1] 
RENA 
RENA 
STOP# 
RENA 


131 
PMON#(1) 
EN#[O] 
EN#[O] 
DEVSEL# 
EN# 


132 
CLKOUT(1) 
EN#[1] 
EN#[1] 
TRDY# 
LDEV# 


133 
(3) 
EN#[23] 
CHRDY 
FRAME# 
W/R# 


134 
VBUS[O](1) 
(2) 
SFDBK# 
C/BE#[O] 
BE#[O] 


135 
VBUS[1](1) 
(2) 
DS16#/32# 
C/BE#[1] 
BH[1] 


136 
VBUS[2](1) 
(2) 
(3) 
C/BE#[2] 
BE#[2] 


137 
VBUS[3](1) 
(3) 
(3) 
C/BE#[3] 
BE#[3] 


138 
DIR 
BE#[O] 
BE#[O] 
PMON#(1) 
M/IO# 


139 
RENA 
BE#[1] 
BE#[1] 
CLKOUT(1) 
DC# 


140 
(2) 
BE#[2] 
BE#[2] 
(2) 
(2) 


141 
(3) 
BE#[3] 
BE#[3] 
VBUS[O](1) 
(3) 


142 
SBHE# 
SBHE# 
VBUS[1](1) 
10[0] 


143 
10R# 
START# 
SO# 
VBUS[2](1) 
10[1] 


144 
10W# 
W/R# 
SH 
VBUS[3](1) 
10[2] 


145 
MEMW# 
M/IO# 
M/IO# 
(3) 
10[3] 


146 
MEMR# 
CMD# 
CMD# 
(3) 
10[4] 


147 
BALE 
SLBURST# 
(3) 
(2) 
LADS# 


149 
(3) 
, 
MSBURST# 
(3) 
(3) 
BLAST# 
151 
IOCS16# 
1016# 
MADE24 
(2) 
(3) 


152 
MEMCS16# 
EX3U 
(3) 
(2) 
(3) 


153 
NOWS# 
NOWS# 
(3) 
IRDY# 
BRDY# 
155 
10CHRDY 
EXRDY 
(3) 
(2) 
LRDY# 
156 
AEN 
AEN 
SETUP# 
IDSEL 
RDYRTN# 
157 
REFRESH# 
REFRESH# 
REFRESH# 
(3) 
(3) 


159 
RESET 
RESET 
RESET 
RESET 
RESET 


NOTES: 
1. These signals share pins used for the Host Interface but are described in the Event Interface. VBUS tested on PCI only 
(not ISA). 
2. These pins are no-connects. 
3. These pins must be tied to Vss. 


- 
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Table 17. Universal Host Bus Interface Address/Data 
Slgnals-PQFP 
Package (Pin Order) 


Pin 11 
ISA 
EISA 
Micro Channel 
PCI 
VL·Bus 


89 
AD[31] 
AD[31] 
AD[31] 
AD[31] 


90 
AD[30] 
AD[30] 
AD[30] 
AD [30] 


93 
AD[29] 
AD[29] 
AD[29] 
AD [29] 


94 
AD[28] 
AD[28] 
AD[28] 
AD[28] 


95 
AD[27] 
AD[27] 
AD[27] 
AD[27] 


97 
AD [26] 
AD [26] 
AD [26] 
AD [26] 


99 
AD[25] 
AD[25] 
AD[25] 
AD[25] 


100 
AD[24] 
AD[24] 
AD[24] 
AD[24] 


101 
A[23] 
AD[23] 
AD[23] 
AD[23] 
AD[23] 


102 
A[22] 
AD[22] 
AD[22] 
AD[22] 
AD[22] 


104 
A[21] 
AD[21] 
AD[21] 
AD[21] 
AD[21] 


105 
A[20] 
AD[20] 
AD[20] 
AD[20] 
AD[20] 


106 
A[19] 
AD[19] 
AD[19] 
AD[19] 
AD[19] 


107 
A[18] 
AD[18] 
AD[18] 
AD[18] 
AD[18] 


108 
A[17] 
AD[17] 
AD[17] 
AD[17] 
AD[17] 


109 
A[16] 
AP[16] 
AD[16] 
AD[16] 
AD[16] 


110 
AD[15] 
AD[15] 
AD[15] 
AD[15] 
AD[15] 


111 
AD[14] 
AD[14] 
AD[14] 
AD[14] 
AD[14] 


113 
AD[13] 
AD[13] 
AD[13] 
AD[13] 
AD[13] 


114 
AD[12] 
AD[12] 
AD[12] 
AD[12] 
AD[12] 


115 
AD[11] 
AD[11] 
AD[11] 
AD[11] 
AD[11] 


116 
AD[10] 
AD[10] 
AD[10] 
AD[10] 
AD[10] 


117 
AD[9] 
AD[9] 
AD[9] 
AD[9] 
AD[9] 


118 
'AD[8] 
AD[8] 
AD[8] 
AD[8] 
AD[8] 


119 
AD[7] 
AD[7] 
AD[7] 
AD[7] 
AD[7] 


120 
AD[6] 
AD[6] 
AD[6] 
AD[6] 
AD[6] 


122 
AD[5] 
AD[5] 
AD[5] 
AD[5] 
AD[5] 


123 
AD[4] 
AD[4] 
AD[4] 
AD[4] 
AD[4] 


124 
AD[3] 
AD[3] 
AD[3] 
AD[3] 
AD[3] 


125 
AD[2] 
AD[2] 
AD[2] 
AD[2] 
AD[2] 


126 
AD[1] 
AD[1] 
AD[1] 
AD{1] 
AD[1] 


127 
AD[O] 
AD[O] 
AD[O] 
AD[O] 
AD[O] 


NOTE: 
Blank pins must be tied to Vss- 
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Table 18. Other Slgnals-PQFP 
Package 
(Pin Order) 


Pin 
Name 


1 
Vss 


2 
MO[38] 


3 
Vss 


4 
MO[37] 


5 
MO[36] 


6 
MO[35] 


7 
MO[34] 


8 
Vss 


9 
MO[33] 


10 
MO[32] 


12 
MCLK 


14 
MA[O] 


15 
MA[1] 


16 
MA[2] 


17 
MA[3] 


18 
MA[4] 


19 
MA[5] 


20 
MA[6] 


21 
MA[7] 


22 
MA[8] 


23 
MA[9] 


24 
MPRQ[O] 


25 
MPRQ[1] 


26 
GRANT 


27 
OSF 


26 
TRG/OE# 


29 
WE#[O] 


30 
WE#[1] 


31 
WE#[2] 


32 
WE#[3] 


33 
WE#[4] 


34 
WH[5] 


35 
WE#[6] 


36 
WE#[7] 


37 
CAS # [0] 


38 
CAS#[1] 


NOTE: 
"The 
reserved 
pin must be tied to Vss. 


Pin 
Name 


39 
CAS # [2] 


40 
CAS # [3] 


41 
RAS# 


42 
SLOATA# 


43 
SLCLK 


44 
MO[O] 


45 
MO[1] 


46 
Vss 


47 
MO[2] 


~8 
MO[3] 


49 
MO[4] 


50 
Vss 


51 
MO[5] 


52 
Vss 


53 
MO[6] 


54 
Vcc 


55 
MO[7] 


57 
MO[8] 


58 
MO[9] 


59 
MO[10] 


60 
Vcc 


61 
MO[11] 


62 
MO[12] 


63 
MO[13] 


64 
Vss 


65 
Vss 


66 
MO[14] 


67 
MO[15] 


68 
Vss 


69 
MO[16] 


70 
MO[17] 


71 
MO[18] 


72 
Vcc 


73 
MO[19] 


74 
MO[20] 


75 
MO[21] 


Pin 
Name 


76 
MO[22] 


77 
MO[23] 


78 
MO[24] 


79 
MO[25] 


80 
Vss 


81 
Vss 


82 
MO[26] 


83 
MO[27] 


84 
Vcc 


85 
MO[28] 


86 
MO[29] 


87 
MO[30] 


88 
MO[31] 


91 
Vss 


92 
VSS 


96 
Vcc 


98 
Vss 


103 
Vss 


112 
Vss 


121 
Vss 


128 
reserved" 


148 
Vss 


150 
Vcc 


154 
Vss 


158 
Vcc 


160 
MO[63] 


161 
MO[62] 


162 
Vss 


163 
MO[61] 


164 
MO[60] 


165 
MO[59] 


166 
MO[58] 


167 
Vss 


168 
Vss 


169 
MO[57] 


170 
Vss 
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Table 18. Other Slgnals-PQFP 
Package 
(Pin Order) (Continued) 


Pin 
Name 


171 
MD[56] 


172 
MD[55] 


173 
MD[54] 


174 
Vcc 


175 
MD[53] 


176 
MD[52] 


177 
MD[51] 


178 
Vss 


179 
MD[50] 
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Pin 
Name 


180 
MD[49] 


181 
MD[48] 


182 
Vss 


183 
MD[47] 


184 
MD[46] 


185 
Vss 


186 
Vss 


187 
MD[45] 


188 
MD[44] 


Pin 
Name 


189 
MD[43] 


190 
Vcc 


191 
MD[42] 


192 
MD[41] 


193 
MD[40] 


194 
Vss 


195 
MD[39] 


196 
Vcc 
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Table 19. Other Slgnals-PQFP 
Package 
(Name Order) 


Name 
Pin 


CAS # [3] 
40 


CAS # [2] 
39 


CAS#[1] 
38 


CAS#[O] 
37 


DSF 
27 


GRANT 
26 


MA[9] 
23 


MA[8] 
22 


MA[7] 
21 


MA[6] 
20 


MA[5] 
19 


MA[4] 
18 


MA[3] 
17 


MA[2] 
16 


MA[1] 
15 


MA[O] 
14 


MCLK 
12 


MD[63] 
160 


MD[62] 
161 


MD[61] 
163 


MD[60] 
164 


MD [59] 
165 


MD[58] 
166 


MD [57] 
169 


MD[56] 
171 


MD[55] 
172 


MD [54] 
173 


MD[53] 
175 


MD [52] 
176 


MD[51] 
177 


MD[50] 
179 


MD[49] 
180 


MD[48] 
181 


MD[47] 
183 


MD[46] 
184 


MD[45] 
187 


NOTE: 
"The 
reserved 
pin must be left unconnected. 


Name 
Pin 


MD[44] 
188 


MD[43] 
189 


MD[42] 
191 


MD[41] 
192 


MD[40] 
193 


MD[39] 
195 


MD[38] 
2 


MD[37] 
4 


MD[36] 
5 


MD[35] 
6 


MD[34] 
7 


MD[33] 
9 


MD[32] 
10 


MD[31] 
88 


MD[30] 
87 


MD[29] 
86 


MD[28] 
85 


MD[27] 
83 


MD[26] 
82 


MD[25] 
79 


MD[24] 
78 


MD[23] 
77 


MD[22] 
76 


MD[21] 
75 


MD[20] 
74 


MD[19] 
73 


MD[18] 
71 


MD[17] 
70 


MD[16] 
69 


MD[15] 
67 


MD[14] 
66 


MD[13] 
63 


MD[12] 
62 


MD[11] 
61 


MD[10] 
59 


MD[9] 
58 


Name 
Pin 


MD[8] 
57 


MD[7] 
55 


MD[6] 
53 


MD[5] 
51 


MD[4] 
49 


MD[3] 
48 


MD[2] 
47 


MD[1] 
45 


MD[O] 
44 


MPRO[1] 
25 


MPRO[O] 
24 


RAS# 
41 


reserved" 
128 


SLDATA# 
42 


SLCLK 
43 


TRG/OE 
28 


Vcc 
54 


Vcc 
60 


Vcc 
72 


Vcc 
84 


Vcc 
96 


Vcc 
150 


Vcc 
158 


Vcc 
174 


Vcc 
190 


Vcc 
196 


Vss 
1 


Vss 
3 


Vss 
8 


Vss 
46 


Vss 
50 


Vss 
52 


Vss 
56 


Vss 
64 


Vss 
65 


Vss 
68 
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Vss 


1-322 


148 
Vss 
WH[O] 
29 
186 
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3.4 
Mechanical Data 
TA (the ambient temperature) can be calculated 
from (JCA(thermal resistance from case to ambient) 
with the following equation: 
Refer to Intel's Packaging Handbook, (Order No. 
240800), for detailed information on packaging of 
the 82750PD. 


where P is the power dissipated. 
3.5 
Package Thermal Specifications 


The 82750PD is specified for operations when TC 
(the case temperature) is within the range of O·Cto 
85·C. TC may be measured in any environment to 
determine whether the 82750PD is within specified 
operation range. The case temperature should be 
measured at the center of the top surface. 


Table 20. Thermal 
Resistance 
("C/W) 


Typical values for (JCAat various airflows are given 
in Table 20 for the 196-lead PQFP package. Table 
21 shows the maximum TA allowable (without ex- 
ceeding Tcl at various airflows. 


(JCAvs Airflow-ft/min 
(m/sec) 


Package 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


196-Lead 
19.5 
14.5 
11.5 
9.5 
8.5 
8.1 


TA vs Alrflow-ft/mln 
(m/sec) (O·C) 


Package 
MCLK Frequency 
0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


196-Lead 
50 
46 
56 
62 
66 
68 
69 


Table 21. Maximum 
TA at Various Airflows ("C) 
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4.0 
ELECTRICAL 
CHARACTERISTICS 


4.1 
Absolute Maximum Ratings 


Case Temperature 
under Bias 
- 65·C to + 11O·C 
, 


Storage Temperature 
- 65·C to + 150·C 


Voltage 
on Any Pin 
with Respect 
to Vss 
-O.5V 
to vcc +0.5V 


Supply Voltage 
with Respect 
to VSS 
- 0.5V to + 6.5V 


4.2 
Operating Conditions 


intel~ 


NOTICE: This data sheet 
contains 
information 
on 
products in the sampling and initial production phases 
of development. 
The specifications 
are subject 
to 
change 
without 
notice. 
Verify with your local 
Intel 
Sales office that you have the latest data sheet be- 
fore finalizing a design. 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause permanent 
damage. 
These are stress ratings 
only. Operation 
beyond 
the 
"Operating 
Conditions" 
is not recommended 
and ex- 
tended exposure 
beyond 
the 
"Operating 
Conditions" 
may affect device reliability. 


Table 22. Operating 
Conditions 


Symbol 
Parameter 
Mln 
Max 
Units 


TCASE 
Temperature 
0 
85 
·C 


VCC 
Supply Voltage 
4.75 
5.25 
V 


fpCI 
PCI Host Interface 
Frequency 
33 
MHz 


fVL 
VL·Bus Host Interface 
Frequency 
33 
MHz 


fMCLK 
SFBI Interface 
Frequency 
50 
MHz 


4.3 
Recommended Connections 


Power 
and 
ground 
connections 
must 
be made 
to 
multiple 
VCC and Vss 
(GND) pins. Every 82750PD· 
based circuit board should include 
power (Vcc) 
and 
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ground 
(VSS) planes 
for 
power 
distribution. 
Every 
vcc pin must be connected 
to the power plane and 
every 
Vss 
pin must 
be connected 
to the 
ground 
plane. Unused 
pins should 
not be connected. 


82750PD 


4.4 
DC Specifications 


Table 
23. DC Characteristics 
(over 
specified 
operating 
conditions) 


Symbol 
Parameter 
Conditions 
Mln 
Typ 
Max 
Units 


VIL 
Input Low Voltage 
0.8 
V 


VIH 
Input High Voltage 
2.0 
vcc +0.5 
V 


VOL 
Output Low Voltage 
IOL = Rated Buffer Current" 
0.2 
0.4 
V 


VOH 
Output High Voltage 
IOH = Rated Buffer Current" 
2.4 
3.4 
V 


VT+ 
Schmitt 
Trigger 
+ ev Threshold 
2.0 
2.4 
V 


VT- 
Schmitt Trigger 
-ve 
Threshold 
0.6 
0.8 
V 


OLl 
Output Leakage Current 
Vss < VIN < vcc 
0 
±10 
/A-A 


III 
Input Leakage Current 
Vss < VIN < vcc 
0 
±10 
/A-A 


Ice 
Active Mode Current 
fMCLK = Max 
300 
mA 


VCC = Max 


Cs 
Pin Capacitance 
Any Pin to Yss 
10 
pF 


'See Table 25. 


Table 
24. Additional 
PCI Class 11DC Characteristics 
(over 
specified 
operating 
conditions) 


Symbol 
Parameter 
Conditions 
Mln 
Typ 
Max 
Units 


lodAC) 
Switch Low Current 
VOUT = 1.4V 
95 
mA 


IOH(AC) 
Switch High Current 
VOUT = 2.2V 
-44 
mA 


Table 
25. Pin Type 
Rated 
Buffer 
Currents 


Type 
Description 


1 
TTL Input Buffer 


2 
TTL Input Buffer. Schmitt Triggered 


3 
TTL Bi-Directional 
Buffer, IOL = IOH = 4 mA 


4 
TTL Bi-Directional 
Buffer, IOL = IOH = 6 mA 


5 
TTL Bi-Directional 
Buffer, IOL = IOH = 18 mA 


6 
TTL Bi-Directional 
Buffer, IOL = IOH = 24 mA 


7 
TTL Bi-Directional 
Buffer, Schmitt Triggered, 
IOH = 24 mA 


8 
TTL Bi-Directional, 
IOL = 6 mA, IOH = 2 mA, PCI Class 11 


9 
TTL Three-State 
Output Buffer, IOL = IOH = 4 mA 


10 
TTL Three-State 
Output Buffer, IOL = IOH = 6 mA 


11 
TTL Three-State 
Output Buffer, IOL = IOH = 12 mA 


12 
TTL Three-State 
Output Buffer, IOL = IOH = 18 mA 


13 
TTL Three-State 
Output Buffer, IOL = IOH = 24 mA 


14 
TTL Three-State 
Output Buffer, IOL = 6 mA, IOH = 2 mA, PCI Class 11 
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Figure 
4. AC Waveform 
Test Points 


4.5 AC Characteristics 


4.5.1 
UNIVERSAL 
HOST 
BUS INTERFACE 
TIMINGS 


Table 
26. Host Bus Tlmlngs-ISA 


Symbol 
Description 
Mln (ns) 
Max (ns) 


T1 
REFRESH# 
Setup Time to Memory Command 
Active 
30 


T2 
REFRESH # Hold Time from Memory Command 
Inactive 
10 


T3 
SBHE # Setup Time to Memory Command 
Active 
10 


T4 
SBHE # Hold Time from Memory Command 
Inactive 
10 


T5 
Address 
[16:0] Setup Time to Memory Command 
Active 
10 


T6 
Address 
[16:0] 
Hold Time from Memory Command 
Active 
10 


T7 
Read Data Valid from IOCHRDY 
Active 
20 


T8 
Read Data Invalid from MEMR# 
Inactive 
0 


T9 
Write Data Setup Time to MEMW # Inactive 
20 


T10 
Write Data Hold Time from MEMW# 
Inactive 
10 


T11 
A[23:17] 
Setup Time to BALE Inactive 
10 


T12 
A[23:17] 
Hold Time from BALE Inactive 
5 


T13 
MEMCS16# 
Active from Address 
[23:17] 
Valid 
20 


T13a 
MEMCS16# 
Active from Address 
[16:0] Valid 
10.5 


T14 
MEMCS16# 
Inactive from Address 
[23:17] 
Invalid 
20 


T14a 
MEMCS16# 
Inactive from Address 
[16:0] Invalid 
10.5 


T15 
IOCHRDY 
Inactive from Memory Command 
Active 
13 


T16 
EN#[1,0], 
DIR, RENA Active from Memory Command 
Active 
20 


T17 
EN # [1,0], DIR, RENA Inactive from Memory Command 
Inactive 
5 
20 


T18 
NOWS# 
Delay from MEMW# 
Command 
10 
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Figure 5. Host Bus Tlmlngs--ISA 
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Symbol 
Description 
Mln (ns) 
Max(ns) 


T1 
BLCKPeriod 
120 


T2 
BCLK High Time 
54 


T3 
BCLK Low Time 
54 


T4 
START # Setup Time to BCLK Rising 
30 


T5 
START # Hold Time from BCLK Rising 
2 


T6 
M/IO#, 
W/R# 
Setup Time to BCLK Rising 
30 


T7 
M/IO#, 
W/R# 
Hold Time from BCLK Rising 
8 


T8a 
Address 
Setup Time to BCLK Falling (Burst) 
20 


T8b 
Address Setup Time to BCLK Rising 
20 


T8c 
BE # [3:0] Setup Time to BCLK Rising 
5 


T9a 
Address 
Hold Time from BCLK Falling (Burst) 
5 


T9b 
Address 
Hold Time from BCLK Rising 
10 


T9c 
BE # [3:0] Hold Time from BCLK Rising 
2 


T10a 
Write Data Setup Time to BCLK Rising (Burst) 
30 


T10b 
Write Data Setup Time to BCLK Falling 
10 


T11a 
Write Data Hold Time from BCLK Rising (Burst) 
5 


T11b 
Write Data Hold Time from BCLK Falling 
10 


T12 
Read Data Valid from BCLK Rising 
30 


T13 
Read Data Invalid from CMD# 
Inactive 
2 


T14 
EXADY Inactive from BCLK Rising 
20 


T15 
EXRDY Active from BCLK Falling 
20 


T16 
EX32#, 
NOWS# 
Active from Address Valid 
30 


T17 
EX32 #, NOWS# 
Inactive from BCLK 
70 


T18 
EN# [23,1,0], 
DIR, RENA Active from BCLK Rising 
30 


T19 
EN#[23,1,O], 
DIR, RENA Inactive from BCLK Falling 
30 


Table 
27. Host Bus Timings-EISA 
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Figure 6. Host Bus Tlmlngs-EISA 
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Table 28. Host Bus Timings-Micro 
Channel 


Symbol 
.Description 
Mln (ns) 
Max (ns) 


T1 
50#,51 
#, MIIO#, 
REFRESH#, 
MADE24 
30 
Setup Time to CMD# 
Active 


T2 
50#,51 
#, M/IO#, 
REFRESH#, 
MADE24 
10 
Hold Time from CMD# 
Active 


T3 
Address 
Setup Time to CMD# 
Active 
20 


T4 
Address 
Hold Time from CMD# 
Active 
10 


T5 
Write Data Setup Time to CMD# 
Inactive 
20 


T6 
Write Data Hold Time from CMD# 
Inactive 
10 


T7 
Read Data Valid to CHRDY Active 
20 


T8 
Read Data Invalid from CMD# 
Inactive 
0 
20 


T9 
SFDBK# 
Active from Address Valid 
30 


T10 
SFDBK# 
Inactive from CMD# 
Active 
0 
10 


T11 
CHRDY Active from Address Valid 
20 


T12 
EN# [1,0], DIR, RENA Active from CMD# 
Active 
20 


T13 
EN# [1,0], DIR, RENA Inactive from CMD# 
Inactive 
20 


Ct.lD# 


SO#, 
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T2 


t.I/IO#, 
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& 
~ 
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~ 
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I 
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1 
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1 
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Figure 7. Host Bus Tlmlngs-Mlcro 
Channel 
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Symbol 
Parameter 
33-10 
Units 
Notes 


TI 
ClK 
to Signal Valid Delay (max) 
Class 11-11 
ns 
CL = 50 pF 


T2 
ClK 
to Signal Invalid Delay (min) 
2 
ns 


T3 
Output Hi-Z to Active from ClK 
(min) 
2 
ns 


T4 
Output Active to Hi-Z from ClK 
(max) 
28 
ns 
Minimum = Tval min 


T5 
Input Signal Valid Setup Time before ClK 
(min) 
7 
ns 


T6 
Input Signal Hold Time from ClK 
(min) 
0 
ns 


Table 
29_ Host 
Bus Timings-PCI 


r--- 
II 


,. 


; 
)~:--------------- 
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Figure 
8_ Host Bus Timings-PCI 
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Table 
30. Host Bus Tlmlngs-VL-Bus 


Symbol 
Description 
Mln (ns) 
Max (ns) 


T1 
LCLK Period 
30 


T2 
LCLK High Time 
14 


T3 
LCLK Low Time 
14 


T4 
LADS 11 Setup Time to LCLK 
7 


T5 
LADS 11 Hold Time from LCLK 
3 


T6 
MIIOII, 
D/CII, 
W/RII, 
BEll [3:0] Setup Time to LCLK 
7 


T7 
M/lOll, 
D/CII, 
W/RII, 
BEll [3:0] Hold Time from LCLK 
3 


T8 
Address 
Setup Time to LCLK 
2 


T9 
Address 
Hold Time from LCLK 
1 


T10 
Read Data Valid from LCLK 
10 
15 


T11 
Read Data Float from LCLK 
2 


T12 
Write Data Setup Time to LCLK 
2 


T13 
Write Data Hold time from LCLK 
1 


T14 
LDEV 11 Active from Address 
Valid 
20 


T16 
LDEV 11 Inactive from LCLK 
20 


T17 
LRDY 11, BRDY 11 Inactive from LCLK 
10 


T18 
LRDY 11, BRDY 11 Active from LCLK 
10 


T19 
LRDY 11, BRDY 11 Float from LCLK 
10 


T20 
ENII, 
RENA, DIR Active from LCLK 
10 


T21 
EN 11, RENA, DIR Inactive from LCLK 
10 


, 
Tl 
, 
;+--, 
1121 
I 


"'-, 
1 
LCLK 


:r.: T5 
I 
: 
: 


~I 
I 
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1 
1 


, T6 :: 
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• 
I 
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I 
k 
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1 
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I' 
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I 
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11 
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: 
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: 
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I 
I 
I 
I 
I 
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Figure 
9. Host Bus Timlngs-VL-Bus 
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4.5.2 
SHARED 
FRAME 
BUFFER 
INTERCONNECT 
TIMINGS 


Table 
31. Shared 
Frame 
Buffer 
Interconnect 
Timings 


Symbol 
Description 
Mln (ns) , 
Max (ns) 


T1 
MCLK Period 
20 


T2 
MCLK High Pulse 
8 
13.2 


T3 
MCLK Low Pulse 
8 
13.2 


T4 
MCLK to RAS# 
Delay 
0 
22 


T5 
MCLK to CAS# 
Delay 
0 
30 


T6 
Read Data to CAS # Setup 
1 


T7 
MCLK to OE # Delay 
0 
40 


T8 
MCLK to WE # Delay 
0 
35 


T9 
MCLK to Address 
Delay 
2 
31 


T10 
MCLK to Write Data Delay 
2 
35 


T11 
RESET High Pulse Width 
92T1 


Td 
Delta of h> I or I> h of Any Signal 
2 
(i.e., (HL) = TdRAS(LH) 
± Td) 


• 
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1T2 
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I 
I 
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Figure 
10. Shared 
Frame 
Buffer 
Interconnect 
Timings 
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Table 32. Shared Frame Buffer Interconnect 
Arbitration 
Timings 


Symbol 
Description 
Min (ns) 
Max(ns) 


T1 
MPRQ Valid from MCLK 
16 


T2* 
Grant Setup Time to MCLK 
5 


'This timing need only be met to guarantee max GRANT to RASlI' delay Illustrated In the bus waveform figures found In 
Section 6. 


WCLK 


---: 
'- 
Tl 
I 


WPRQ 
11 ;X,, 
I 
I 
III1I 
GRANT 
••• 
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Figure 11. Shared Frame Buffer Interconnect 
Arbitration 
Timings 


4.5.3 
EVENT 
INTERFACE 
TIMINGS 


Table 33. Event Synchronization 
Bus Timings 


Symbol 
Description 
Mln (ns) 
Max (ns) 


T1 
SLCLK Period 
120 
125 


T2 
SLCLK Low Pulse 
48 


T3 
SLCLK High Pulse 
48 


T4 
SLDATA# 
Valid from SLCLK 
20 


T5 
SLDAT A # Float from SLCLK 
20 


T6 
SLDATA # Setup Time to SLCLK 
20 


T7 
SLDATA# 
Hold Time from SLCLK 
10 
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Figure 
12. SynchroLlnk 
Timings 


Table 34. VBUS Timings 


Symbol 
Parameter 
Mln (ns) 
Max(ns) 


T1 
VBUS Setup Time to MCLK 
10 


T2 
VBUS Hold Time after MCLK 
10 


NOTE: 
VBVS tested 
on PCI only. 


Figure 
14. AC Test 
Load Circuit 
272341-5 
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Figure 
13. VB US Timings 


Pin~ 
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272341-21 


1-335 


•• 


82750PD 


5.0 
RESET CONFIGURATION 


The 82750PD is configured during reset. The config- 
uration options include selecting the universal host 
bus interface type, 1/0 Base address, Shared Frame 
Buffer Interconnect memory type, and device ID for 
the 82750PD. This section outlines the reset pin as- 
signments for device configuration. Each option is 
described. The reset sequence is also defined. 


5.1 
Reset Input Pin Function 


The following tables describe the pins used to con- 
figure the 82750PD into the proper operating condi- 
tion. 


Table 35. 82750PD Reset Configuration 
Inputs 


Reset Setup Function 82750PDPln 
Value 


BUSTYP[2:0] 
MD[2:0] 


Reserved 
MD[3] 
Must Be High 


IOBASE[8:0] 
MD[12:4] 


Reserved 
MD[13] 
Must Be Low 


MSLOT16 
MD[14] 


MEMTYPE 
MD[15] 


Reserved 
MD[16] 
Must Be Low 


DEVID[1:0] 
MPRQ[1:0] 


DEVID[2] 
GRANT 


BUSTYP[2:0] (Host Bus Type)-These 
decoded 
bits are listed in Table 36. 


intel~ 


IOBASE[8:0] (1/0 Base Address)- 
The configura- 
tion for these pins directly correlates (MSB through 
LSB) to the I/O base address which can read and 
write through I/O to the 82750PD. For example, the 
base address of 2EAH would set IOBASE[8:0] to 
175H (right shift 1 bit). See the 82750PD Program- 
mer's Reference Manual (Order No. 272352) for de- 
tails. 


MSLOT16 (Micro Channel Slot, 16 Blt)-This 
pin is 
used if the BUSTYP is 2 (Micro Channel). If this pin 
is pulled high it will indicate that the 82750PD is con- 
figured to a 16-bit Micro Channel bus. A low on this 
pin will configure the 82750PD for the 32-bit Micro 
Channel bus. 


MEMTYPE (Memory Type)- 
This pin indicates the 
access method for the memory type being used. A 
low indicates symmetric RAMs (9 row/9 column) are 
used in the Shared Frame Buffer. A high indicates 
asymmetric RAMs (110 row/8 column) are used in 
the Shared Frame Buffer. 


DEVID[2:0] (Device ID)-These 
bits identify which 
device the 82750PD is configured as in the system. 
Pulling all these pins low configures the 82750PD to 
"device 0", setting these pins to the value 1 (Low, 
Low, High) configures the 82750PD to "device 1" 
and so on. There is a total of eight device ID options. 


Reserved-These 
pins are reserved and should be 
pulled to the value indicated. 


Table 36. Bus Type Field Decode 


UHBI Configuration 
Number 
BUSTPY[2] 
BUSTYP[1] 
BUSTYP[O] 


EISA 
0 
Low 
Low 
Low 


ISA 
1 
Low 
Low 
High 


Micro Channel 
2 
Low 
High 
Low 


Reserved 
3 
Low 
High 
High 


PCI 
4 
High 
Low 
Low 


Reserved 
5 
High 
Low 
High 


Reserved 
6 
High 
High 
Low 


VESA 
7 
High 
High 
High 
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5.2 
Reset Initialization Sequence 


The 82750PD 
is initialized 
as a result of the RESET 


input being 
asserted. 
The initialization 
process 
has 
several 
phases 
as shown 
in the timing diagram 
be- 
low. 


82750PD 


Table 
37. Reset 
Sequence 
Timing 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


T1 
Reset Recognition 
Time 
67 
67 
MCLKs 
1,2,3 


T2 
Reset Hold Time 
25 
3 


T3 
Configuration 
Input Setup Time 
5 
MCLKs 
3 


T4 
Configuration 
Input Hold Time 
1 
MCLKs 


NOTES: 
1. The RESET 
input is asynchronous 
to MCLK, 
however 
RESET 
must meet setup 
and hold times 
with respect 
to MCLK 
to 
guarantee 
recognition 
in a particular 
clock 
(see AC Characteristics). 


2. Once 
RESET 
is recognized 
in a particular 
clock, 
there 
is an internal 
delay, 
before 
it takes 
effect. 


3. The minimum 
RESET 
pulse width 
is given by T1 + T2. 


State 
x 
X 
RSl 
RS2 
h-V\.-V\.- ••• V\.-V\.-V\.- ••• h-h- •••r-:r»:r-:r-:r-: 


~ 
••• 
••• 
•••- 


Tl 
T2 
- 
••• 
••• 
••• 
'"-- 


le: 
Configuration In uls 


T31 
1 
T4 


x 
x 
x 


MCLK 


RESET 


OUTPUTS 


GRANT, 
1010[16:0), 


MPRQ[I,O) 
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Figure 
15. Reset 
Initialization 
Sequence 


During the first phase 
(T1) of the initialization 
pro- 
cess the RESET input is synchronized 
to MCLK. Ad- 
ditionally, 
the RESET input is "filtered." 
To prevent a 


"false 
reset" 
from occurring, 
the 82750PD 
ensures 
that 
the RESET 
signal 
has been active 
for a mini- 
mum amount 
of time (T1) before 
recognizing 
an ac- 


tive RESET as valid. If the RESET input goes inac- 
tive before 
the minimum 
recognition 
time 
(T1), the 
RESET activation 
will be ignored. 


If a valid reset is recognized, 
the 82750PD 
will pro- 
ceed 
to the second 
phase 
(T2) of the initialization 
process. 
During this phase, the internal 
state of the 
82750PD 
is initialized 
and ALL outputs 
are floated. 
During the third 
phase 
(T3) the reset configuration 


pins are sampled 
as initialization 
parameters 
(see 


Section 
5.1 for details). These configuration 
parame- 


ters will be latched when RESET is deactivated 
(fail- 


ing edge). The duration 
of T1 is not tested. 
Only the 


total duration 
T1 + T2 is tested. 


6.0 
BUS WAVEFORMS 


This section 
supplements 
the information 
in Section 


4. The waveforms 
here do not replace 
the informa- 


tion in Section 
4. The figures in this section 
illustrate 


the relationships 
between 
the bus signals for the PCI 


interface 
and the signals 
for the SFBI. Durations 
of 


the SFBI signals are listed in terms of MCLK periods. 
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Figure 16a. PCI Waveforms: 
Basic Read Bus Transaction 
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Figure 16b. PCI Waveforms: 
Basic Write Bus Instruction 
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NOTES: 
1. Due to the series 
resistor 
on 10SEL the Address 
setup will be required 
during 
BCLK 
#2 
for the 82750PO, 
2. TROY# 
will be driven 
low by device 
100. 


Figure 16c. PCI Waveforms: 
Configuration 
Bus Transaction 
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Table 
38. Shared 
Frame 
Buffer 
Interconnect 
Waveforms 
(Non-Page 
Mode Cycle 
Timings) 


Symbol 
Description 
MCLK Periods 


T1 
RAS # Pulse Width 
4 


T2 
RAS # Precharge 
Time 
3 


T3 
RAS # to CAS # Delay Time 
2 


T4 
CAS # Pulse Width 
2 


T5 
OE # to CAS # Setup Time 
1 


T6 
CAS # to OE # Delay 
0 


T7 
Write Command 
Setup Time 
1 


Ta 
Write Command 
Hold Time 
2 


T9 
Row Address 
Setup Time 
1 


T10 
Row Address 
Hold Time 
1 


T11 
Column Address 
Setup Time 
1 


T12 
Column Address 
Hold Time 
2 


T13 
Data Out Delay Time 
2 


T2 
-I- 
-, 


14 
I 
-I 


T8 


WRITECYCLE ONLY 


T9 
I 
11 
ADDR_X~ __ ~R~OW~AO~O~R[~S~S g~~C~OL~U~~~N~AO~O~RE~S~S~IX~ 
_ 
I 
11 
I 
T13 
11' 


I 
·-----~ll 
WDATA~I 
X~__~O~AT~A 
__ ~~X~ 
_ 
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Figure 
17. Shared 
Frame 
Buffer 
Interconnect 
Waveforms 
(Non-Page 
Mode Cycle 
Timings) 
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Table 39. Shared 
Frame Buffer Interconnect 
Waveforms 
(Fast Page Mode Cycle Timings) 


Symbol 
Description 
MCLKCycles 


T1 
RAS# 
to CAS# 
Delay Time 
2 


T2 
RAS# 
Precharge 
Time 
3 


T3 
CAS# 
Pulse Width 
2 


T4 
CAS# 
Pulse Width in Page Cycle 
1 


T5 
CAS # Precharge 
Time 
1 


T6 
OE # to CAS #" Setup Time 
1 


T7 
CAS #" to OE # Delay 
0 


T8 
Write Command 
Setup Time 
1 


T9 
Write Command 
Hold Time 
2 


T10 
Write Command 
Setup Time in Page Cycle 
1 


T11 
Write Command 
Hold Time in Page Cycle 
1 


T12 
Row Address 
Setup Time 
1 


T13 
Row Address 
Hold Time 
1 


T14 
Column Address 
Setup Time 
1 


T15 
Column Address 
Hold Time 
2 


T16 
Data in Hold Time 
2 


T17 
Data in Hold Time in Write Page Cycle 
1 


T18 
Access Time from Column Address 
3 


T19 
Access Time from CAS #" in Read Page Cycle 
1 


T20 
Access Time from Row Address 
7 


T21 
Access Time from CAS # 
2 


T22 
Access Time from Column Address 
in Read Page Cycle 
2 


T23 
Access Time from RAS#" 
3 
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Figure 18. Shared Frame Buffer Interconnect 
Waveforms 


Table 40. Shared Frame Buffer Interconnect 
Arbitration 
Timings 
(Fast Page Mode Cycle Timings) 


Symbol 
Description 
Mln (ns) 
Max(ns) 


T1 
GRANT to RAS# 
Active 
2TC 


T2 
Request 
Release to GRANT 
Inactive 
2TC 


WClK 
••• 


WA[9:0] 
( 
> 
CAS# 
VALID 
••• 
WE#[7:0] 


RASII 
, 
••• 
r 
., 
I 
I 
I 
I 
g 
hi PRO llX 
Dl 
I 
••• 
01 
11 
I 
I 
I- 
Tl 
I 
~ 
I 
I 
GRANT 
IIII 
••• 
I 
I 
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Figure 19.Shared Frame Buffer Interconnect 
Arbitration 
Timings 


1-341 


Using the 82750PD in an 
ATI/ISA Implementation 


AL WEIDNER 
TECHNICAL 
MARKETING 
ENGINEER 


October 
1993 


1-342 I 


Order Number: 272366-001 


3.0 COMPONENT 
DESCRiPTION 
1-347 


3.1 Intel82750PD 
1-347 


3.2 ATI68800DX 
1-347 


3.3 ATI68890A 
1-347 


4.0 SCHEMATIC 
ROAD MAP 
1-347 


4.1 Schematic 
Description 
1-347 


4.2 Sheet 1-ISA 
Bus Interface 
1-348 


4.3 Sheet 2-Graphics 
Processor and 
BIOS ROMs 
1-348 


I 


4.11 Sheets 10 and 11-Resistors, 


Capacitors, 
and Header Pins 
1-349 


5.0 PAL EQUATIONS 
1-349 


6.0 JUMPER 
DEFINITIONS 
1-349 


7.0 CONNECTOR 
DEFINITIONS 
1-350 


8.0 SCHEMATICS 
1-351 


9.0 BILL OF MATERIALS 
1-362 


1-343 


AP-491 


1.0 
INTRODUCTION 


The 82750PD Board is an ISA bus-based demonstra- 
tion vehicle that, 
in addition 
to high performance 
graphics acceleration, includes real-time Indeof 
video 
capture/compression 
and accelerated video decompres- 


sion/playback. 
This configuration and the associated 


software drivers fully supports Microsoft's Video for 
Windows" and Intel's Indeo Video 'Technology. The 
board's major components include: 


Intel 82750PD Video Processor 
ATI 68800DX Graphics Controller 
ATI 68890A Video Capture Controller 
4 MB VRAM (max) 
4 MB DRAM (max) 


This material is intended purely as a reference, not as a 
production ready design. The design includes compo- 
nent options (PLL vs oscillators, individual component 
reset jumpers/ etc.), that are useful for evaluation but 
would not normally be implemented in a production 
subsystem. The intent is that this be the basis for a fully 
productized design. 


Future designs which implement ditTerent host buses, 
contain enhanced functionality and are geared more 
toward production readiness will be provided as they 
become available. 


1.1 Block Diagram 


Below is the basic block diagram of the board. 


1.2 Overview 


This architecture allows each of the main components 
to be accessed by the host via the ISA bus. A common 
set of TTL butTers provides a multiplexed interface to 
the ISA bus. The components, in turn, interface to the 
shared local memory via the Shared Frame ButTer In- 
.terconnect (SFBI). The local memory is a combination 
of VRAM and DRAM. The VRAM portion makes up 
the normal graphics frame butTerfrom which all graph- 
ics and 
video information 
is displayed, 
while the 
DRAM is used for video data butTering, working stor- 
age and microcode associated with the 82750PD video 
processor's 
compression 
and 
decompression 
algo- 
rithms. This design may be considered a "single frame 
butTer" in that both the graphics and video data are 
displayed through a single RGB format contained in 
graphic VRAM. 


This board supports three distinct yet related functions: 
graphics 
processing/acceleration 
handled 
by 
the 
68800DX, video data capture handled by the 68890A 
and real-time Indeo video data compression/decom- 
pression perfofIIl;edby the 82750PD. 


In the case of graphics operations, the normal data flow 
is from the host, through the 68800DX, to the VRAM 
frame butTer for display. 


For video playback, the, compressed frame is moved 
into the DRAM memory by the host, where it is de- 
compressed by the 82750PD, then converted and scaled 
into the VRAM display butTer by the 68800DX. 


VIDEOIN---i 


NOTE: 
1. SynchroLink 
is a trademark 
of ATI Technologies, 
Inc. 
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In video capture, the uncompressed video image is 
stored in ORAM by the 68890A, reformatted by the 
688000X for display, while also being compressed in 
real-time by the 82750PO. The compressed video data 
is then read by the host for storage. 


This architecture 
provides a way to deal with video 
data so only low bandwidth, 
compressed data need- 


travel on the ISA bus. The movement of the raw, un- 
compressed video data is limited to the high speed 
SFBI. 


The following two diagrams provide data flow for play- 
back and capture. 


2.0 
COMMON INTERFACES 


Integral in this architecture are three interconnects that 
are common to the main components: the Universal 
Host Bus Interface, the Shared Frame Buffer Intercon- 
nect and the SynchroLink' 
bus. 


2.1 
Universal Host Bus Interface 
(UHBI) 


The Universal Host Bus Interface is the common con- 
nection used by each of the main components to com- 
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municate on system bus. This interface reduces the 
number of signals required to attach to the ISA bus 
from 56 to 44 by multiplexing SA[J5:0] with SO[15:0]. 
This multiplexed bus normally "passes" the address 
portion of the ISA bus so each device can "watch" for 
accesses to its unique address spaces. Using four com- 
mon control signals, the device being accessed will 
"switch" the bus from address to data while controlling 
the "direction" of the data based on the type of ISA 
cycle (read or write). 


Although 
this 
specific implementation 
is ISA bus 
based, the Universal Host Bus Interface has been de- 
signed so it may be configured to support ISA, EISA, 
Micro Channel, PCI or VL-Bus. 


2.2 
Shared Frame Buffer Interconnect 
(SFBI) 


The SFBI provides a way for the major components to 
access the shared memory. Software provides 
the memo- 
ry management 
mechanism 
to insure one device does not 
corrupt 
the other 
device's 
memory 
areas. The scheme 
aJlows large amounts of data to be shared between de- 
vices by merely passing pointers. 


DataFlow-Playback 


Off-$creen Memory CORAM) 
Per Stream 
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The SFBI is a high-speed (200 Mb/sec) 32/64-bit wide 
bus 
that 
supports 
a combination 
of DRAM 
and 
VRAM. The specification includes the definition of a 
three-signal arbitration 
interface. The board uses a 
22VlO PLD to provide that arbitration function. Once 
having been granted the SFBI, a device has full control 
of the RAS, CAS, WE, OE, address and data buses of 
the 
memory 
array. 
In 
this 
implementation, 
the 
68800DX is responsible for generating the transfer cy- 
cles to VRAM as well as the memory refresh cycles to 
the entire array. 


Note that the SFBI is intended to provide access to 
memory and not communication between devices. The 
specification defines the memory signais and mapping 
so that each device has the same view of the memory 
space. This, however, does not restrict the use of signals 
on the SFBI. Once granted the bus, the master may use 
the signals in any way that would not interfere with the 
other 
devices or their 
memory. 
The configuration 
EEPROM is an example (see Section 5.9). 


See the Shared Frame Buffer Interconnect Specification 
for complete details. 
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2.3 
SynchroLink* Bus 


The SynchroLink bus is a bit-serial bus that is designat- 
ed to facilitate synchronization between devices in the 
subsystem. The SynchroLink 
supports eight devices 
where one device, the arbiter, sources the 8 MHz clock 
and is responsible for issuing "invitations" to each de- 
vice on the bus. When accepting an invitation, a device 
can send a "service request" message, a "service com- 
plete" message or one of many "broadcast" type mes- 
sages. 


The typical utilization 
of the SynchroLink 
on this 
board involves: 


• receiving messages from the 82750PD indicating a 
frame of uncompressed data is available for display 


• signaiing from the 68890A indicating a frame ofvid- 
eo data is available to the 82750PD to compress 
and/or the 68800DX to display 
• etc. 


The use of the SynchroLink for other functions is likely 
in the future. 
Audio 
synchronization, 
single point 


I 


r~ 
IS..! ~ut'u IS a derivative of the Intel 82750PB. 


It is a programmable video processor that supports a 
wide range of compression algorithms. Here, it operates 
in conjunction with the ATI 68800DX graphics con- 
troller to provide acceleration of video decompression! 
playback and with the AT! 68890A video capture chip 
to provide real-time Indeo video compression. The host 
resident driver is responsible for uncompressed data 
movement and for loading the appropriate microcode 
routines 
in 
the 
shared 
memory 
from 
which 
the 
82750PD executes. The 82750PD provides host access 
to its internal registers and to the shared memory 
through both EMS-like memory accesses and indirect 
I/O space accesses. 


Some of the internal features of the 82750PD include: 
25 MHz single cycle execution unit, 512 x 48 instruc- 
tion RAM, 16-bit ALU, dual 16-bit internal bus archi- 
tecture, and a pixel interpolator. 


See the Intel 82750PD Video Processor Data Sheet, Or- 
der No. 272341-001, and the 82750PD Programmer's 
Reference Manual, Order No. 272352-001, for com- 
plete details. 


3.2 
ATI68800DX 


The AT! 68800DX is a high performance graphics con- 
troller designed specifically for high resolution graphics 
and multimedia applications. The 68800DX includes 
VGA and 8514/ A compatibility, 1280 x 1024 x 32 bit 
color, graphics drawing engine, H/W cursor, and linear 
addressing. Additionally, the 68800DX supports color- 
space conversion and scaling in support of the video 
acceleration. 


I 


See the AT! 68890 Data Sheet for complete details. 


4.0 
SCHEMATIC ROAD MAP 


The schematics provided in Section 9.0 of this docu- 
ment were generated with OeCad" and are described in 
Section 5.0. These schematics are currently available as 
paper copy in this document or electronically in OrCad 
format. 


.4.1 
Schematic Description 


Below is a table of the schematic sections and page 
references. 


Schematic 
Sheet 
1 
2 


10 
11 


3 
4 
5 
6 
7 


Function 
ISA Bus Interface 
68800DX 
Graphics 
Controller 
and BIOS ROM 
82750PD 
Video Processor 
68890A 
Capture Controller 
Memory Banks 1 and 2-VRAM 
Memory Banks 3 and 4-DRAM 
68860 Display Interface, 
Feature 
Connector 
Video In, AID Converters 
and 
Line Buffer Chip 
Clock Generation, 
Arbiter and 
Configuration 
ROM 
Pull-ups and Capacitors 
Test Point Headers 


8 


9 
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4.2 
Sheet 1-ISA 
Bus Interface 


Four TIL 
buffers and associated control signals are 
used to form the 16-bit multiplexed host bus AD[15:0]. 
The address lines, SA[15:0], are buffered through two 
74ALS245s that are normally enabled (DIS--...ADR# 
= High), while the data lines, SD[15:0], are buffered 
through 
two 74ALS245s that are normally disabled 
(EN-DBO# and EN-DBI # = High). 


Each device monitors 
LA[23:l7], 
SAI6, AD[15:0], 


BALE and the command 
strobes (IOR #, 
IOW #, 
MEMR # and MEMW #) to detect an access to its 
memory 
or 
I/O 
space. The 
selected device pulls 
DIS~DR 
# 
low, turning 
off the address buffers, 
and 
enables 
the 
byte-wide 
data 
bus 
transceivers 
by pulling EN_DBO# 
and/or 
EN_DB I # low. The 
data direction control (DIR) is set depending on wheth- 
er the access is a read or write. 


Each 
device 
is 
also 
responsible 
for 
controlling 
NOWS#, 
10CHRDY, 
MEMCSl6#, 
and IOCS16# 
for its particular accesses. 


4.3 
Sheet 2-Graphlcs 
Processor and 
BIOS ROMs 


The 68800DX interfaces to the UHBI, the SFBI, and 
the SynchroLink along with a set of control signals for 
the DAC. It also provides the output enables for the 
BIOS ROMs. The ROMs are addressed by CK[4:l] 
generated by the 68800DX and the ISA bus address 
lines SA[l1:l]. 
The data outputs of the ROMs are tied 
to AD[15:0] of the UHBI, which provides the path 
directly to the ISA bus and to the 68800DX. The ROM 
must be implemented as l6-bit wide even though the 
subsystem supports a configuration that appears to be 
8-bit to the host. In this case, the 68800DX handles the 
routing of the data from the high order to low order of 
the AD bus and onto SD[7:0]. 


4.4 
Sheet 3-Vldeo 
Processor 


The 82750PD attaches to the UHBI, the SFBI, and the 
SynchroLink. It utilizes the shared memory for all its 
off-chip activities and microcode storage and the Syn- 
chroLink for "signaling" between components. 


4.5 
Sheet 4-Capture 
Controller 


The 68890A capture controller attaches to the UHBI, 
the SFBI, and the SynchroLink along with a set of con- 
trol signals for the AID converters and the line buffer 
chip. 
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This page of the schematics refers to the capture con- 
troller. It also includes the series termination resistors 
for the RAM address and control signals and a set of 
jumpers that allow different VRAM/DRAM 
configu- 
rations. 


4.6 
Sheet 5-VRAM 


The design allows for two banks of VRAM. Each bank 
is implemented using four 256K x 16 parts. The ad- 
dress, control and parallel port of the VRAM make up 
its interface to the SFBI. The 68800DX is responsible 
for generating the transfer cycles while the serial port 
data accesses are controlled by the 68860 DAC and 
provides the pixel data path to the DAC. 


4.7 
Sheet 6-DRAM 


The remaining memory on the board is made up of two 
banks of DRAM. Again, 256K x 16 parts are used to 
provide up to 4MB of DRAM memory. The DRAM 
area is used mainly as microcode RAM, video data 
buffers, and working storage for the 82750PD video 
processor and the 68890A capture controller. 


4.8 
Sheet 7-DAC, 
Analog Output and 
Feature Connector 


The 68860 DAC provides direct analog RGB output to 
11, a standard 
l5-pin VGA connector. The DAC re- 


quires a separate analog voltage supply, which is pro- 
vided by a TL43lCD regulator. The schematic includes 
a second regulator which is needed only with revision 
A of the DAC. JU5, JU6, and JU7 are available so that 
the DAC may be upgraded to a newer version without 
requiring other parts to be added or removed. 


U24, 74LS15l, allows the 68800DX to read the moni- 
tor type from the connector. 


The 68800DX and the DAC support a VGA pass- 
through port for VGA compatibility mode operation. 
An 8-bit data bus P[7:0] provides this path and provide 
the data for the VGA feature connector. JU8, JU9, and 
JUW allow this port to be configured as VGA compati- 
ble or as an output-only port. 


4.9 
Sheet 8-Video 
Input and A-D 
Section 


Analog video data from composite or s-VHS sources is 
converted to digital through two Phillips video analog 
input chips. U33 (TDA8708) is used for composite vid- 
eo from inputs J4 or J5. Both U33 and U34 (TDA8709) 
are used when the S-VHS input JU3 is used. The data 


I 


from 
these 
devices 
is 
read 
by 
the 
68890A 
via 
DADC[7:0]. A Sony CXK1209Q digital delay line is 
used by the 68890A as a "line buffer" to save previous- 
ly captured video. DDAC[7:0] provides the path from 
the 68890A to the CXK1209 while it is read through 
the same path as the AID converters. 


4.10 Sheet 9-Clock 
Generation, SFBI 
Arbiter and Miscellaneous 


Two different PLLs and four discrete oscillators have 
been included to promote ease of testing and evalua- 
tions. The oscillators are used for fixed mode testing, 
while the PLLs provide the programmability required 
for multiple operating modes. 


U26 (22VIO) is programmed as the SFBI arbiter. In 
order to allow the SFBI to operate at 50 MHz, U68 
(74F174) is used to sample the state of the SFBI request 
signals from the 68800DX, the 82750PD, and the 
68890A. 


A second PAL, U49, is used to combine the interrupt 
requests from each device and drive the ISA bus IRQ. 


The equations for these devices appear in Section 6. 


A serial EEPROM (D37) is used by the 68800DX to 
store setup and configuration information. This is an 
example of a non-memory use of the SFBI. SFBI data 
bits MD[34:32] are used along with a chip select to 
transfer information. 


4.11 Sheets 10and 11-Resistors, 
Capacitors, and Header Pins 


The last two pages of the schematic contain pull-up 
resistors, by-pass capacitors and a set of header pins to 
allow easy probing of signals. 


5.0 
PAL EQUATIONS 


TBD 


6.0 
JUMPER DEFINITIONS 


Jumper definitions are as follows: 


JU1 -ISA 
Interrupt Level Selection 


JU1 
1-2 
3-4 
5-6 
7-8 
9 -10 


IRQ9 
IRQ3 
IRQ5 
IRQ10 
IRQ13 
I 
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JU5,6,7- 
Feature Connector Configuration 


JU5 
JU6 
JU7 
1 - 2 
1 - 2 
1 - 2 
68860 Rev. 1 
open 
open 
open 
68860 Rev. 1+ 


JU8,9,10- 
Feature Connector Configuration 


JU8 
JU9 
JU10 
1 - 2 
1 - 2 
1 - 2 
VGA Compatibility Mode 
2 - 3 
2 - 3 
2 - 3 
Output Only Mode 


JU11,12,13- 
Memory Configuration 


JU11 
JU12 
JU13 
Bank 1 plus 
None 
1 - 2 
open 
open 
Bank 2 
open 
1 - 2 
open 
Bank 3 
open 
open 
1 - 2 
Bank 4 
1 - 2 
3 - 4 
open 
Banks 2 and 3 
1 - 2 
open 
3 - 4 
Banks 2 and 4 
open 
1 - 2 
3 - 4 
Banks 3 and 4 
1 - 2 
3 - 4 
5 - 6 
Banks 2 and 3 and 4 


JU15-Test 


JU15 
open 
Test only - 
removes MCLK 
1 - 2 
Normal 


JU16 - 
68860 Reset 


JU16 
1 - 2 
Hold 68860 in Reset 
2 - 3 
Resets the 68860 with ISA bus reset 


JU17 - 
OSCSource 


JU17 
1 - 2 
OSC from PLL 
2 - 3 
OSC from Oscillator 


JU18 - 
68800DXReset 


JU18 
1 - 2 
Hold 68800DX in Reset 
2 - 3 
Resets the 68800DX with ISA bus reset 
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JU22 - 
82750PD 
Reset 


JU22 
1 - 2 
Hold 82750PD in Reset 
2 - 3 
Resets the 82750PD with ISA bus reset 


JU23 - 
68890A 
Reset 


JU23 
1 - 2 
Hold 68890 in Reset 
2 - 3 
Resets the 68890A with ISA bus reset 


JU24 - 
68860 
Reset 


JU24 
1 - 2 
Hold 68860 in Reset 
2 - 3 
Resets the 68860 with ISA bus reset 


1-350 


7.0 
CONNECTOR DEFINITIONS 


Connector definitions are as follows: 


J 1 - 
VGA Monitor 
- 
15-Pln Sub-D 


J2 - 
VGA Feature 
Connector 
- 
2x13 Pin 
Header 


J3 - 
S-VHS Video 
Input - 
4-Pln Mini-DIN 


J4 - 
Composite 
Video 
Input 2 - 
RCA 


J5 - 
Composite 
Video 
Input 1- 
RCA 


8.0 
SCHEMATICS 


The schematics provided in the following pages were 
generated with OrCad and are described in Section 5.0. 
These schematics are currently available as paper copy 
in this document or electronically in OrCad format. 
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5 
6 
C1,C2,C3,C4,C5,C6 
Capacitor, 
220 pF, NU 


6 
7 
C7, C20,C21, 
C25,C26,C27,C28 
Capacitor, 
470 pF 


7 
11 
ce, C10, C11, C105, C106, C112, C113, C114, C115, 
Capacitor, 
0.01 ,...F 
C116, C117 


8 
7 
C12,C107,C120,C121,C126,C144,C148 
Capacitor, 
Tantalum, 
22 ,...F,16V 


9 
78 
C16,C34,C35,C36,C37,C38,C39,C40,C41,C42, 
Capacitor, 
0.1 ,...F 
C43,C44,C45,C46,C47,C48,C49,C50,C51,C52, 
C53, C54, C55, C56, C57, C58, C59, C60, C61 , C62, 
C63,C64,C65,C66,C67,C68,C69,C70,C71,C72, 
C73, C74, C75, C76, C77, C79, C80, C81, C82, ces, 
C84,C86,C87,C88,C89,C90,C91,C92,C93,C94, 
C95,C96,C97,C98,C99,C100,C101,C102,C103, 
C119,C122,C127,C143,C146,C147,C149,U150, 
C152 


10 
6 
C17,C18,C19,C22,C23,C24 
Capacitor, 
470 pF, NU 


11 
5 
C29,C30,C31,C32,C33 
Capacitor, 
Tantalum, 
10,...F 


12 
10 
C108,C109,C132,C133,C135,C136,C138,C139, 
Capacitor, 
,...F,NU 
C141, C142 


13 
2 
C125, C110 
Capacitor, 
0.001 ,...F 


14 
1 
C111 
Capacitor, 
22 ,...F,16V 


15 
3 
C123, C128, C153 
Capacitor, 
30 pF 


16 
2 
C124, C129 
Capacitor, 
20 pF 


17 
2 
C130,C145 
Capacitor, 
1 ,...F 


18 
1 
C151 
Capacitor, 
0.22 ,...F 


19 
3 
01,02,03 
Diode BAT54S 


20 
1 
04 
1N751 


21 
4 
EC__O,EC __C,EC __B,EC--A 
Edge Card Fingers 


22 
1 
JU1 
Header2X5 


23 
9 
JU5, JU6, JU7, JU11, J15, JU19, JU20, JU26, JU27 
Header 2X1 


24 
9 
JU8,JU9,JU10, 
JU16,JU17,JU18,JU22,JU23,JU24 
Header3X1 


25 
1 
JU12 
Header2X2 


26 
1 
JU13 
Header3X2 


27 
1 
J1 
Connector 
OB15 


28 
1 
J2 
Header 13X2 


29 
1 
J3 
4 Connector, 
S-VHS, Female 


30 
2 
J5,J4 
RCA Jack, PC Mount, RT-Angle 


31 
6 
J6,J7,J8,J9,J10,J11 
Header 26X2 


32 
1 
MMY2 
TTL osc, 14 DIP, 35.46895 
MHz, NU 
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9.0 
BILL OF MATERIALS 
(Continued) 


Item 
Quantity 
Reference 
Part 


33 
1 
MMY3 
TIL 
OSC, 14 DIP, 28.63636 
MHz, NU 


34 
1 
MMY4 
TIL 
OSC, 14 DIP, 40 MHz, NU 


35 
2 
REG2, REG1 
TL431CD 


36 
1 
REG3 
LM7805, + 5V REG, T0220 


37 
6 
RP5, RP6, RP10, RP11 , RP13, RP18 
SIP Resistor, 
15Kx9, Parallel 


38 
2' 
RP14, RP15 
SIP Resistor, 
1Kx9, Parallel 


39 
7 
RX1,RX2,R16,R17,R18,R19,R103 
Resistor, 
15K, %W, 10% 


40 
39 
R1, R2, R3, R4, R5, R7, R8, R9, R10, R11, R12, 
RESISTOR, 
33, %W,10% 
R13,R14,R15,R27,R28,R30,R31,R32,R33, 
R43,R44,R45,R54,R55,R56,R57,R58,R59, 
R60,R61,R62,R63,R64,R65,R66,R93,R94, 
R95 


41 
13 
R20, R21 ,R22, 
R23, R24, R25, R51, R52, R53, 
Resistor, 
75, %W, 10% 
R74,R75,R76,R77 


42 
1 
R26 
Resistor, 
360, %W, 10% 


43 
1 
R34 
Resistor, 
1370, %W, 5% 


44 
1 
R35 
Resistor, 
160, %W, 10% 


45 
2 
R46,R36 
Resistor, 
240, %W, 10% 


46 
1 
R37 
Resistor, 
660, %W, 10% 


47 
1 
R38 
Resistor, 
470, %W, 10% 


48 
5 
R39,R40,R100,R101,R102 
Resistor, 
2K, %W, 10% 


49 
5 
R47,R48,R49,R50,R70 
Resistor, 
100, %W, 10% 
\-, 


50 
4 
R67,R80,R81,R82 
Resistor, 
4.7K, %W, 10% 


51 
4 
R68, R69, R71, R72 
Resistor, 
750, %W, 10% 


52 
1 
R73 
Resistor, 
330, %W, 10% 


53 
2 
R78, R79 
Resistor, 
10K, %W, 10% 


54 
6 
TP1,TP2,TP3,TP4,TP5,TP6 
Test Point 


55 
8 
U1,U2,U3,U4,U5,U6,U7,U8 
256X16 VRAM -70 (SOJ) 


56 
8 
U9,U10,U11,U12,U13,U14,U15,U16 
256X16 
DRAM -70 (ZIP) 


57 
1 
U17 
68800DXI 


58 
1 
U18 
68890A 


59 
1 
U19 
82750PDI 


60 
1 
U22 
CXK1202Q 


61 
1 
U23 
68860_DAC 


62 
1 
U24 
74LS151 


63 
1 
U25 
AT118811-1 


64 
2 
U27, U26 
27256-250 


65 
1 
U28 
22V10-10 
DIP 


66 
4 
U29, U30, U31, U32 
74ALS245 


67 
1 
U33 
TDA8708 


68 
1 
U34 
TDA8709 


69 
2 
U35,U36 
74ALS374 


70 
1 
U37 
93C56 SERIAL EEPROM 


71 
2 
U38,U40 
74LS04 


72 
1 
U39 
74FOO 
I 
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9.0 
BILL OF MATERIALS (Continued) 
, 


Item 
Quantity 


73 
1 
U49 


74 
1 
U58 


75 
1 
U59 


76 
1 
U60 


77 
1 
Y1 


78 
1 
Y2 


79 
1 
Y3 


80 
1 
Y4 


Reference 
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Part 
16R8-25 PLCC 
74F174 
74LS174 
ICD2061 
XTAL, 14.318 MHz 
TTL OSC, 8-Pin DIP, 35.46895 MHz 


TTL OSC, 8-Pin DIP, 28.63636 MHz 
TTL OSC, 8-Pin DIP,40 MHz 
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1.0 
INTRODUCTION 


The 82750PD Video Processor has a Universal HOst 
Bus Interface (UHBI) that supports ISA, EISA, Micro 
Channel", PCI, and VL·Bus system buses. The Intel 
82750PD Universal Host Bus Interface Application 
Note is intended to facilitate the design of host interface 
logic needed to interface the 82750PD Video Processor 
to the system bus. 


The reader of this application note should have an un- 
derstanding of the functional and electrical characteris- 
tics of the desired system bus. In addition, the reader 
should be familiar with the electrical characteristics of 
the Universal Host Bus Interface, as described in the 
82750PD Video Processor 
Data 
Sheet (Order 
No. 


272341). 


Ap·492 


The UHBI is the connection used by the 82750PD Vid- 
eo Processor to communicate as a slave device on the 
system bus. The UHBI allows. the 82750PD Video 
Processor to support several system buses by changing 
the host interface logic. The system bus type is selected 
through hardwiring (pull-up and pull-down resistors) 
the BUSTYP[2:o1 pins in the proper configuration, 
which are sampled upon the rising edge of RESET. 


A generic integrated video/graphics 
system showing 
the UHBI, host interface logic, and the system bus is 
shown below. A typical system includes the 82750PD 
Video Processor, a video I/O device, and a graphics 
controller. 


VRAM 


DRAM 


System Bus 
272378-1 


Video 


Figure 1. Integrated 
Video/Graphics 
System 
Block Diagram 


I 
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2.0 
RESET CONFIGURATION 


The 82750PD Video Processor is initialized as a result 
of the RESET input being asserted. The Reset Se- 
quence Timing is shown below: 


intel~ 


The reset configuration pins (including the UHBI con- 
figuration) are latched upon the falling edge of RESET. 
The setup (T3) and hold (T4) times for latching these 
inputs must be met to ensure proper configuration. 


Refer to the 82750PD Video Processor Data Sheet for 
timing details. 


V\..V\..V\.. •••V\.. V\.. ~ 
••• r»:r-: •••r»:r-:h..~ 
~ 


~ 
••• 
••• 
•••- 


, 


••• 
••• 
••• 
--- 
••••• ,h,•••• 
~ 
I 
- 
, 
I 
T~ 
T4 
~ 
I- 
ll-- r 
~ 
re.t.~ 
~ 
'-- 
CLKOUT 


MCLK 


RESET 


OUTPUTS 


GRANT, 
MD(18:0), 
MPRQ(l,O) 


272378-2 


Figure 2. Reset Sequence 
Timing 


3.0 
UNIVERSAL HOST BUS 
I~TERFACE CONFIGURATION 


One of the initialization configuration options is the 
UHBI configuration. The UHBI is configured through 
hardwiring 
the BUSTYP[2:0] pins upon the falling 
edge of RESET. The BUSTYP[2:0] pins are multi- 
plexed with the MD[2:0] outputs, which are floated 
during Reset. Table 1 describes the decoding for the 
lj"HBI configuration. 


Table 1. UHBI Configuration 


UHBI 
BUSTYP[2] 
BUSTYP[1] 
BUSTYP[O] 
Configuration 


EISA 
Low 
Low 
Low 


ISA 
Low 
Low 
High 


Micro Channel 
Low 
High 
Low 


Reserved 
Low 
High 
High 


PCI 
High 
Low 
Low 


Reserved 
High 
Low 
High 


Reserved 
High 
High 
Low 


VL-Bus 
High 
High 
High 
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4.0 
ISA INTERFACE 


The ISA interface reduces the number of signals re- 
quired to attach to the ISA system bus by multiplexing 
'SA[15:0] with SD[15:0]. The ISA interface logic con- 
sists of four 74ALS245s, four lOK resistors, and one 
inverter. 


The address bus is buffered using two 74ALS245s. The 
DIR pin of the 74ALS245s is tied to Vcc. allowing the 
address to pass from the SA bus to the AD bus. The 
inverted state of the RENA pin is used to enable the 
74ALS245s to drive the address on the AD bus. 


The data bus is also buffered using two 74ALS245s. 
The DIR pin is used to control the direction of the 
transceiver. A low signal allows data to pass from the 
AD bus to the SD bus and is used for ISA read cycles. 
A high signal allows data to pass from the SD bus to 
the AD bus and is used for ISA write cycles. The 
EN#[O] and EN#[llpins 
are used to enable the low 
byte and high byte of the data bus, respectively. 


Four 10K resistors are used to pull up DIR, EN#[Ol, 
EN#[l], 
and RENA. These signals are open collector 
to allow other devices, such as a graphics controller or 
video I/O, to use the same transceivers, 


Unused inputs are tied to Yss through a lOK resistor, 
while unused outputs are not connected. See Figure 3 
for PQFP pin numbers. 
I 


intel~ 


ISA System Bus 
82750PD PQFP Pins 


AEN 1- 
1!.!:5~6 
AEN 


BALE 
147 BALE 
IOCHRDY 
155 IOCHRDY 
IOCS16# 
151 IOCS16# 
IOR# 
143 IOR# 
IOW# 
144 IOW# 
IRQx 
13 IRQ 
MEMCS16# 
152 MEMCS16# 
MEMR# 
146 MEMR# 
MEMW# 
145 MEMW# 
NOWS# 
153 NOWS# 
REFRESH# 
157 REFRESH# 
RESET 
159 RESET 
SBHE# 
142 SBHE# 
SA16 
1~ 
A[16] 
LA[23:17) 
101 102. 104-10F A[23:17) 


11!l 1?O 1??-12 
AD[7:0] 
.5" 


7-.s2~ 


8(7:0) 


_ 
A(7:0) 
ENlI--f.--· 


DIR 
- 
7-.s2~ 


Unused Host 
Interface Pins 


No- 
TIedto Vss 
Connects (thru 10Kreaistor) 


128 
11 


140 
89 
90 
93-95 
97 
99 
100 
133 
141 
149 


8(7:0) 
-- 


A(7:O) 
EN. 
DIR 
-< 
7-.s2~ 
+5V 


Figure 3. ISA Interface Logic 


I 
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5.0 
EISA INTERFACE 
ceiver. A low signal allows data to pass from the AD 
bus to the SD bus and is used for EISA read cycles. A 
high signal allows data to pass from the SD bus to the 
AD 
bus and is used for EISA write cycles. The 
EN#[O], EN#[I], 
and EN#[23] 
pins are used to en- 
able the low byte, second byte, and two high bytes of 
the data bus, respectively. 


The EISA interface reduces the number of signals re- 
quired to attach to the EISA system bus by multiplex- 
ing LA[31:0] with SD[31:0]. The EISA interface logic 
consists of eight 74ALS245s, five lOK resistors, and 
one inverter. 


The address bus is buffered using four 74ALS245s. The 
DIR pin of the 74ALS245s is tied to VCC;,allowing the 
address to pass from the LA bus to the AD bus. The 
inverted state of the RENA pin is used to enable the 
74ALS245s to drive the address on the AD bus. 


Five lOK resistors are used to pull up DIR, EN # [0], 
EN#[ll, 
EN#[23], 
and RENA. 
These signals are 
open collector to allow other devices, such as a graphics 
controller or video I/O, to use the same transceivers. 


The data bus is also buffered using four 74ALS245s. 
.The DIR pin is used to control direction of the trans- 


Unused inputs are tied to Vss through a 10K resistor, 
while unused outputs are not connected. See Figure 4a 
for PQFP .pin numbers . 


EISA System Bus 
82750PD Pins 


AEN 
156 
AEN 
BCLK 
11 
BCLK 
BE#[3:0] 
141 140 139 138 
BE#[3:0] 
CMD# 
146 
CMD# 
EX32# 
152 
EX32# 
EXRDY 
155 
EXRDY 
1016# 
151 
1016# 
IRQx 
, 
13 
IRQ 
MIIO# 
145 
MIIO# 
MSBURST# 
149 
MSBURST# 
NOWS# 
153 
NOWS# 
REFRESH# 
157 
REFRESH# 
RESET 
159 
RESET 
SLBURST# 
147 
SLBURST# 
START# 
143 
START# 
W/R# 
144 
W/A# 


SO[7:0] 
AD[7:0] 


SO[15:8] 
AD[15:8] 


SO[23:16] 
See 
AD[23:16] 


SO[31:24] 
Figure 4b 
I 
AD[31:24] 


LA[7:0] 
OIR 
EN#[23] 
LA[15:8] 
Unused Host Interface Pins 
EN#[O] 


No-Connects 
Tledto 
Vss 
EN#[1] 
LA[23:1Sj 
128 
137 
RENA 


134-136 
LA[31:24] 
142 
, 
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Figure 4a. EISA Interface Logic 
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8(7:0) 
-----I A(7:0) 
ENt -~ 
ClR 


7oW.S245 


8(7:0) 
10- 


A(7:0) 


ENt ~ 


ClR I-++M1119120 
122-12 
AD[7:0) 


7oW.S245 
110111113-11 
AD[15:8) 
101 102104-10 
AD[23:16) 
•••••..:89~90~93-~.:.;11 
O~ AD[31:24) 
8(7:0) 
- 


A(7:0) 


ENt ~ 
.•!IV 
ClR H-t+1•• 
l<J<,> ' 
7oW.S245 
~i~:' . . • 
4-..•.....•.... 
't-+---t-:1.::;:.3.:..t1 
EN#[O) 
8(7:0) 
- 
132 
EN#[l) 
"-- 
A(7:0) 
L....I- •...•-++-+1""'13~3EN#(23) 


~~ 1--I- 
.•.•. 
+-f.l11~29OIR 


7oW.S245 


82750PD Pins 


8(7:0) I-- 


"---- 
A(7:0) 


ENtl--oI-~ 
ClR 
I--~ 


7<1AlS245 


f<11-_~1.:.:3~0 
RENA 


8(7:0) 
10- 
L.._--1 A(7:0) 


ENt.---+-+ 
•.• 
ClR 1--1- 


I~--------- 


7<1AlS245 


I 


EISA System Bus 


50[7:0) - 


50[15:8) 
I-- 


SO[23:16) 


SO[31:24) 


LA[7:0) 


LA[15:8) 


1A[23:16) I-- 


1A[31:24) ~ 


8(7:0) I-- 


A(7:0) 


ENt 1--01- 
•.••.• 
ClR 
~~ 


7<1AlS245 


8(7:0) I-- 


A(7:0) 


EN• .--_ 
•.••••• 
ClR f-- 


7<1AlS245 


Figure 4b. EISA Interface Logic 
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The DIR pin is used to control direction of the trans- 
for PQFP pin numbers. 


Micro Channel 
System Bus 


BE#[3:0] 
CHROY 
CMO# 
051611 
0532# 
IRQx 
MA0E24 
M/IO# 
REFRE5H# 
RE5ET 
50# 
51# 
5BHE# 
5ETUP# 
5FOBK# 


50[7:0] 


50[15:8] 


50[23:16] 


50[31:24] 


LA[7:0] 


LA[15:8] 


LA[23:16] 


LA[31:24] 


82750PD Pins 


141 140 139 138 
BE#[3:0] 
133 
CHROY 
146 
CMO# 
135 
051611/0532# 
T 
13 
IRQ 
151 
MAOE24 
145 
M/I0# 
157 
REFRE5H# 
159 
RE5ET 
143 
50# 
144 
51#· 


142 
5BHE# 
156 
5ETUP# 
134 
5FOBK# 


See 
Figure 5b 


Unused Host Interface Pins 


AO[7:0] 


AO[15:8] 


AO[23:16] 


AO[31:24] 


OIR 
EN#[O] 
EN#[l] 
RENA 
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No-Connect. 
Tied to Vss 


128 
1,1 
136 
137 
147 
149 
152 
153 
155 


Figure 5a. Micro Channel Interface Logic 
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8(7:0) 
-----t "(7:0) 
EN' 1-- •.•• 


OIR 


8(7:0) 
I""- 
-----t 
"(7:0) 


ENt 


OIR t--+-t91119120 
122-127 
AO(7:O] 


7~45 
110111113-118 
AO[15:8] 
101102 
104-109 
AD[23:16] 
899093-10( 
AO[31:24] 
8(7:0) t-- 
t------t 
"(7:0) 


EN' f- 
+5" 
OIR r-++-i.. 
> 
101<> 
74ALS245 
(><4)> • 
> 


• 
• 
• 131 
EN#[O] 
8(7:0) I- 


I-- 
"(7:0) 
•.••.•+-+-_-+-t--t-;1-=32~ 
EN#[1] 
=J-+--e---~H-'1 
•.•• 
299OIR 


Micro Channel 
System Bus 


80(7:0] - 
80[15:8] 
I-- 


80[23:16] 


80[31:24] 


LA(7:0] 


LA[15:8] 


LA[23:16] ~ 


LA[31:24] I- 


74AlS245 


82750PD Pins 


7~45 
~~ 
•....•-,1",,30"'i 
RENA 


8(7:0) I-- 


""'-- 
"(7:0) 
EN'~+- 
••...• 
OIR 
f--I- 


74AlS245 


8(7:0) I-- 
•...---1 "(7:0) 


EN' t--+-I~ 
OIR f-- I- 


7~45 


8(7:0) - 
•...----t "(7:0) 


ENt t--+-I~ 
OIR f-- I- 


74ALS245 


8(7:0) - 


"(7:0) 


ENt L-- 
OIR r----' 


74ALS245 
r 
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Figure 5b. Micro Channel Interface Logic 
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7.0 
PCI INTERFACE 
Unused inputs are tied to VSS through a lOK resistor, 
while unused outputs are not connected. See Figure 6 
for PQFP pin numbers. 
The PCI interface is a glueless interface. No external 
components are required. 


PCI System Bus 
82750PD Pins 


C/BE#[3:0] 
137 136 135 134 
C/BE#[3:0] 
CLK 
11 
CLK 
DEVSEL# 
131 
DEVSEL# 
FRAME# 
133 
FRAME# 
IDSEL 
156 
IDSEL 
IRDY# 
153 
IRDY# 
IRQx 
13 
IRQ 
PAR 
129 
PAR 
RESET 
159 
RESET 
STOP# 
130 
STOP# 
TRDY# 
132 
TRDY# 


119,120,122-127 
110,111,113-11 a 
101,102,104-109 
AD[31:0] 
89,90,93-100 
AD[31:0] 


Unused Host Interface Pins 


No-Connecta 
Tied to Vss 


128 
145 
140 
146 
147 
149 
151 
157 
152 
155 
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Figure 6. PCllnterface 
Logic 
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8.0 
VL-BUS INTERFACE 
ceiver. A low signal allows data to pass from the AD 
bus to the DAT bus and is used for VL-Bus read cycles. 
A high signal allows data to pass from the DAT bus to 
the AD bus and is used for VL-Bus write cycles. The 
EN # pin is used to enable all four bytes of the data 
bus. 


The VL-Bus interface reduces the number of signals 
required to attach to the VL-Bus system bus by multi- 
plexing ADR[31:0) with DAT[31:0). The VL-Bus in- 
terface logic consists of eight 74ALS245s, three 
lOK 


resistors, and one inverter. 


Three lOK resistors are used to pull up DIR, EN # and 
RENA. These signals are open collector to allow other 
devices, such as a graphics controller or Video I/O, to 
use the same transceiver. 


The address bus is buffered using four 74ALS245s. The 
DIR pin of the 74ALS245s is tied to Vcc, allowing the 
address to pass from the ADR bus to the AD bus. The 
inverted state of the RENA pin is used to enable the 
74ALS245s to drive the address on the AD bus. 


The data bus is also buffered using four 74ALS245s. 
The DIR pin is used to control direction of the trans- 


Unused inputs are tied to Vss through a lOK resistor, 
while unused outputs are not connected. See Figure 7a 
for PQFP pin numbers. 


ISA System Bus 


IRQx 
RESET 


VL-Bus System Bus 
82750PD Pins 


BE#[3:0] 
137.136.135.134 
BE#[3:0] 
BLAST# 
14~ 
BLAST# 
BROY# 
153 
BROY# 
0/C# 
139 
0/C# 
10[4:0] 
114B 14514414314? 
10[4:0] 
13 
IRQ 
LAOS# 
147 
LAOS# 
LCLK 
11 
LCLK 
LOEV# 
1:~? 
LDEV# 
LAOY# 
1~55 LAOY# 
MI/O# 
1:~8 
MI/O# 
ROYR# 
1~:;B 
ROYR# 
1~59 RESET 
W/R# 
1:~3 
W/R# 


OAT(7:0] 
AD[7:0] 
OAT[15:8] 
AD[15:8] 
OAT[23:1Sj 
See 
AD [23: 16] 
OAT[31:24J 
Figure 7b 
AD[31:24] 
ADR(7:0] 
OIR 
ADR[15:8] 
Unused Host Interface Pins 
EN# 
RENA 
ADR[23:16] 
No-Connects 
Tledto 
Vss 


128 
141 


ADR[31:24] 
140 
151 
152 
157 


Figure 7a. VL-Bus Interface Logic 
I 
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8(7:0) 
-----I 1'.(7:0) 
ENt1--1- 


DlR 1--- 


8(7:0) I-- 
1------4 A(7:0) 
ENtt-- I- 


DlR 


VL-Bus 
System Bus 


DAT[7:0] - 


DAn15:8] 
!""-" 
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1.0 
FUNCTIONAL DESCRIPTION 


As shown 
by the block 
diagram 
on the front 
page, 


the i860 XP Microprocessor 
consists 
of the following 
units: 


1. Integer 
Registers 
and Core Execution 
Unit 


2. 
Floating-Point 
Registers 
and Control 
Unit 


3. 
Floating-Point 
Adder Unit 


4. 
Floating-Point 
Multiplier 
Unit 


5. Graphics 
Unit 


6. Paging Unit 


7. Instruction 
Cache 


8. Data Cache 


9. Bus and Cache Control 
Unit 


10. Detached 
Concurrency 
Control 
Unit 


The core execution 
unit controls 
overall operation 
of 
the i860 XP microprocessor. 
It executes 
load 
store 


integer, bit, I/O, and control-transfer 
operatio~s, 
and 


fetches 
instructions 
for the floating-point 
unit as well. 


A set of 32 x 32-bit 
general-purpose 
registers 
are 
provided 
for the manipulation 
of integer 
data. Load 
and store instructions 
move 8-, 16-, and 32-bit data 
to ~nd from 
these 
registers. 
Its full set of integer, 
logical, 
and 
control-transfer 
instructions 
give 
the 
core 
unit the 
ability 
to execute 
complete 
systems 
software 
and applications 
programs. 
A trap mecha- 
nism provides 
rapid response 
to exceptions 
and ex- 


ternal interrupts. 
Debugging 
is supported 
by the abili- 
ty to trap on data or instruction 
reference. 


The floating-point 
hardware 
is connected 
to a sepa- 


rate set of floating-point 
registers, 
which can be ac- 
cessed 
as 16 x 64-bit 
registers 
or as 32 x 32-bit 
registers. 
Load and store 
instructions 
can also ac- 
cess these 
same registers 
as 8 x 128-bit registers. 


All floating-point 
and graphics 
instructions 
use these 
registers 
as their source 
and destination 
operands. 


The floating-point 
control 
unit controls 
both the float- 
ing-point 
adder and the floating-point 
multiplier, 
issu- 
ing instructions, 
handling 
all source 
and 
result 
ex- 
ceptions, 
and 
updating 
status 
bits 
in the 
floating- 
point 
status 
register. 
The adder 
and multiplier 
can 
operate 
in parallel, 
producing 
up to two results 
per 
clock. The floating-point 
data types, floating-point 
in- 
structions, 
and 
exception 
handling 
all support 
the 
IEEE Standard 
for Binary 
Floating-Point 
Arithmetic 
(ANSI/IEEE 
Std 754-1985). 


The floating-point 
adder performs 
addition 
subtrac- 
tion, comparison, 
and conversions 
on 64- ~nd 32-bit 
floating-point 
values. 
An adder 
instruction 
executes 


in thre.e clocks; 
however, 
in pipelined 
mode, a new 
result 
IS generated 
every clock. 
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The 
floating-point 
multiplier 
performs 
floating-point 
and integer multiply as well as floating-point 
recipro- 
cal operations 
on 64- and 32-bit 
floating-point 
val- 
ues. A multiplier 
instruction 
executes 
in three to four 
clocks; however, 
in pipe lined mode, a new result can 
be generated 
every 
clock 
for 
single-precision 
and 
every other clock for double 
precision. 


The graphics 
unit supports 
three-dimensional 
draw- 
ing in a graphics 
frame 
buffer, 
with color 
intensity 
shading 
and 
hidden 
surface 
elimination 
via 
the 
Z-buffer 
algorithm. 
The graphics 
unit recognizes 
the 
pixel as an 8-, 16-, or 32-bit integer data type. It can 
compute 
individual 
red, blue, and green color inten- 
sity values within a pixel; but it does so with parallel 
operations 
that take advantage 
of the 64-bit internal 
word size and 64-bit external 
bus. The graphics 
fea- 
• 
tures of the i860 XP microprocessor 
assume that the 
surface 
of 
a solid 
object 
is drawn 
with 
polygon 
patches 
which, 
like the pieces 
of a puzzle, 
collec- 
tively approximate 
the shape 
of the original 
object. 


The color 
intensities 
of the vertices 
of the polygon 
and their distances 
from the viewer 
are known, 
but 
the 
distances 
and 
intensities 
of 
the 
other 
points 
must be calculated 
by interpolation. 
The graphics 
in- 
structions 
of the i860 XP microprocessor 
directly 
aid 
such interpolation. 


The paging unit implements 
protected, 
paged, virtual 
memory. 
The paging 
unit uses two four-way 
set-as- 


sociative 
cache 
memories 
called 
TLBs 
(Translation 
Lookaside 
Buffers) to perform 
the translation 
of logi- 
cal address 
to physical 
address, 
and to check 
for 
access 
violations. 
The 
access 
protection 
scheme 
employs 
two levels of privilege: 
user and supervisor. 
One TLB supports 
4 Kbyte 
pages, 
and has 64 en- 
tries; the other supports 
4 Mbyte pages, and has 16 
entries. 


The instruction 
cache 
is a four-way 
set-associative 
memory of 16 Kbytes, with 32-byte 
lines. It transfers 
up to 64 bits per clock 
(400 Mbyte/sec 
at 50 MHz). 


The data cache is a four-way 
set-associative 
memo- 
ry of 16 Kbytes, with 32-byte 
lines. It transfers 
up to 
128 bits per clock 
(800 Mbyte/sec 
at 50 MHz). The 
i860 
XP microprocessor 
normally 
uses 
write-back 
caching, 
i.e. memory 
writes update the cache 
(if ap- 
plicable) 
without 
necessarily 
updating 
memory 
im- 
mediately; 
however, 
under both software 
and hard- 
ware 
control, 
write-through 
and write-once 
policies 
can be implemented, 
or caching 
can be inhibited. 


The 
caches 
are 
transparent 
to 
applications 
soft- 
ware. 


The bus and cache 
control 
unit performs 
data and 
instruction 
accesses 
for the core unit. It receives 
cy- 
cle requests 
and specifications 
from 
the core 
unit, 


performs 
the data-cache 
or instruction-cache 
miss 
processing, 
controls 
TLB 
translation, 
and 
provides 
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the interface to the external bus. Its pipelined struc- 
ture supports up to three outstanding bus cycles. Its 
burst mode transfers data at up to 400 Mbyte/sec at 
50 MHz. In multiprocessor systems, it maintains 
cache consistency by monitoring bus activity in par- 
allel with other CPU functions. 


The DCCU (detached concurrency control unit) is a 
compatible subset of the external CCU that expe- 
dites loop-level parallelism and synchronization in 
multiprocessor systems. The DCCU consists of reg- 
isters and a counter that allow a single i860 XP mi- 
croprocessor to run binary code compiled for a mul- 
tiprocessor system adhering to the PAX parallel ap- 
plications binary interface (ABI). 


The i860 XP microprocessor may to be used with or 
without an external, secondary cache built from 
82495XP and 82490XP cache components. An 
82495XP and 82490XP cache provides up to 512 
Kbytes of high-speed storage for data and instruc- 
tion combined. In most cases, an 82495XP and 
82490XP cache can provide data to the CPU with 
zero wait states. The larger size of an external cache 
can provide an increased hit rate when the size or 
number of data structures and programs exceeds 
the size of the internal caches. In multiprocessor 
systems, the external cache serves as local memo- 
ry, and can reduce bus traffic. An external cache 
also hides the processor from rest of system, which 
is a double advantage: 
1. The processor can be upgraded without affecting 
design of the memory and other subsystems. 
2. Slower and less expensive memory and 110 sub- 
system designs can be employed without unduly 
lowering overall system performance. 


Refer to the 82495XP Cache Controller/82490XP 
Cache RAM Data Sheet (lntel Order #240956) for 
more information. 


2.0 
PROGRAMMING INTERFACE 


The programmer-visible aspects of the architecture 
of the i860 XP microprocessor include data types, 
registers, instructions, and traps. 


2.1 
Data Types 


The i860 XP microprocessor provides operations for 
integer and floating-point data. Integer operations 
are performed on 32-bit operands with some support 
also for 64-bit operands. Load and store instructions 
can reference 8-bit, 16-bit, 32-bit, 64-bit, and 128-bit 
operands. Floating-point operations are performed 
on IEEE-standard 32- and 64-bit formats. Graphics 
instructions operate on arrays of 8-, 16-, or 32-bit 
pixels. 
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2.1-1 
INTEGER 


An integer is a 32-bit signed value in standard two's 
complement form. A 32-bit integer can represent a 
value in the 
range 
- 2,147,483,648 
(- 231) to 
2,147,483,647 (+ 231 - 
1). Arithmetic operations on 
8- and 16-bit integers can be performed by sign-ex- 
tending the 8- or 16-bit values to 32 bits, then using 
the 32-bit operations. 


There are also add and subtract instructions that op- 
erate on 64-bit long integers. 


Load and store instructions may also reference (in 
addition to the 32- and 64-bit formats previously 
mentioned) 8- and 16-bit items in memory. When an 
8- or 16-bit item is loaded into a register, it is con- 
verted to an integer by sign-extending the value to 
32 bits. When an 8- or 16-bit item is stored from a 
register, the corresponding number of low-order bits 
of the register are used. 


2.1.2 
ORDINAL 


Arithmetic operations are available for 32-bit ordi- 
nals. An ordinal is an unsigned integer. An ordinal 
can 
represent 
values 
in 
the 
range 
0 
to 
4,294,967,295 (+232 - 
1). 


Also, there are add and subtract instructions that op- 
erate on 64-bit ordinals. 


2.1.3 
SINGLE- 
AND DOUBLE-PRECISION 
REAL 


Figure 2.1 shows the real number formats. A single- 
precision real (also called "single real") data type is 
a 32-bit binary floating-point number. Bit 31 is the 
sign bit; bits 30..23 are the exponent; and bits 22..0 
are the fraction. In accordance with ANSIIIEEE 
standard 754, the value of a single-precision real is 
defined as follows: 
1. If e = 0 and f "* 0 or e = 255 then generate a 
floating-point source-exception trap when en- 
countered in a floating-point operation. 
2. If 0 < e !>: 255, then the valueis (-1)5 x 1.f x 


2e-127. 


3. If e = 0 and f= 
0, then the value is signed zero. 


A double-precision real (also called "double real") 
data type is a 64-bit binary floating-point number. Bit 
63 is the sign bit; bits 62..52 are the exponent; and 
bits 51..0 are the fraction. In accordance with ANSII 
IEEE standard 754, the value of a double-precision 
real is defined as follows: 
1. If e = 0 and f "* 0 or e = 2047, then generate a 
floating-point source-exception trap when en- 
countered in a floating-point operation. 


2. If 0 < e < 2047, then the value is (-1)S x 1.f x 


2e-1023. 


3. If e = 0 and f = 0, then the value is signed zero. 


The special values infinity, NaN ("Not a Number"), 
indefinite, and denormal generate a trap when en- 
countered. The trap handler implements IEEE-stan- 
dard results. 


A double real value occupies an even/odd pair of 
floating-point registers. Bits 31..0 are stored in the 
even-numbered floating-point register; bits 63..32 
are stored in the next higher odd-numbered floating- 
point register. 


2.1.4 PIXEL 


A pixel may be 8-, 16-, or 32-bits long, depending on 
color and intensity resolution requirements. Regard- 
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less of the pixel size, the i860 XP microprocessor 
always operates on 64 bits of pixel data at a time. 
The pixel data type is used by two kinds of instruc- 
tions: 
• The selective pixel-store instruction that helps im- 
plement hidden surface elimination. 
• The pixel add instruction that helps implement 
3-~ color intensity shading. 


To perform color intensity shading efficiently in a va- 
riety of applications, the i860 XP microprocessor de- 
fines three pixel formats according to Table 2.1. 


Figure 2.2 illustrates one way of assigning meaning 
to the fields of pixels. These assignments are for 
illustration purposes only. The i860 XP microproces- 2 
sor defines only the field sizes, not the specific use 
of each field. Other ways of using the fields of pixels 
are possible. 


Single-Precision Real 
Double-Precision 
Real 


Sf SO 23-'22 
0 
6S62 
52'51 
0 
Is 
e 
f 
Is 
e 
I· 
f 


\ 


SIGN 2-J 
1 
f 
L-FRACTION 


EXPONENT 
EXPONENT 


FRACTION 
SIGN 
240874-2 
240874-3 


Figure 2.1. Real Number Formats 


Table 2.1. Plxel Formats 


Plxel 
Bits of 
Bits of 
Bltsof 
Bits of 


Size 
Color 1 
Color2 
Color3 
Other 


(In bits) 
Intenslty(1) 
Intenslty(1) 
Intenslty(1) 
Attribute 
(Texture, Color) 


8 
N (";:8) bits of intensity(2) 
8-N 
16M 
6 
I 
6 
I 


4 
0 
32 
8 
8 
8 
8 


NOTES: 
1. The intensity attribute fields may be assigned to colors in any order convenient to the application. 
2. With a-bit pixels, up to a bits can be used for intensity; the remaining bits can be used for any other attribute, such as 
color or texture. Bits that require interpolation (shading), such as those for intensity, must be the low-order bits of the pixel. 
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'.11J0292827262524 
'2J22212019181716 
(514 IJ 12ff 
10 9 8 765452 
I ° 
I 
RED 
I 
GREEN 
BLUE 
TEXTURE 


NOTE: 
These assignments of specific meanings to the fields of pixels are for illustration only. Only the field sizes are defined, 
not the specific use of each field. 


32-BIT 
PIXEL 


16-81T 
PIXEL 
r_ol 
8-BIT 
PIXEL 
~ 
COLOR 
_ 


15 14 IS 12 11 10 98765 
4 s 2 1 0 
I 
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240B74-4 


2.2 
Register Set 


Figure 2.2. Plxel Format Example 


As Figure 2.3 shows, the i860 XP microprocessor 
has the following registers: 


• An integer register file 
• A floating-point register file 
• Control registers psr, epsr, db, dirbase, fir, fsr, 
bear, ccr, p3, p2, p1, pO 
• Special-purpose registers KR, KI, T, MERGE, 
STAT, and NEWCURR 


The control registers are accessible only by load 
and store control-register instructions; the integer 
and floating-point registers are accessed by arithme- 
tic operations and load and store instructions. The 
special-purpose registers KR, KI, and T are used by 
floating-point instructions; MERGE is used by graph- 
ics instructions. NEWCURR and STAT are used for 
concurrency control; they are accessed by memory 
load and store instructions. 


2.2.1 INTEGER REGISTER FILE 


There are 32 integer registers, each 32 bits wide, 
referred to as rO through r31, which are used for 
address computation and scalar integer computa- 
tions. Register rO always returns zero when read. 


2.2.2 FLOATING·POINT REGISTER FILE 


There are 32 floating-point registers, each 32-bits 
wide, referred to as fO through f31, which are used 
for floating-point computations. Registers fO and f1 
always return zero when read. The floating-point 
registers are also used by a set of integer opera- 
tions, primarily for graphics computations.. 
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When accessing 64-bit floating-point or integer val- 
ues, the i860 XP microprocessor uses an even/odd 
pair of registers. When accessing 128-bit values, it 
uses an aligned set of four registers (fO,f4, f8, f12, 
f16, f20, f24, or f28). The instruction must designate 
the lowest register number of the set of registers 
containing 64- or 128-bit values. Misaligned register 
numbers produce undefined results. The register 
with the lowest number contains the least significant 
part of the value. For 128-bit values, the register pair 
with the lower number contains the value from the 
lower memory address; the register pair with the 
higher number contains the value from the higher 
address. 


The 128-bit load and store instructions, along with 
the 128-bit data path between the floating-point reg- 
isters and the data cache, help to sustain an extraor- 
dinarily high rate of computation. 


2.2.3 PROCESSOR STATUS REGISTER 


The processor status register (psr) contains miscel- 
laneous state information for the current process. 
Figure 2.4 shows the format of the psr. 


• BR (Break Read) and BW (Break Write) enable a 
data access trap when the operand address 
matches the address in the db register and a 
read or write (respectively) occurs. 


• Various instructions set CC (Condition Code) ac- 
cording to tests they perform. The branch-on- 
condition-code instructions test its value. The bla 
instruction sets and tests LCC (Loop Condition 
Code). 


• IM (Interrupt Mode), if set, enables external inter- 
rupts on the INT pin; disables interrupts on INT if 
clear. IM does not affect parity error interrupts or 
interrupts on the BERR pin. 


•-nt_I 
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CCR 
PSR 


PO 
DlRllAS£ 


PI 
CB 


P2 
F'SR 


P3 
EI'SII 


DATA 
I. 


BUFFERS 
128 


ADDRESS 
32 
I• 
U KBYTE 
DATA CACHE 
• 
• 
• 
I 
I 
I 


32 
32 


PTAGS 
32 


240874-5 


Figure 2.3.Registers and Data Paths 


• U (User Mode) is set when the i860 XP micro- 
• FT (Floating-Point Trap), OAT (Data Access 
processor is executing in user mode; it is clear 
Trap), IAT (Instruction Access Trap), IN (Inter- 
when the i860 XP microprocessoris executingin 
rupt), and IT (Instruction Trap) are trap flags. 
supervisor mode. In user mode, writes to some 
They are set when the correspondingtrap condi- 
control registers are inhibited.This bit also con- 
tion occurs. IN is set on INT, bus error and parity 
trois the memoryprotection mechanism. 
error. The trap handler examinesthese bits (and 
• PIM (PreviousInterrupt Mode) and PU (Previous 
other trap bits in the epsr) to determine which 
User Mode) save the corresponding status bits 
conditionor conditionshavecausedthe trap. The 
(IM and U) on a trap, because those status bits 
interruptflag (IN)is set on externalinterrupts,bus 
are changed when a trap occurs. They are re- 
errors and parity errors. 
stored into their correspondingstatus bits when 
• OS(DelayedSwitch)is set if a trap occurs during 
returningfrom a trap handler with a branch indi- 
the instructionbeforedual-instructionmodeis en- 
rect instructionwhen a trap flag is set in the psr. 
tered or exited.If OSis set and DIM(DualInstruc- 
tion Mode) is clear, the i860 XP microprocessor 
switchesto dual-instructionmode one instruction 


• 
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BREAK READ 
BREAK WRITE 
CONDITION CODE 
LOOP CONDITION CODE 
INTERRUPT MODE 
. 


PREVIOUS INTERRUPT MODE 
USER MODE 
PREVIOUS USER MODE 
INSTRUCTION TRAP 
, 


INTERRUPT 
INSTRUCTION ACCESS TRAP 
DATA ACCESS TRAP 
FLOATING-POINT TRAP 
ill 


DELAYED SWITCH 
DUAL INSTRUCTION MODE 
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Figure 2.4. Processor Status Register 


after returning from the trap handler. If OS and DIM 
are both set, the i860 XP microprocessor switches 
to sinqle-instruction mode one instruction after re- 
turning from the trap handler. 


• When a trap occurs, the i860 XP microprocessor 
sets DIM if it is executing in dual-instruction 
mode; it clears DIM if it is executing in single-in- 
struction mode. If DIM is set after returning from a 
trap handler, the i860 XP microprocessor re- 
sumes execution in dual-instruction mode. 


• When KNF (Kill Next Floatinq-Point Instruction) is 
set, the next floating-point instruction is sup- 
pressed (except that its dual-instruction mode bit 
is interpreted). A trap handler sets KNF if the 
trapped floating-point instruction should not be 
reexecuted. 
• SC (Shift Count) stores the shift count used by 
the last right-shift instruction. It controls the num- 
ber of shifts executed by the douole-shitt instruc- 
tion. 


• PS (Pixel Size) and PM (Pixel Mask) are used by 
the pixel-store and other graphics instructions. 
The values of PS control pixel size as defined by 
Table 2.2. The bits in PM correspond to pixels to 
be updated by the pixel-store instruction pst.d. 
The low-order bit of PM corresponds to the low- 
order pixel of the 64-bit source operand of pst.d. 
The number of low-order bits of PM that are actu- 
ally used is the number of pixels that fit into 
64·bits, which depends upon PS. If a bit of PM is 
set, then pst.d stores the corresponding pixel. 
Refer also to the pst.d 
instruction in section 10. 
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Table 2.2. Values of PS 


PlxelSlze 
PlxelSlze 
Value 
In Bits 
In Bytes 


00 
8 
1 
01 
16 
2 
10 
32 
4 
11 
(undefined) 
(undefined) 


2.2.4 EXTENDED PROCESSOR STATUS 
REGISTER 


The extended processor status register (epsr) con- 
tains additional state information for the current pro- 
cess beyond that stored in the psr. Figure 2.5 shows 
the format of the epsr. 
• The processor type is 2 for the i860 XP micro- 
processor. 


• The stepping number has a unique value that dis- 
tinguishes among different revisions of the proc- 
essor. 


• IL (Interlock) is set if a trap occurs after a lock 
instruction but before the last BRDY11 of the load 
or store following the subsequent unlock instruc- 
tion. IL indicates to the trap handler that a locked 
sequence has been interrupted. When the trap 
handler finds IL set, it should scan backwards for 
the lock instruction and restart at that point. The 
absence of a lock instruction within 30-33 
in- 
structions of the trap indicates a programming er- 
ror. 
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Figure 2.5. Extended Processor Status Register 


• WP (write protect) controls the semantics of the 
W bit of page table entries. A clear W bit in either 
the directory or the page table entry causes 
writes to be trapped. When WP is clear, writes 
are trapped in user mode, but not in supervisor 
mode. When WP is set, writes are trapped in both 
user and supervisor modes. 
• PEF (parity error flag) is set by the i860 XP micro- 
processor when a parity error trap occurs. As 
soon as PEF is set, further parity error and bus 
error traps are masked. Software must clear PEF 
to reenable such traps. PEF is set at RESET. 


• BEF (bus error flag) is set by the i860 XP micro- 
processor when the BERR pin is asserted, indi- 
cating a bus error. As soon as BEF is set, further 
parity error and bus error traps are masked. Soft- 
ware must clear BEF to reenable such traps. BEF 
is set at RESET. 
• INT (Interrupt) is the value of the INT input pin. 
• DCS (Data Cache Size) is a read-only field that 
tells the size of the on-chip data cache. The num- 
ber of bytes actually available is 212 + 
DCS; 
therefore, a value of zero indicates 4 Kbytes, one 
indicates 8 Kbytes, etc. The value of DCS for the 
i860 XP microprocessor is two, which indicates 
16 Kbytes. 


• PBM (Page-Table Bit Mode) has no effect in 
the i860 XP microprocessor. PBM is used by the 
i860 XR microprocessor. 


• BE (Big Endian) controls the ordering of bytes 
within a data item in memory. Normally (Le.when 
BE is clear) the i860 XP microprocessor operates 
in little endian mode, in which the addressed byte 
is the low-order byte. When BE is set (big endian 


mode), the low-order three bits of all 32-bit data 
load and store addresses are complemented, 
then masked to the appropriate boundary for 
alignment. This causes the addressed byte to be 
the most significant byte. Big endian mode af- 
fects not only the memory load and store instruc- 
tions but also the Idlo, stlo, Idlnt, and seye in- 
structions. 
• OF (Overflow Flag) is set by adds, addu, subs, 


and subu when integer overflow occurs. For 
adds and subs, OF is set if the carry from bit 31 
is different than the carry from bit 30. For addu, 
OF is set if there is a carry from bit 31. For subu, 
OF is set if there is no carry from bit 31. Under all 
other conditions, it is cleared by these instruc- 
tions. OF may be changed by arithmetic instruc- 
tions in either user or supervisor mode. It may be 
changed by the st.e instruction in supervisor 
mode only. OF controls the function of the Intovr 
instruction. Inside the trap handler, OF may not 
be valid for traps other than one caused by In- 
tovr. 


• BS (bus or parity error trap in supervisor mode) is 
set by the i860 XP microprocessor when a bus or 
parity error occurs during a supervisor mode. The 
Bus Error Address Register (BEAR) and the BS 
bit have valid information only if the Bus Error pin 
(BERR) is asserted in the same clock as BRDY11 
or one clock after. In all other conditions the con- 
tents of the BEAR register and the BS bit are 
undefined. The BS bit is set when a bus error or 
parity error occurs during a supervisor mode 
memory access cycle. This is true even though 
the processor may have switched to user mode 
by the time these errors are reported. The operat- 


• 
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ing system can use this bit to decide, for exam- 
ple, whether to abort the process (user mode) or 
reboot the system (supervisor mode). In order for 
the BS bit to be set correctly, as well as the Bus 
Error Address Register to latch the correct ad- 
dress which caused the error,' the Bus Error 
(BERR) input must be asserted by external hard- 
ware either in the same clock as BRDY#, or one 
clock after. Refer to section 2.2.10 for more infor- 
mation. 


• 
01 (trap on delayed instruction) is set by the 
i860 XP microprocessor when a trap occurs on a 
delayed instruction (the instruction located after a 
delayed branch instruction). When 01 is set, the 
trap handler must restart the interrupted proce- 
dure from the branch instruction rather than at 
the address in fir. 


• TAl (trap on autoincrement instruction) is set by 
the i860 XP microprocessor when a trap occurs 
on an instruction with autoincrement. When TAl is 
set, the trap handler should undo the autoincre- 
ment (that is, restore src2 to its original value). 


• PT (trap on pipeline use) indicates to the i860 XP 
microprocessor that a trap should be generated 
and PI should be set when it executes an instruc- 
tion that uses the floating-point or graphics unit. 
Such instructions include all the instructions des- 
ignated "Floating-Point Unit" in Table 2.9, plus 
the pfld instruction. PT is set and cleared only by 
software. It can be used by the trap handler to 
avoid unnecessary saving and restoring of the 
pipelines (refer to section 2.8). When a trap due 
to PT occurs, the floating-point operation has not 
started, and the pipelines have not been ad- 
vanced. Such a trap also sets the IT bit of psr. 


infel~ 


• The behavior of PI (pipeline instruction) depends 
on the setting of PT. If PT = 0, the i860 XP mi- 
croprocessor sets PI when any pipelined instruc- 
tion or pfld is executed. If PT = 1, the processor 
sets PI and traps when it decodes any instruction 
that uses the pipes, whether scalar or pipelined. 
The PI bit is not affected by the Kill Next Floating- 
point (KNF) bit in the Processor Status Register. 
If KNF is set and the next instruction uses the 
floating-point or load pipelines, the PI bit will be 
set even though the instruction is not executed. 
Refer to section 2.8. 
• SO (strong ordering) indicates whether the proc- 
essor is in strong ordering mode (SO= 1) or weak 
ordering mode (SO=O). SO is set if the EWBE# 
pin is active (LOW) at RESET. (Refer to the para- 
graphs on write cycle reordering in section 5.) 


2.2.5 
DATA 
BREAKPOINT 
REGISTER 


The data breakpoint register (db) is used to gener- 
ate a trap when the i860 XP microprocessor access- 
es an operand at the virtual address stored in this 
register. The trap is enabled by BR and BW in psr. 
When comparing, a number of low order bits of the 
address are ignored, depending on the size of the 
operand. For example, a 16-bit access ignores the 
low-order bit of the address when comparing to db; 
a 32-bit access ignores the low-order two bits. This 
ensures that any access that overlaps the address 
contained in the register will generate a trap. The 
trap occurs before the register or memory update by 
the load or store instruction. 


LATE 
BACK -OFF 
MODE 
• 
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Figure 2.6. Directory 
Base Register 
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2.2.6 
DIRECTORY 
BASE 
REGISTER 


The directory 
base register dlrbase 
(shown in Figure 
2.5) controls 
address 
translation, 
caching, 
and bus 
options. 


• 
ATE (Address 
Translation 
Enable), when set, en- 
ables the virtual-address 
translation 
algorithm. 


• 
DPS (DRAM 
Page Size) controls 
how many bits 
to ignore 
when 
comparing 
the current 
bus-cycle 
address 
with the previous 
bus-cycle 
address 
to 
generate 
the NENE # signal. This feature 
allows 
for 
higher 
speeds 
when 
using 
static 
column 
or 
page-mode 
DRAMs 
and consecutive 
reads 
and 
writes 
access 
the 
same 
column 
or page. 
The 
comparison 
ignores the low-order 
12 + DPS bits. 


A value 
of zero 
is appropriate 
for one 
bank 
of 
255K x n RAMs, 
1 for 1M X n RAMS, etc. For 
interleaved 
memory, 
increase 
DPS 
by one 
for 
each 
power 
of interleaving-add 
one for 2-way, 


two for 4-way, etc. 


• 
When 
BL (Bus Lock) is set, external 
bus access- 
es are 
locked. 
The 
LOCK # 
signal 
is asserted 
with 
the 
next 
bus 
cycle 
(excluding 
instruction 
fetch 
and write-back 
cycles) 
whose 
internal 
bus 
request is generated 
after BL is set. It remains set 
on every subsequent 
bus cycle as long as BL re- 
mains set. The LOCK # signal 
is deasserted 
on 
the 
next 
load 
or 
store 
instruction 
after 
BL 
is 
cleared. 
Traps 
immediately 
clear 
BL. The 
lock 
and unlock 
instructions 
control 
the BL bit. The 
result of modifying 
BL with the st.c 
instruction 
is 
not defined. 


• 
ITI (Cache 
and TLB 
Invalidate), 
when 
set in the 
value "that is loaded 
into dlrbase, 
causes 
all en- 
tries 
in the instruction 
cache 
and virtual 
tags 
in 
the address-translation 
cache 
(TLB) to be invali- 
dated. Also invalidates 
all virtual tags in the data 
cache. The ITI bit does not remain set in dlrbase. 
ITI always 
appears 
as zero 
when 
reading 
dlr- 
base. 


• 
When software 
sets the LB bit, the i850 XP micro- 
processor 
enters 
two-clock 
late back-off 
mode. 


This mode gives two additional 
clock 
periods 
of 
decision 
time to the external 
logic that may need 
to use the BOFF # signal to cancel a bus cycle or 
data transfer. 
If the processor 
enters 
one-clock 
late back-off 
mode during 
RESET via configura- 
tion pin strapping, 
the LB bit has no effect, 
and it 


is 
impossible 
to 
enter 
two-clock 
late 
back-off 
mode. 
Furthermore, 
software 
cannot 
exit 
two- 
clock 
late back-off 
mode once it is activated; 
the 
LB bit cannot 
be cleared 
except 
by resetting 
the 
processor. 


• 
When 
CS8 
(Code 
Size 8-Bit) 
is set, 
instruction 
cache 
misses are processed 
as 8-bit bus cycles. 
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When .thls bit is clear, instruction 
cache 
misses 
are 
processed 
as 54-bit 
bus cycles. 
This bit can not be 
set by software; 
hardware 
sets this bit at initialization 
time. It can be cleared 
by software 
(one time only) to 
allow 
the system 
to execute 
out of 54-bit 
memory 
after 
bootstrapping 
from 
8-bit 
EPROM. 
A 
nonde- 


layed branch to code in 54-bit memory should direct- 
ly follow 
the st.c 
(store 
control 
register) 
instruction 
that clears CS8, in order to make the transition 
from 
8-bit to 54-bit memory 
occur at the correct 
time. The 
branch 
instruction 
must 
be 
aligned 
on 
a 
54-bit 


boundary. 


• 
RB (Replacement 
Block) identifies 
the cache 
line 
(block) 
to be replaced 
by cache 
replacement 
al- 


gorithms. 
RB conditions 
the cache 
flush 
instruc- 


tion flush, 
which 
is discussed 
in Section 
10. Ta- 


ble 2.3 explains 
the values of RB. 
• 


• 
RC 
(Replacement 
Control) 
controls 
cache 
re- 


placement 
algorithms. 
Table 2.4 explains 
the sig- 
nificance 
of the values 
of RC. 


• 
DTB (Directory 
Table Base) contains 
the high-or- 


der 20 bits of the physical 
address 
of the page 
directory 
when address translation 
is enabled 
(i.e. 


ATE = 1). The low-order 
12 bits of the address 
are zeros. 


Table 2.3. Values 
of RB 


Value 
Replace 
Replace 
Instruction 


TLBBlock 
and Data Cache 
Block 


00 
0 
0 
01 
1 
1 
10 
2 
2 
1 1 
3 
3 


Table 2.4. Values 
of RC 


Value 
Meaning 


00 
Selects 
the normal (random) 
replacement 
algorithm 
where any block 


in the set may be replaced 
on cache 
misses in all caches. 


01 
Instruction, 
data, and TLB cache misses 


replace the block selected 
by RB. This 


mode is used for cache and TLB testing. 


10 
Data cache misses replace the block 
selected 
by RB. Instruction 
and TLB 
caches 
use random replacement. 
This 


mode is used when flushing 
the data 
cache with the flush 
instruction. 


11 
Disables data cache replacement. 
Disables TLB replacement. 
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2.2.7 
FAULT 
INSTRUCTION 
REGISTER 


When 
a trap 
occurs, 
this register 
contains 
the ad- 
dress of the trapping 
instruction 
(not necessarily 
the 
instruction 
that created 
the conditions 
that required 
the trap). The fir is a read-only 
register. 
In single-in- 
struction 
mode, 
using a Id.e instruction 
to read the 
fir 
anytime 
except the first time after a trap saves in 
ides! the address 
of the Id.e instruction; 
in dual-in- 
struction 
mode, the address of its floating-point 
com- 
panion 
(address 
of the Id.e 
- 
4) is saved. 


2.2.8 
FLOATING-POINT 
STATUS 
REGISTER 


The floating-point 
status 
register 
(fsr) 
contains 
the 
floating-point 
trap and rounding-mode 
status for the 
current 
process. 
Figure 2.7 shows 
its format. 


• 
If FZ (Flush Zero) is clear and underflow 
occurs, 
a result-exception 
trap is generated. 
When 
FZ is 
set and underflow 
occurs, the result is set to zero, 


and no trap due to underflow 
occurs. 


• 
If TI (Trap Inexact) 
is clear, inexact 
results do not 
cause a trap. If TI is set, inexact 
results cause a 
trap. The sticky 
inexact 
flag (SI) is set whenever 
an inexact 
result 
is produced, 
regardless 
of the 
setting 
of TI. 


intel~ 


• 
RM (Rounding 
Mode) 
specifies 
one 
of the four 
rounding 
modes 
defined 
by the 
IEEE standard. 


Given a true result b that cannot 
be represented 
by the target data type, the i860 XP microproces- 
sor determines 
the two representable 
numbers 
8 
and c that most closely 
bracket b 
in value (8 < 


b < c). The i860 XP microprocessor 
then rounds 
(changes) b to 8 or c according 
to the mode se- 
lected 
by RM as defined 
in Table 2.5. Rounding 
introduces 
an error in the result that is less than 
one least-significant 
bit. 


Table 2.5. Values 
of RM 


Value 
Rounding 
Mode 
Rounding 
Action 


00 
Round to 
Closer to b of 8 or c; 


nearest or even 
if equally close, 
select even number 
(the one whose. 
least significant 
bit 
is zero). 


01 
Round down 
8 
(toward 
- 00) 


10 
Round up 
c 
(toward + 00) 


11 
Chop 
Smaller in 
(toward zero) 
magnitude 
of 8 or c. 


FLUSH ZERO--------------------, 
TRAP INEXACT-----------------~ 
ROUNDINGNODE------------------, 
UPDATE-------------,-----~ 
FLOATING-POINTTRAP ENABLE --------.:.---.:.--, 
STICkY INEXACTFLAG -------'-'-----, 
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NULTIPLlER UNDERFLOW----------, 


NULTIPLIEROVERFLOW~=========:-1 
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1 
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Figure 
2.7. Floating-Point 
Status 
Register 
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• The U-bit (Update Bit), if set in the value that is 
loaded into fsr 
by a st.c instruction, enables up- 
dating of the result-status bits (AE, AA, AI, AO, 
AU, MA, MI, MO, and MU) in the first-stage of the 
floating-point adder and multiplier pipelines. If this 
bit is clear, the result-status bits are unaffected 
by a st.c instruction; st.c ignores the correspond- 
ing bits in the value that is being loaded. An st.c 
always updates fsr bits 21..17 and 8..0 directly. 
The U-bit does not remain set; it always appears 
as zero when read. 
• The FTE (Floating-Point Trap Enable) bit, if clear, 
disables all floating-point traps (invalid input oper- 
and, overflow, underflow, and inexact result). 


• SI (Sticky Inexact) is set when the last-stage re- 
sult of either the multiplier or adder is inexact (Le. 
when either AI or MI is set). SI is "sticky" in the 
sense that it remains set until reset by software. 
AI and MI, on the other hand, can by changed by 
the subsequent floating-point instruction. 


• SE (Source Exception) is set when one of the 
source operands of a floating-point operation is 
invalid; it is cleared when all the input operands 
are valid. Invalid input operands include denor- 
mals, infinities, and all NaNs (both quiet and sig- 
naling). 
• When read from the fsr, the result-status bits MA, 
MI, MO, and MU (Multiplier Add-One, Inexact, 
Overflow, and Underflow, respectively) describe 
the last-stage result of the multiplier. 


When read from the fsr, the result-status bits AA, 
AI, AO, AU, and AE (Adder Add-One, Inexact, 
Overflow, Underflow, and Exponent, respectively) 
describe the last-stage result of the adder. The 
high-order three bits of the t t-bit exponent of the 
adder result are stored in the AE field. 
The Adder Add-One and Multiplier Add-One bits 
indicate that the absolute value of the result frac- 
tion grew by one least-significant bit due to 
rounding. AA and MA are not influenced by the 
sign of the result. 
After a floating-point operation in a given unit (ad- 
der or multiplier), the result-status bits of that unit 
are undefined until the point at which result ex- 
ceptions are reported. 
When written to the fsr with the U-bit set, the 
result-status bits are placed into the first stage of 
the adder and multiplier pipelines. When the 
processor executes pipelined operations, it prop- 
agates the result-status bits of a particular unit 
(multiplier or adder) one stage for each pipelined 
floating-point operation for that unit. When they 
reach the last stage, they replace the normal re- 
sult-status bits in the fsr and generate traps, if 
enabled. When the U-bit is not set, result-status 
bits in the word being written to the fsr are ig- 
nored. 
i 
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In a floating-point dual-operation instruction (e.g. 
add- and-multiply or subtract-and-multiply), both 
the multiplier and the adder may set exception 
bits. The result-status bits for a particular unit re- 
main set until the next operation that uses that 
unit. 


• RR (Result Register) specifies which floating- 
point register (fO-f31)was the destination register 
when a result-exception trap occurs due to a sca- 
lar operation. 
• IRP (Integer (Graphics) Pipe Result Precision), 


MRP (Multiplier Pipe Result Precision), and ARP 
(Adder Pipe Result Precision) aid in restoring 
pipeline state after a trap or process switch. Each 
defines the precision of the last-stage result in 
the corresponding pipeline. One of these bits is 
set when the result in the last stage of the corre- 
sponding pipeline is double precision; it is cleared 
if the result is single precision. 


• LRP1 and LRPO(Load Pipe Result Precision) to- 
gether define the size of the last-stage result of 
the load pipeline. They are encoded as Table 2.6 
shows. 


Table 2.6. Values of LRP1 and LRPO 


LRP1 
LRPO 
pfld Length 


0 
0 
(reserved) 
0 
1 
4 Bytes 
1 
0 
8 Bytes 
1 
1 
16 Bytes 


2.2.9 KR, KI, T, AND MERGE REGISTERS 


The KR, KI, and T registers are special-purpose reg- 
isters used by the dual-operation floating-point in- 
structions pfam, pfsm, pfmam, and pfmsm, which 
initiate both an adder operation and a multiplier op- 
eration. The KR, KI, and T registers can store values 
from one dual-operation instruction and supply them 
as input~ to subsequent dual-operation instructions. 
(Refer to Figure 2.16.) 


The MERGE register is used only by the graphics 
instructions. The purpose of the MERGE register is 
to accumulate (or merge) the results of multiple-ad- 
dition operations that use as operands the color-in- 
tensity values from pixels or distance values from a 
Z-buffer. The accumulated results can then 
be 
stored in one 64-bit operation. 


Two multiple-addition instructions and an OR in- 
struction use the MERGE register. The addition in- 
structions are designed to add interpolation values 
to each color-intensity field in an array of pixels or to 
each distance value in a Z-buffer. 
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Refer to the instruction 
descriptions 
in section 
10 for 
more information 
about these 
registers. 


2.2.10 
BUS ERROR 
ADDRESS 
REGISTER 


The 
bear 
helps 
the 
trap 
handler 
determine 
faulty 
memory 
locations. 
The 
i860 
XP 
microprocessor 
loads a valid address 
into bear 
under these 
condi- 
tions: 


• 
For bus errors, the bear 
receives 
the address 
of 
the cycle for which the BERR signal is asserted, 
if 
external 
hardware 
asserts 
BERR 
in the 
same 
clock 
as it asserts 
BRDY # or one clock 
later. 


• 
For parity errors on a read, the bear receives 
the 
address 
of the cycle 
during which 
the processor 
detects 
the 
error, 
if external 
hardware 
asserts 
PEN # with BRDY # for that cycle. 


If external 
hardware 
does not meet these conditions, 
the contents 
of the bear 
are undefined. 


A valid address 
in bear is accurate 
to 29 bits; that is, 


address 
signals 
A31-A3 
are latched 
in the high-or- 
der 29 bits of bear. 
At RESET and after every trap, 
software 
must 
read the 
bear 
before 
further 
parity 
and bus error traps can occur. 
The bear 
is a read- 
only register. 


2.2.11 
PRIVILEGED 
REGISTERS 


The registers 
pO, p1 p2, and p3 are provided 
for the 
operating 
system to use. They do not affect 
proces- 
sor operation. 
They can be accessed 
by the Id.c and 
st.c 
instructions, 
but they can be written 
only in su- 
pervisor 
mode. They may be used to store informa- 
tion such as the interrupt 
stack pointer, 
current 
user 
stack 
pointer 
at the beginning 
of the trap 
handler, 


register 
values 
during trap handling, 
processor 
ID in 
a multiprocessor 
system, 
or for any other purpose. 


2.2.12 
CONCURRENCY 
CONTROL 
REGISTER 


The concurrency 
control 
register 
(ccr) 
controls 
the 
operation 
of the internal 
Concurrency 
Control 
Unit 
(CCU), which 
is described 
in section 
2.5. The 
ccr 
can be written 
in supervisor 
mode only, but can be 
read in user or supervisor 
mode. 
Figure 
2.8 shows 
the format 
of the ccr. 


DO (Detached 
Only) bit and CO (CCU On) bit togeth- 
er specify the CCU configuration. 
DO, when set, indi- 
cates that there 
is no external 
CCU. CO (CCU On) 
bit, when set, indicates 
that the Concurrency 
Control 
Architecture 
is enabled. 
Table 
2.7 summarizes 
the 
modes 
defined 
by CO and 
DO bits. The 
reserved 
combinations 
should 
not be used by software. 


If the DCCU is on (CO = DO = 1), the processor 
in- 
tercepts 
and interprets 
all memory 
loads and stores 
which 
are to the CCU address 
space, 
which 
is the 
two pages defined 
by CCUBASE. 
Loads and stores 
to that address 
range do not go to memory, 
but to 
the DCCU. 


Table 
2.7. Values 
of CO and DO 


CO 
DO 
Mode 


0 
0 
External CCU, or no CCU 
0 
1 
reserved 
1 
0 
reserved 
1 
1 
Internal CCU (DCCU) only 


CCUBASE 
is the virtual address 
of the memory 
area 
into which the CCU registers 
are mapped. 
Software 
must set bit 12 to zero, because the CCUBASE 
must 
be aligned on a two page (8 Kbyte) boundary. 
This is 
because 
an external 
CCU contains 
supervisor 
regis- 
ters mapped 
to the second 
page. 


DETACHED 
ONLY 
--------, 


CCU 
ON ---------, 
~ ! 


1502928272625242522212019181716151-1152111098765452 
1 
0 
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0 :::::::::::f~::::::::::::::::::::I:::f:t:::::: g 8 :::f:::::! 
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Figure 
2.8. Concurrency 
Control 
Register 
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2.2.13 NEWCURR REGISTER 


The NEWCURR register is part of the detached CCU 
(concurrency control unit). It a 32-bit counter that 
supplies an iteration count for loop execution. (Refer 
to section 2.5.) 


NEWCURR is architecturally a 64-bit register, but 
only the low-order 32 bits are provided in this imple- 
mentation. 
Compiler and operating-system 
data 
structures should provide for a 64-bit size for future 
implementation. 


2.2.14 STAT REGISTER 


The STAT register is part of the detached CCU (con- 
currency control unit). As Figure 2.9 shows, it con- 
tains the following bits: 
InLoop 
Indicates that the processor is currently 
executing a concurrent loop. This bit is 
set when a processor starts a concur- 
rent, non-nested loop, and it is cleared 
when the processor enters serial code 
when not nested or idle. It can also be 
read or written directly. 
Nested 
Indicates whether the processor is in the 
nested state. InLoop is copied into this 
bit when starting a nested loop. Other- 
wise, it can be read or written directly. 
Detached 
Always contains the value of ccr bit DO. 


STAT is architecturally a 64-bit register. Compiler 
and operating-system data structures should provide 
for a 64-bit size for future implementation. 


2.3 Addressing 


Memory is addressed in byte units with a paged vir- 
tual-address space of 232 bytes. Data and instruc- 
tions can be located anywhere in this address 
space. Address arithmetic uses 32-bit input values 
and produces 32-bit results. The low-order 32 bits of 
the result are used in case of overflow. 
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Normally, multibyte data values are stored in memo- 
ry in little endian format, i.e. with the least significant 
byte at the lowest memory address. As an option, 
the ordering can be dynamically selected by soft- 
ware in supervisor mode. The i860 XP microproces- 
sor also offers big endian mode, in which the most 
significant byte of a data item is at the lowest ad- 
dress. Figure 2.10 defines by example how data is 
transferred from memory over the bus into a register 
in both modes. Big endian and little endian data ar- 
eas should not be mixed within a 64-bit data word. 
Illustrations of data structures in this data sheet 
show data stored in little endian mode, i.e. the right- 
most (Iow-order) byte is at the lowest memory ad- 
dress. 


Code accesses are always done with little endian 
addressing. This implies that instructions appear dif- 
ferently than documented here when accessed as 
big endian data. Intel Corporation recommends that 
disassemblers running in a big endian system con- 
vert instructions that have been read as data back to 
little endian form and present them in the format 
documented here. 


Page directories and page tables are also accessed 
in little endian mode, regardless of the value of the 
BE bit. 


Big endian mode affects not only the memory load 
and store instructions but also the Idlo, stlo, Idlnt, 
and scyc instructions. 


Alignment requirements are as follows (any violation 
results in a data-access trap): 
• 128-bit values are aligned on 16-byte boundaries 
when referenced in memory (i.e. the four least 
significant address bits must be zero). 
• 64-bit values are aligned on 8-byte boundaries 
when referenced in memory (i.e. the three least 
significant address bits must be zero). 
• 32-bit values are aligned on 4-byte boundaries 
when referenced in memory (i.e. the two least 
significant address bits must be zero). 


;$1.$029282726.25.2420$2221201918171615141.$1211109876540$2 
1 
0 
I:f:::i:::":::~:]i,:::i:::f~::i::g:~:::i:::i:::f:::!:::M~:::i:::f::::~:::!:::~:~::!:::i:::f:::@:"IMt:::il 


YfIj 


InLoop 
-----------~ 
Nested 
------------, 


Detached -----------.d 


[j RESERVED BY INTEL CORPORATION 
m READ ONLY 
240874-11 


Figure 2.9. Concurrency Status Register 
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d63 
dO 
IHGFEDCBAI 
~ 
----- 


LITILE ENDIAN 
BIG ENDIAN 
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Byt. Enabl•• 
Byt. Enabl•• 
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Dala Bus 
r16 
A••• rted 
Dala Bu. 
r16 


(BEnjlO) 
d63 
dO 
d31 dO 
(BEnjlO) 
'd63 
dO 
d31 dO 


Id.b 0(,0),,16 
0 0[] 


7 Q[] 


Id.b 1(,0),,16 
1 
6 
Id.b 2(,0),,16 
2 
5 
Id.b 3(,0), r16 
3 
4 
Id.b 4(,0), r16 
4 
3 
Id.b 5(,0), r16 
5 
2 
Id.b 6(,O),r16 
6 
1 
Id.b 7(,O),r16 
7 
0 


d63 
dO 
d31 dO 
d63 
dO 
d31 dO 


Id•• O(rO), r16 
1:0 0n 


7:6 
lS:J []] 


Id.s 2(rO), r16 
3:2 
DC 
5:4 
FE 
FE 
Id.s 4(rO), r16 
5:4 
FE 
FE 
3:2 
DC 
DC 
Id•• 6(rO), r16 
7:6 
HG 
HG 
1:0 
BA 
BA 


d63 
dO 
d31 dO 
d63 
dO 
d31 dO 


Id.1 O(rO)" 
16 
3:0 
r;;:j IDCBAI 
7:4 
E;] 
IHGFEI 
Id.l 4(rO), r16 
7:4 
HGFE 
HGFE 
3:0 
DCBA 
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NOTE: 
64- and 128-bit 
big endian 
acce.sses 
are treated 
the same as little endian 
accesses 


Figure 2.10. Little and Big Endlan Memory Transfers 


• 16-bit values are aligned on 2-byte boundaries 
when referenced in memory (i.e. the least signifi- 
cant address bit must be zero). 


2.4 Virtual Addressing 


When address translation is enabled, the processor 
maps instruction and data virtual addresses into 
physical addresses before referencing memory. This 
address transformation is compatible with that of the 
Intel386 and Intel486 microprocessors and imple- 
ments the basic features needed for page-oriented 
virtual-memory systems and page-level protection. 


The address translation is optional. Address transla- 
tion is disabled when the processor is reset. It is 
enabled when a store (st.e) to dlrbase sets the ATE 
bit. The operating system typically does this during 
software initialization. Address translation is dis- 
abled again when st.e clears the ATE bit. The ATE 
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bit must be set if the operating system is to imple- 
ment page-oriented protection or page-oriented vir- 
tual memory. 


2.4.1 PAGE FRAME 


A page frame is a unit of contiguous addresses of 
physical main memory. A page is the collection of 
data that occupies a page frame when that data is 
present in main memory or occupies some location 
in secondary storage when there is not sufficient 
space in main memory. 


The i860 XP microprocessor architecture supports 
two sizes of pages and page frames: four Mbytes 
and four Kbytes. Four Kbyte page frames begin on 
four Kbyte boundaries and are fixed in size. Four 
Mbyte page frames begin on four Mbyte boundaries 
and are fixed in size. The four Kbyte address trans- 
formation is compatible with that of the Intel 486 mi- 
croprocessor. 
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2.4.2 VIRTUAL ADDRESS 


A virtual address refers indirectly to a physical ad- 
dress by specifying a page and an offset within that 
page. Figure 2.11 shows the formats of virtual ad- 
dressess. The format for virtual addresses that refer 
to four Mbyte pages is different from that of four 
Kbyte pages. 


Figure 2.12 shows how the i860 XP microprocessor 
converts a virtual address into the physical address 
by consulting page tables. The addressing mecha- 
nism uses the DIR field as an index into a page di- 
rectory. For 4K pages, it uses the PAGE field as an 
index into the page table determined by the page 
directory and uses the' OFFSET field to address a 
byte within the page determined by the page table. 
For 4M pages, the page directory entry determines 
the page address, and the OFFSET field addresses 
a byte within that page table. 


1860TM XP MICROPROCESSOR 


2.4.3 PAGE TABLES 


A page table is simply an array of 32-bit page specifi- 
ers. A page table is itself a page, and contains 
4 Kbytes of data or at most 1K 32-bit entries. 


At the highest level is a page directory. The page 
directory holds up to 1K entries that address either 
page tables of the second level or 4-Mbyte pages. 


A page table of the second level addresses up to 1K 
4-Kbyte pages. All the tables addressed by one 
page directory, therefore, can address 1M 4-Kbyte 
pages. 


Whether 4-Mbyte pages, 4·Kbyte pages, or some 
combination of the two are used, one page directory 
can cover the entire four gigabyte physical address 
space of the i860 XP microprocessor (1K page di- 
rectory entries x 4M page or 1K page directory en- 
tries x 1K page table entries x 4K page). 


FORMAT 
FOR 
4 KBYTE 
PAGE 


'J1.RJ2112827262524 
2522 '2120111 18171615 14 15 12 111098765452 
1 0 
I 
DlR 
PAGE 
OFFSET 


FORMAT 
FOR 
4 MBYTE 
PAGE 


'J1.RJ21128 272625242522 
~M~~"~~14~Ul1mIl8765452 
10 
I 
DIR 
OFFSET 
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Figure 2.11. Formats of Virtual Addresses 
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....• 
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•......• 
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01RI 
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Figure 2.12. Address Translation 
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The physical address of the current page directory 
is 
stored in the DTB field of the dlrbase 
register. 
Mem- 
ory management 
software 
has the option 
of using 
one page directory 
for all processes, 
one page direc- 
tory for each 
process, 
or some 
combination 
of the 
two. 


2.4.4.1 
Page .Frame Address 


The page frame address 
specifies 
the physical 
start- 
ing address 
of a page. In a page directory, 
the page 
frame address 
is either the address 
of a page table 
or the address 
of the four 
Mbyte 
page frame 
that 
contains 
the desired 
memory 
operand. 
In a second- 
level page table, the page frame 
address 
is the ad- 
dress 
of the 4-Kbyte 
page frame 
that 
contains 
the 
desired 
memory 
operand. 
2.4.4 
PAGE-TABLE 
ENTRIES 


Page-table 
entries 
(PTEs) have one of the formats 
shown 
by Figure 2..13. 


PAGE 
DIR 
'JIJ0211282726252-12J22 
f 
,{Hllr8 
7 
6151-1 
JUf110 
11 8 765 
-I 
J 2 
1 
0 
~~~~~;'I 
PAGE rR3Al~~2~DDRESS !~::~li:,,::~I~: 
:~l! 
:g: AVAIL 
:,,:11 
D A 
~ 
~ 
U 
W 
P 


PAGE\:::::::::::::::~~~jt~~~jt~:::l~~~t!:t~~ 


PAGE 
DIR 
ENTRY 
~ KBYTE 
PAGE 


PRESENT 
WRITABLE 
USER 
ACCESSED 
PAGE SIZE (0 
INDICATES 
4 KBYTE) 
AVAILABLE 
FOR SYSTE~S 
PROGRA~~ER 
USE 
i 
, 


'JIJ0211282726252-1 
2J22 
2120 
1111817 16 15 1-1IJ 
12 1110 
11 876 
5 
-I 
J2 
1 
0 
I 
PAGE FRA~E 
ADDRESS 
31..12 
I AVAIL 11 
0 :!IA :"If,~U W P 
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PRESENT --------------::---------, 
WRITABLE ----------------,---:--:-----, 
USER----------------------, 


WRITE-THROUGH 
------------------, 
CACHE-DISABLE 
--------~-------__, 
ACCESSED -----------------__, 
DIRTY --------------------, 
PAGE SIZE(l 
INDICATES 
4~BYTE)---------., 


AVAILABLE 
FOR SYSTE~S 
PROGRA~~ER 
USE -----'1 
! 
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PAGE 
TABLE 
ENTRY 
4 KBYTE 
ONLY 


PRESENT 


I 


WRITABLE 
USER 
WRITE-THROUGH 
CACHE-DISABLE 
ACCESSED 
DIRTY 
1 


AVAILABLE 
FOR SYSTE~S 
PROGRA~~ER 
USE 
i 


'JIJO 211282726252-1 
2J 22 2120111181716151-1 
IJ 
12 1110 
11 87 
6 5 -I 
J 
2 
1 
0 
I 
PAGE rRA~E 
ADDRESS 31.. 
12 
AVAIL ~6i- D A 
C W U W P~ 
::::: ::::: 
D 
T 


[IRESERVED BY INTEL CORPORATION (SHOULD 
BE ZERO) 
240874-17 
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Figure 2.13. Formats 
of Page Table Entries 


2.4.4.2 Present Bit 


The P (present) bit indicates whether a page table 
entry can be used in address translation. P= 1 indi- 
cates that the entry can be used. When P= 0 in ei- 
ther level of page tables, the entry is not valid for 
address translation, and the rest of the entry is avail- 
able for software use; none of the other bits in the 
entry is tested by the hardware. If P=0 in either lev- 
el of page tables when an attempt is made to use a 
page-table entry for address translation, the proces- 
sor signals either a data-access fault or an instruc- 
tion-access fault. In software systems that support 
paged virtual memory, the trap handler can bring the 
required page into physical memory. 


Note that there is no P bit for the page directory 
itself. The page directory may be not-present while 
the associated process is suspended, but the oper- 
ating system must ensure that the page directory 
indicated by the dirbase image associated with the 
process is present in physical memory before the 
process is dispatched. 


2.4.4.3 Wrltable and User Bits 


The W (writable) and U (user) bits are used for page- 
level protection, which the i860 XP microprocessor 
performs at the same time as address translation. 
The concept of privilege for pages is implemented 
by assigning each page to one of two levels: 
Supervisor level 
For the 
operating 
system 
(U= 0) 
and other systems software 
and related data. 
User level (U= 1) 
For applications procedures 
and data. 


The U bit of the psr indicates whether the i860 XP 
microprocessor is executing at user or supervisor 
level. The i860 XP microprocessor maintains the 
U bit of psr as follows: 


• The i860 XP microprocessor clears the psr U bit 
to indicate supervisor level when a trap occurs 
(including when the trap 
instruction causes the 
trap). The prior value of U is copied into PU. 
• The i860 XP microprocessor copies the psr 
PU bit into the U bit when an indirect branch is 
executed and one of the trap bits is set. If PU was 
one, the i860 XP microprocessor enters user lev- 
el. 


With the U bit of psr and the Wand 
U bits of the 
page table entries, the i860 XP microprocessor im- 
plements the following protection rules: 
• When at user level, a read or write of a supervi- 
sor-level page causes a trap. 
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• When at user level, a write to a page whose W bit 
is not set causes.a trap. 


• When at user level, a store (st.c) to certain con- 
trol 'registers is ignored. 


• When at user level, privileged instructions (Idlo, 
stlo, scyc, Idlnt) have no effect. 


When the i860 XP microprocessor is executing at 
supervisor level, all pages are addressable, but, 
when it is executing at user level, only pages that 
belong to the user level are addressable. 


When the i860 XP microprocessor is executing at 
supervisor level, all pages are readable. Whether a 
page is writable depends upon the write-protection 
mode controlled by WP of epsr: 
WP=0 
All pages are writable. 
WP= 1 A write to page whose W bit is not set 
causes a trap. 


When the i860 XP microprocessor is executing at 
user level, only pages that belong to user level and 
are marked writable are actually writable; pages that 
belong to supervisor level are neither readable nor 
writable from user level. 


2.4.4.4 Write-Through 
Bit 


The i860 XP microprocessor implement both write- 
back and write-through caching policies for the on- 
chip instruction and data caches. If WT is set, the 
write-through policy is applied to data from the cor- 
responding page. If WT is clear, the normal write- 
back policy is applied to data from the page. 


For four-Mbyte pages, the WT bit of the page direc- 
tory entry is used. For four-Kbyte pages, only the WT 
bit of the second-level page table entry is used; the 
WT bit of the page directory entry is not referenced 
by the processor, but is reserved. 


The value of the WT bit is driven externally on the 
PWT pin, so that external caches can employ the 
same policy used internally. 


2.4.4.5 Cache Disable Bit 


If a page's CD (cache disable) bit is set, data from 
the page is not placed in the internal instruction or 
data caches (regardless of the value of the WT bit). 
Clearing CD permits the processor to place data 
from the associated page into internal caches. 


For four-Mbyte pages, the CD bit of the page direc- 
tory entry is used. For four-Kbyte pages, only the CD 
bit of.the second-level page table entry is used; the 
CD bit of the page directory entry is not referenced 
by the processor, but is reserved. 
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The value 
of the CD bit is driven 
externally 
on the 
PCD pin, so that 
cacheability 
can be the same 
in 
both internal 
and external 
caches. 


2.4.4.6 
Accessed 
and Dirty Bits 


The 
A (accessed) 
and 
D (dirty) 
bits 
provide 
data 
about page usage in both levels of the page tables. 


The i860 XP microprocessor 
sets the A-bit before a 
read 
or write 
operation 
to a page. 
For four-Kbyte 
pages, it sets the A-bit of both levels of page tables. 


The processor 
tests the dirty bit before a write, and, 
under 
certain 
conditions, 
causes 
traps. 
The 
trap 
handler 
then has the opportunity 
to maintain 
appro- 
priate values 
in the dirty bits. For four-M byte pages, 
the D bit of the page directory 
entry is used. For four- 
Kbyte pages, only the D bit of the second-level 
page 
table 
entry 
is used; the D bit of the page directory 
entry is not referenced 
by the processor, 
but is re- 
served. 
The precise 
algorithm 
for using these bits is 
specified 
in section 
2.4.5. 


An 
operating 
system 
that 
supports 
paged 
virtual 
memory 
can use the D and A bits to determine 
what 
pages to eliminate 
from physical 
memory 
when the 
demand 
for memory 
exceeds 
the physical 
memory 
available. 
The D and A bits are normally 
initialized 
to 
zero by the 
operating 
system. 
The 
processor 
sets 
the A bit when a page is accessed 
either by a read 
or write operation. 
When a data-access 
fault occurs, 
the trap handler sets the D bit if an allowable 
write is 
being performed, 
then reexecutes 
the instruction. 


The operating 
system is responsible 
for coordinating 
its updates 
to the accessed 
and dirty bits with up- 
dates by the CPU and by other processors 
that may 
share the page tables. 
The i860 XP microprocessor 
automatically 
asserts 
the LOCK# 
signal while test- 
ing and setting 
the A bit. 


2.4.4.7 
Page Tables 
for Trap Handlers 


When 
paging 
is enabled 
(ATE = 1), software 
that 
creates 
page tables and directories 
must assure that 
A = 1 always 
in the PTEs and PDEs for the code 
pages 
of the trap 
handler 
and the first 
data 
page 
accessed 
by 
the 
handler. 
Preallocation 
of 
these 
pages is required 
in case a trap occurs during a lock 
sequence. 
Otherwise, 
recursive 
traps would be gen- 
erated, 
as the 
A-bit 
would 
need 
to be set by the 
translation 
hardware, 
which is a trapping 
situation 
in 
itself. 
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2.4.4.8 
Combining 
Protection 
of Both Levels of 
Page Tables 


For any four-Kbyte 
page, the protection 
attributes 
of 


its page directory 
entry may differ from those 
of its 
page table entry. The i860 XP microprocessor 
com- 
putes the effective 
protection 
attributes 
for a page 
by examining 
the 
protection 
attributes 
in both 
the 
directory 
and the page table and choosing 
the more 
restrictive 
of the two. 


2.4.5 
ADDRESS 
TRANSLATION 
ALGORITHM 


The 
following 
algorithm 
defines 
the 
translation 
of 
each virtual address 
to a physical 
address. 
Let DIR, 


PAGE, and OFFSET 
be the fields 
of the virtual 
ad- 


dress; 
let PFA1 and PFA2 
be the page 
frame 
ad- 


dress fields of the first and second 
level page tables 


respectively; 
DTB is the page directory 
table 
base 
address 
stored 
in the dlrbase 
register. 


1. Read 
the 
PDE 
(Page 
Directory 
Entry) 
at the 


physical 
address 
formed 
by DTB:DIR:OO. 


2. If P in the PDE is zero, generate 
a data: 
or in- 


struction-access 
fault. 


3. If W in the PDE is zero, the operation 
is write, 


and either the U bit of the PSR is set or WP = 1, 
generate 
a data-access 
fault. 


4. If the U bit in the PDE is zero and U bit in the psr 
is set, 
generate 
a data- 
or instruction-access 


fault. 


5. If A in the PDE is zero and the TLB 
miss oc- 


curred 
inside 
a locked 
sequence, 
generate 
a 
data or instruction 
access fault. (The trap allows 
software 
to set A to one 
and 
restart 
the 
se- 
quence. This helps external 
bus hardware 
deter- 


mine unambiguously 
what address 
corresponds 


to a locked 
semaphore.) 


6. 
If bit 7 of the PDE is one (four Mbyte page), and 
the operation 
is write, and D = 0 in the PDE, 


generate 
a data-access 
fault. 


7. If A = 1 in the PDE, continue 
at step 11. Other- 
wise, assert 
LOCK # . 


8. Perform the PDE read as in step 1 and the P, W 
and U bit checks 
as in steps 2 through 4. 


9. Write the PDE with A bit set. 


10. Deassert 
LOCK#. 


11. 
If bit 7 of the PDE is one (four Mbyte page), form 
the physical address as PFA 1:OFFSET, 
and exit 
address translation. 
In this case, PFA1 is 10 bits 
and OFFSET 
is 22 bits. 


12. The remaining 'steps are for four Kbyte pages. If 


the A-bit in the PDE was zero before translation 
began, assert 
LOCK # . 
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13. Fetch the PTE at the physical address formed 
by PFA1:PAGE:00. 
14. Perform the P-, W-, U-, and A-bit checks as in 
steps 2 through 5 with the second-level PTE. If 
A = zero in the PTE, and the TLB miss oc- 
curred inside a locked sequence, generate a 
data or instruction access fault. LOCK11 re- 
mains active. 


15. If the operation is write, and D in the PTE is 
zero, generate a data access fault. 


16. If the A-bit in the PDE was already active before 
translation began, and the A-bit in the PTE is 
alre~dy active, go to step 20. 


17. If LOCKII is not already active, assert it and 
refetch the PTE. 


18. Perform the U-, W-, and P-bit checks and A-bit 
setting in the PTE as in steps 8 through 9. Do 
the locked write update of the PTEto unlock the 
bus, even if the A-bit in.the PTE is already one. 
19. Deassert LOCKII. 
20. Form the physical address as PFA2:0FFSET. In 
this case, PFA2 is 20 bits and OFFSET is 12 
bits. 


During translation, the i860 XP microprocessor looks 
only in external memory for page directories and 
page tables. The data cache is not searched. There- 
fore, any code that modifies page directories or 
page tables must keep them out of the cache. The 
tables should either be kept in noncacheable memo- 
ry or in write-through pages or should be flushed 
from the cache. 


The i860 XP microprocessor expects page directo- 
ries and page tables to be in little endian format. The 
operating system must maintain these tables in little 
endian format either by setting BE to zero when ma- 
nipulating the tables or by complementing bit two of 
the 32-bit address when loading or storing e,ntries. 


2.4.6 ADDRESS TRANSLATION FAULTS 


The address translation fault can be signalled as ei- 
ther an instruction access fault or a data-access 
fault. The instruction causing the fault can be reexe- 
cuted upon returning from the trap handler. 


2.5 Detached CCU 


The i860 XP microprocessor supports parallel pro- 
cessing, where multiple processors work simulta- 
neously on different parts of the same problem. The 
Concurrency Control Unit (CCU) controls work shar- 
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ing among CPUs, in multiprocessor systems. The 
CCU is a VLSI chip that allows multiple processors 
to work together to execute portions of a single pro- 
gram in parallel. The CCU performs the iteration as- 
signment for loop parallelization. Accesses to the 
CCU for synchronization are much faster than ac- 
cesses to shared memory semaphores. The CCU is 
memory mapped, and its internal registers are ac- 
cessed via memory load and store operations. 


To take advantage of the parallel architecture, soft- 
ware must be compiled by parallelizing compilers 
that generate instructions to access the CCU. How- 
ever, such instructions cannot run on a system that 
does not include a CCU. To allow an application 
compiled for parallel execution to run on any system 
based on the i860 XP microprocessor, a "Detached 
Only" CCU (DCCU, also referred to as "internal 
CCU") is implemented in the i860 XP microproces- 
sor. The DCCU is a compatible subset of the exter- 
nal CCU, consisting of the minimal set of features 
required for a single CPU. The DCCU alone neither 
increases performance nor concurrency, but does 
allow software designed for parallel processing to 
run unmodified on a single CPU. 


2.5.1 DCCU INITIALIZATION 


After reset, the i860 XP microprocessor DCCUis dis- 
abled (CO and DO bits in ccr are cleared). To en- 
able the DCCU,the CO and DO bits in ccr 
must be 
set by softWare. Before turning on the CCU, the op- 
erating system must invalidate the TLB and flush the 
data cache to make sure that they do not contain 
data from the CCU pages. The TLB is invalidated by 
setting ITI = 1 in the dlrbase 
register. Also, the 
flush instruction must be used once per each line of 
the data cache to invalidate the physical address of 
the cache entry, if the two pages at the CCUBASE 
address may have been cached. The flush is un- 
needed if page tables or external hardware have 
prohibited caching of the CCUBASE pages. 


Neither the external CCU nor the DCCU can be ac- 
cessed within four instructions after ccr is modified. 


2.5.2 DCCU ADDRESSING 


The CCU facilities are memory-mapped, manipulat- 
ed by normal load and store instructions. The DCCU 
is memory-mapped to a single 4 Kbyte user page. 
When the DCCU is active, all accesses to this page 
are satisfied by the DCCU,and no external bus cycle 
is generated. The address space of two adjacent 
pages beginning on an 8 Kbyte boundary is reserved 
for the CCU. The first (lower address) page contains 
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locations accessible in user mode (which includes 
the DCCU registers), and the second page contains 
locations accessible in supervisor mode (used for 
external CCU only). The base address of these 
pages is specified by the CCUBASE field in ccr. Ac- 
cesses to the second page in DCCU-only mode 
have no effect on the DCCU, and are treated as 
normal memory accesses. 


When the DCCU is active, accesses to its address 
page use only the virtual address, and no translation 
is done on the DCCU access. However, the access- 
es to an external CCU go through normal address 
translation. The operating system should make sure 
that the page table entries for the CCU pages are 
set so that no fault occurs during address transla- 
tion. If an external CCU is used, the two PTEs for the 
CCU should have CD = 1 (caching disabled) and 
page frame addresses that match the external hard- 
ware addresses of the CCU. Accesses to the DCCU 
that cause a TLB miss do not cause the PTE to be 
loaded into the TLB. 


If the external CCU is used when address translation 
is disabled (ATE=0), external hardware must deac- 
tivate KEN# for such accesses, to avoid caching 
external CCU accesses. 


2.5.3 DCCU INTERNALS 


The DCCU consists of an address decoder, a 32-bit 
counter (NEWCURR), and three bits of state infor- 
mation (InLoop, Nested, and Detached). InLoop, 
Nested and Detached correspond to bits 0, 1, and 2 
respectively of the external CCU STAT register. The 
Detached bit always reflects the value of the DO bit 
in ccr. 


Several addresses within the DCCU memory page 
are decoded to cause actions to NEWCURR, In- 
Loop, and Nested state bits. The CCU register to be 
accessed is specified by address bits 11-3. The val- 
id CCU addresses are shown in Table 2.8 with their 
mnemonics. Accesses to these address may also 
have side effects within the DCCU. Refer to the 
i86(}TM Microprocessor 
Family Programmer's 
Refer- 
ence 
Manual 
for programming information. Loads 
from any other addresses within the DCCU memory 
page return zero; stores to any other addresses 
have no effect. Access to the DCCU by any load or 
store instructions other than Id.x and st.x produce 
undefined results. 


Assemblers should encode address bits 2-0 as zero 
for accesses in little-endian mode. However, in big- 
endian mode (epsr BE bit = 1), DCCU accesses 
should have address bit 2 active. Thus, software for 
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big-endian access to the DCCU must differ from lit- 
tle-endian software. That allows an external CCU to 
be accessed in both big and little endian modes. 


When reading from the DCCU,the access latency is 
the same as reading data from the data cache-the 
data is ready for use as a source by the second 
instruction after the load. The first instruction after 
the load may use the data, but that instruction will 
experience a one-clock freeze before the data be- 
comes available. 


2.6 Instruction Set 


Table 2.9 shows the complete set of instructions for 
the i860 XP microprocessor, grouped by function 
within processing unit. Refer to Section 10 for an 
algorithmic definition of each instruction. The in- 
struction set of the i860 XP microprocessor is fully 
upward compatible with that of the i860 XR micro- 
processor, extended in a few ways to better serve 
certain application domains. User-level software ap- 
plications written for the i860 XR microprocessor will 
run unmodified on the i860 XP microprocessor, but 
some supervisor code (for example, trap handlers) 
may need minor modifications. The i860 XR micro- 
processor instruction set has been extended with 
the following instructions: 


• Idlo, stlo: 1/0 load and store instructions 
• Idlnt: Load interrupt instruction to perform an in- 
terrupt acknowledge cycle and read the interrupt 
vector. Used to emulate the Intel 486 interrupt 
acknowledge sequence. 


• scyc: A special-cycle instruction, used to gener- 
ate bus cycles that signal invalidation and syn- 
chronization of an external cache. 
• pfld.q: A pipelined, floating-point load of 128 bits. 


Table 2.8. CCU Addresses 


Little 
Big 
Mnemonic 
A11-A8 
A7-A4 
Endian 
Endian 
A3-AD 
A3-AD 


cbr-," 
0000 
Oabc 
bOOO 
d100 
cget 
1111 
0110 
0000 
0100 
cnewcurr 
1111 
1100 
0000 
0100 
cstat 
1111 
1100 
1000 
1100 
cstatci 
1111 
1101 
0000 
0100 
cstatn 
1111 
1101 
1000 
1100 
cclm 
1111 
1110 
1000 
1100 
cver 
1111 
1111 
1000 
1100 


NOTE: 
Variable 
I is a 4-bit 
index formed 
by A6-A3. 
Let its binary 
form be represented 
by the symbols 
abed. 
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Table 
2.9. Instruction 
Set (1 of 2) 


Core Unit 


Mnemonic 
Description 


Load and Store 
Instructions 


Id.x 
Load integer 
st.x 
Store integer 
fld.y 
F·Pload 
fst.y 
F-P store 
pfld.y 
Pipelined 
F-P load 
pst.d 
Pixel store 


Register 
to Register 
Move 


ixfr 
Transfer 
integer to F-P register 


Integer 
Arithmetic 
Instructions 


addu 
Add unsigned 
adds 
Add signed 
subu 
Subtract 
unsigned 
subs 
Subtract 
signed 


Shift 
Instructions 


shl 
Shift left 
shr 
Shift right 
shra 
Shift right arithmetic 
shrd 
Shift right double 


Logical 
Instructions 


and 
Logical AND 
andh 
Logical AND high 
andnot 
Logical AND NOT 
andnoth 
Logical AND NOT high 
or 
Logical OR 
- 


orh 
Logical OR high 
xor 
Logical exclusive 
OR 
xorh 
Logical exclusive 
OR high 


Control-Transfer 
Instructions 


br 
Branch direct 
bri 
Branch indirect 
be 
Branch on CC 
bc.t 
Branch on CC taken 
bnc 
Branch on not CC 
bnc.t 
Branch on not CC taken 
bte 
Branch if equal 
btne 
Branch if not equal 
bla 
Branch on LCC and add 
call 
Subroutine 
call 
calli 
Indirect subroutine 
call 


intovr 
Software 
trap on integer overflow 
trap 
Software 
trap 


Floating-Point 
Unit 


Mnemonic 
Description 


Register 
to Register 
Move 


fxfr 
Transfer 
F-P to integer register 


F-P Multiplier 
Instructions 


fmul.p 
F-P multiply 
pfmul.p 
Pipelined 
F-P multiply 
pfmul3.dd 
3-Stage pipe lined F-P multiply 


fmlow.p 
F-P multiply low 


frcp.p 
F-P reciprocal 


fsqr.p 
F-P reciprocal 
square root 


F-P Adder 
Instructions 


fadd.p 
F-P add 
pfadd.p 
Pipelined 
F-P add 


famov.r 
F-P adder move 
pfamov.r 
Pipelined 
F-P adder move 


fsub.p 
F-P subtract 
pfsub.p 
Pipelined 
F-P subtract 
pfgt.p 
Pipelined greater-than 
compare 
pfeq.p 
Pipelined equal compare 
fix.v 
F-P to integer conversion 
pfix.v 
Pipelined 
F-P to integer conversion 
ftrunc.v 
F-P to integer truncation 


Dual-Operation 
Instructions 


pfam.p 
Pipelined 
F-P add and multiply 
pfsm.p 
Pipelined 
F-P subtract 
and multiply 
pfmam.p 
Pipelined 
F-P multiply with add 
pfmsm.p 
Pipelined 
F-P multiply with subtract 


Long 
Integer 
Instructions 


fisub.z 
Long-integer 
subtract 
pfisub.z 
Pipelined 
long-integer 
subtract 
fiadd.z 
Long-integer 
add 
pfiadd.z 
Pipe lined long-integer 
add 


Graphics 
Instructions 


fzchks 
16-bit Z-buffer 
check 
pfzchds 
Pipelined 
16-bit Z-buffer 
check 
fzchkl 
32-bit Z-buffer 
check 
pfzchkl 
Pipelined 32-bit Z-buffer 
check 
faddp 
Add with pixel merge 
pfaddp 
Pipelined add with pixel merge 
faddz 
Add with Z merge 
pfaddz 
Pipelined add with Z merge 
form 
OR with MERGE register 
pform 
Pipelined OR with MERGE register 


• 
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Table 2 9 Instruction Set (2 of 2) 
.. 


Core Unit 


Mnemonic 
Description 


1/0 Instructions 


Idio.x 
Load 1/0 
stio.x 
Store 1/0 
Idint.x 
Load interrupt vector 


System Control Instructions 


flush 
Cache flush 
Id.c 
Load from control register 
st.c 
Store to control register 
lock 
Begin interlocked sequence 
unlock 
End interlocked sequence 
scyc.x 
Special bus cycles 


Assembler Pseudo-Operations 


Register to Register Move 


mov 
Integer move 
fmov.r 
F-P reg-reg move 
pfmov.r 
Pipslined F-P reg-reg move 
nop 
Core no-operation 
fnop 
F-P no-operation 
pfle.p 
Pipelined F-P less-than or equal 


The architecture of the i860 XP microprocessor uses 
parallelism to increase the rate at which operations 
may be introduced into the unit. Parallelism in the 
i860 XP microprocessor is not transparent; rather, 
programmers have complete control over parallel- 
ism and therefore can achieve maximum perform- 
ance for a variety of computational problems. 


2.6.1 PIPELlNED AND SCALAR OPERATIONS 


One type of parallelism used within the floating-point 
unit is "pipelining". The pipelined architecture treats 
each operation as a series of more primitive opera- 
tions (called "stages") that can be executed in par- 
allel. Consider just the floating-point adder as an ex- 
ample. Let A represent the operation of the adder. 
Let the stages be represented by A1' A2' and A3. 
The stages are designed such that Ai + 1for one ad- 
der instruction can execute in parallel with Ai for the 
next adder instruction. Furthermore, each Ai can be 
executed in just one clock. The pipelining within the 
multiplier and graphics units can be described simi- 
larly, except that the number of stages may be differ- 
ent. 


Figure 2.14 illustrates three-stage 
pipelining as 
found in the floating-point adder (also in the floating- 
point multiplier when single-precision input operands 
are employed). The central columns of the table rep- 
resent the three stages of the pipeline. Each stage 
holds intermediate results and also (when intro- 


, duced into the first stage by software) holds status 
information pertaining to those results. The table as- 
sumes that the instruction stream consists of a se- 
ries of consecutive floating-point instructions, all of 
one type (Le.all adder instructions or all single-preci- 
sion multiplier instructions). The instructions are rep- 
resented as A, B, etc. The rows of the table repre- 
sent the states of the unit at successive clock cy- 
cles. Each time a pipelined operation is performed, 
the result of the last stage of the pipeline is stored in 
the destination register tdest, the pipeline is ad- 
vanced one stage, and the input operands of the 
operation are transferred to the first stage of the 
pipeline. 


Pipeline 
Clock 
Instruction 
Stage 1 
Stage 2 
Stage 3 
Result 


1 
A 
A 


A 


, 


2 
B 
B 


3 
C 
C 
B 
A 


4 
D 
D 
C 
B 
A -+ (des! of D 


5 
E 
E 
D 
C 
B -+ (des! of E 


6 
F 
F 
E 
D 
C -+ (des! of F 


Figure 2.14. Plpellned Instruction 
Execution 
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In the i860 XP microprocessor, the number of pipe- 
line stages ranges from one to three. A pipelined 
operation with a three-stage pipeline stores the re- 
sult of the third prior operation. A pipelined operation 
with a two-stage pipeline stores the result of the sec- 
ond prior operation. A pipelined operation with a 
one-stage pipeline stores the result of the prior oper- 
ation. 


There are four floating-point pipelines: one for the 
multiplier, one for the adder, one for the graphics 
unit, and one for floating-point loads. The adder 
pipeline has three stages. The number of stages in 
the multiplier pipeline depends on the precision of 
the source operands in the pipeline; it may have two 
or three stages. The graphics unit has one stage for 
all precisions. The load pipeline has three stages for 
all precisions. 


Changing the FZ (flush zero), RM (rounding mode), 
or RR (result register) bits of fsr while there are re- 
sults in either the multiplier or adder pipeline produc- 
es effects that are not defined. 


2.6.1.1 
Scalar Mode 


In addition to the pipelined execution mode, the 
i860 XP microprocessor also can execute f1oating- 
point instructions in "scalar" mode. Most floating- 
point instructions have both pipelined and scalar 
variants, distinguished by a bit in the instruction en- 
coding. In scalar mode, the floating-point unit does 
not start a new operation until the previous floating- 
point operation is completed. The scalar operation 
passes through all stages of its pipeline before a 
new operation is introduced, and the result is stored 
automatically. Scalar mode is used when the next 
operation depends on results from the previous few 
floating-point operations (or when the compiler or 
programmer does not want to deal with pipelining). 


2.6.1.2 
Plpellnlng Status Information 


Result status information in the fsr consists of the 
AA, AI, AD, AU, and AE bits, in the case of the ad- 
der, and the MA, MI, MO, and MU bits, in the case of 
the multiplier. This information arrives at the fsr via 
the pipeline in one of two ways: 
1. It is calculated by the last stage of the pipeline. 
This is the normal case. 
2. It is propagated from the first stage of the pipe- 
line. This method is used when restoring the 
state of the pipeline after a preemption. When a 
store instruction updates the fsr and the the U bit 
being written into the fsr is set, the store updates 
the result status bits in the first stage of both the 
adder and multiplier pipelines. When software 
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changes the result-status bits of the first stage of 
a particular unit (multiplier or adder), the updated 
result-status bits are propagated one stage for 
each pipelined floating-point operation for that 
unit. In this case, each stage of the adder and 
multiplier pipelines holds its own copy of the rele- 
vant bits of the fsr. When they reach the last 
stage, they override the normal result-status bits 
computed from the last-stage result. 


At the next floating-point instruction (or at certain 
core instructions), after the result reaches the last 
stage, the i860 XP microprocessor traps if any of the 
status bits of the fsr indicate exceptions. Note that 
the instruction that creates the exceptional condition 
is not the instruction at which the trap occurs. 


2.6.1.3 
Precision In the Pipelines 


In pipelined mode, when a floating-point operation is 
initiated, the result of an earlier pipelined floating- 
point operation is returned. The result precision of 
the current instruction applies to the operation being 
initiated. The precision of the value stored in tdest is 
that which was specified by the instruction that initia- 
ted that operation. 


If tdest is the same as fsrc1 or fsrc2, the value being 
stored in tdest is used as the input operand. In this 
case, the precision of tdest must be the same as the 
source precision. 


The multiplier pipeline has two stages when the 
source operands are double-precision and three 
stages when they are single. This means that a pipe- 
lined multiplier operation stores the result of the sec- 
ond previous multiplier operation for double-preci- 
sion inputs and third previous for single-precision in- 
puts (except when changing precisions). 


2.6.1.4 
Transition between Scalar and Plpellned 
Operations 


When a scalar operation is executed, it passes 
through all stages of the pipeline; therefore, any un- 
stored results in the affected pipeline are lost. To 
avoid losing information, the last pipelined opera- 
tions before a scalar operation should be dummy 
pipelined operations that unload unstored results 
from the affected pipeline. 


After a scalar operation, the values of all pipeline 
stages of the affected unit (except the last) are un- 
defined. No spurious result-exception traps result 
when the undefined values are subsequently stored 
by pipelined operations; however, the values should 
not be referenced as source operands. 


• 
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For best performance a scalar operation should not 
immediately precede a pipelined operation whose 
tdest is nonzero. 


2.6.1.5 Plpelined LO~lds 


The pfld instruction is optimized for accesses that 
miss the data cache and transfer directly from mem- 
ory. Therefore, even when there is a data cache hit, 
a pfld may generate a bus cycle. The data from the 
internal cache is used only if it was modified. Other- 
wise, data is taken from the external bus, even if it 
resides in the on-board cache. 


The pfld FIFa can be extended externally, due to 
the facts that a pfld always generates a bus cycle 
and that such a cycle can be identified externally by 
the value on the CTYP pin. Software written for an 
externally-extended pfld pipeline must ensure that it 
does not pfld from a location that was modified in 
the data cache. When a pfld cache hit to a modified 
line occurs, the pfld pipeline length used by the 
i860 XP microprocessor is three stages. The modi- 
fied data from the cache is put into the internal 
three-stage data FIFa, and the third pfld instruction' 
after the data cache hit will update its tdest register 
with the modified data. 


2.6.2 DUAL-INSTRUCTION MODE 


Another form of parallelism results from the fact that 
the i860 XP microprocessor can execute both a 


floating-point and a core instruction simultaneously. 
Such parallel execution is called dual-instruction 
mode. When executing in dual-instruction mode the 
instruction sequence consists of 64-bit aligned in- 
struction pairs, with a floating-point instruction in the 
lower 32 bits and a core instruction in the upper 32 
bits. Table 2.9 identifies which instructions are exe- 
cuted by the core unit and which by the floating- 
point unit. 


Programmers specify dual-instruction mode either 
by including in the mnemonic of a floating-point in· 
struction a d. prefix or by using the Assembler direc- 
tives .dual ...• enddual. Both of the specifications 
cause the D-bit of floating-point instructions to be 
set. If the i860 XP microprocessor is executing in 
single-instruction mode and encounters a floating- 
point instruction with the D-bit set, one more 32-bit 
instruction is executed before dual-mode execution 
begi~s. If th~ i860 XP microprocessor is executing in 
dual-Instruction mode and a floating-point instruction 
is encountered with a clear D-bit, then one more pair 
of instructions is executed before resuming single-in- 
struction mode. Figure 2.15 illustrates two variations 
of this sequence of events: one for extended se- 
quences of dual-instructions and one for a single in- 
struction pair. 


Note that d.fnop cannot be used to initiate dual in- 
struction mode. 


31 
o 
op 
d.fp-op 


63 
core-op or d.fp-op 
core-op 
d.fp-op 


core-op 
d.fp-op 
core-op 
fp-op 


core 
op 
fp-op 
op 
op 


1 
Entor Dual Instruction Modo 


Initiat. 
Exit from 
Dual Instruction Mod• 


••• 
LeaveDual Instruction Mode 
! 
240874-18 


Figure 2.15. Dual-Instruction 
Mode Transitions (1 of 2) 


31 
o 
op 
d.fp-op 


63 
fp-op 
I 
core-op 
fp-op 
op 
op 
1 
Temporary 
Dual Inrructlon 
Mod. 


240874-19 
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Figure 2.15. Dual-Instruction 
Mode Transitions (2 of 2) 


intel~ 


When a 64-bit dual-instruction pair sequentially fol- 
lows a delayed branch instruction in dual-instruction 
mode, both 32-bit instructions are executed. 


2.6.3 DUAL-OPERATION INSTRUCTIONS 


Special 
dual-operation 
floating-point 
instructions 
(add-and-multiply, subtract-and-multiply) use both 
the multiplier and adder units within the floating- 
point unit in parallel to efficiently execute such com- 
mon tasks as evaluating systems of linear equa- 
tions, performing the Fast Fourier Transform (FFT), 
and performing graphics transformations. 


The instruction classes pfam 
fsrc1, fsrc2, fdest, 
pfmam 
fsrc1, fsrc2, fdest (add and multiply), pfsm 
tsrct, 
fsrc2, fdest, and pfmsm 
tsrct, 
fsrc2, fdest 
(subtract and multiply) initiate both an adder opera- 
tion and a multiplier operation. Six operands are re- 
quired, but the instruction format specifies only three 
operands; therefore, there are special provisions for 
specifying the operands. These special provisions 
consist of: 
• Three special registers (KR, KI, and T) that can 
store values from one dual-operation instruction 
and supply them as inputs to subsequent dual-op- 
eration instructions. 
- 
The constant registers KR and KI can store 
the value of fsrc1 and subsequently supply 
that value to the multiplier pipeline in place of 
fsrc1. 
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- 
The transfer register T can store the last-stage 
result of the multiplier pipeline and subse- 
quently supply that value to the adder pipeline 
in place of fsrc 1. 
• A four-bit data-path control field in the opcode 
(opC)that specifies the operands and loading of 
the special registers. 
1. Operand-1 of the multiplier can be KR, KI, or 
fsrc1. 


2. Operand-2 of the multiplier can be fsrc2, the 
last-stage result of the multiplier pipeline, or 
the last-stage result of the adder pipeline. 


3. Operand-1 of the adder can be fsrc1, the 
T-register, the last-stage result of the multiplier 
pipeline, or the last-stage result of the adder 
pipeline. 
2 


4. Operand-2 of the adder can be fsrc2, the last- 
stage result of the multiplier pipeline, or the 
last-stage result of the adder pipeline. 


Figure 2.16 shows all the possible data paths sur- 
rounding the adder and multiplier. The ope field in 
these instructions selects different data paths. Sec- 
tion 10 shows the various encodings of the ope 
field. 


Note 
that 
the 
mnemonics 
pfam.p, 
pfsm.p, 
pfmam.p, and pfmsm.p are never used as such in 
the assembly language; these mnemonics are used 
here to designate classes of related instructions. 
Each value of ope has a unique mnemonic associ- 
ated with it. 


Single Precision 
3-Stage Multiplier and Adder 
Double Precision 
2-Stage Multiplier, 3-Stage Adder 


t.rcl 
t.rc2 tdest 


ADDER 
------------ 
result 


240874-20 
240874-21 


t.rcl 
t.rc2 Idest 


Figure 2.16. Dual-Operation Data Paths 
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2.7 Addressing Modes 


Data access 
is limited to load and store instructions. 
Memory 
addresses 
are computed 
from two fields of 
load and store instructions: 
isrc1 
and isrc2. 
1. isrc1 either contains 
the identifier 
of a 32-bit inte- 
ger register 
or contains 
an immediate 
16-bit ad- 
dress offset. 


2. isrc2 always specifies 
a register. 


Because 
either isrc1 or isrc2 may be null (zero), 
a 
variety 
of useful addressing 
modes 
result: 
offset + register 
Useful 
for 
accessing 
fields 
within 
a record, where register 
points 
to the beginning 
of the 
record. 
Useful 
for 
accessing 
items in a stack frame, 
where 
register is ra, the register 
used 
tor pointing 
to the beginning 
of 
the stack frame. 
register + register 
Useful 
for two-dimensional 
ar- 
rays or for array access 
within 
the stack frame. 
register 
Useful as the end result of any 
arbitraryaddress 
calculation. 
offset 
Absolute 
address 
into the first 
or last 32K of the 
logical 
ad- 
dress space. 


In addition, 
the floating-point 
load and store instruc- 
tions 
may select 
autoincrement 
addressing. 
In this 
mode isrc2 is replaced 
by the sum of isrc1 and isrc2 
after performing 
the load or store. This mode makes 
stepping 
through 
arrays 
more 
efficient, 
because 
it 
eliminates 
one address-calculation 
instruction. 


2.8 Traps and Interrupts 


Traps 
are caused 
by exceptional 
conditions 
detect- 
ed 
in 
programs 
or 
by 
external 
interrupts. 
Traps 
cause 
interruption 
of normal 
program 
flow 
to exe- 
cute 
a special 
program 
known 
as a trap 
handler. 
Traps are divided into the types shown in Table 2.10. 


2.8.1 
TRAP 
HANDLER 
INVOCATION 


This section 
applies to traps other than reset. When 
a trap occurs, 
execution 
of the current 
instruction 
is 
aborted. 
Except for bus error and parity error traps, 
the 
instruction 
is restartable. 
The 
processor 
takes 
the following 
steps while transferring 
control 
to the 
trap handler: 


1. Copies 
U (user mode) 
of the par into PU (previ- 
ous U). 


2. Copies 
IM (interrupt 
mode) 
into 
PIM 
(previous 
IM). 
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3. Sets U to zero (supervisor 
mode). 


4. Sets IM to zero (interrupts 
disabled). 
5. If the processor 
is in dual instruction 
mode, it sets 
DIM; otherwise 
it clears 
DIM. 


6. If the processor 
is in single-instruction 
mode and 
the 
next instruction 
will be executed 
in dual-in- 
struction 
mode 
or if the processor 
is in dual-in- 
struction 
mode 
and 
the 
next 
instruction 
will 
be 
executed 
in single-instruction 
mode, 
OS is set; 
otherwise, 
it is cleared. 
7. The appropriate 
trap type bits in par and epar are 
set (IT, IN, IAT, OAT, FT, OF, 
IL, PI, PT, BEF, 
PEF). Several 
bits may be set if the correspond- 
ing trap conditions 
occur simultaneously. 
8. An address 
is placed in the fault instruction 
regis- 
ter (fir) to help locate 
the trapped 
instruction. 
In 
single-instruction 
mode, the address 
in fir is the 
address 
of the trapped 
instruction 
itself. 
In dual- 
instruction 
mode, the address 
in fir is that of the 
floating-point 
half of the dual instruction. 
If an in- 
struction 
or data access fault occurred, 
the asso- 


ciated 
core 
instruction 
is the 
high-order 
half 
of 
the dual instruction 
(fir + 4). In dual-instruction 
mode, 
when 
a data 
access 
fault 
occurs 
in the 
absence 
of other 
trap 
conditions, 
the 
floating- 
point half of the dual instruction 
will already 
have 
been 
executed 
(except 
in the 
case 
of the fxfr 
instruction). 


The processor 
begins executing 
the trap handler 
by 
transferring 
execution 
to 
virtual 
address 
OxFFFFFFOO. The trap handler 
begins 
execution 
in 
single-instruction 
mode. 
The trap 
handler 
must ex- 
amine the trap-type 
bits in psr (IT, IN, IAT, OAT, FT) 
and epar (OF, IL, PT, PI, BEF, PEF) to determine 
the 
cause or causes 
of the trap. 


2.8.2 
INSTRUCTION 
FAULT 


This fault 
is caused 
by any of the following 
condi- 
tions. 
In all cases 
the processor 
sets the IT bit be- 
fore entering 
the trap handler. 


1. By the trap instruction. 
When trap is executed 
in 
dual-instruction 
mode, the floating-point 
compan- 
ion of the trap 
instruction 
is not executed 
before 
the trap is taken. 


2. By the Intovr instruction. 
The trap occurs 
only if 
OF in epar is set when Intovr is executed. 
To 
distinguish 
between 
cases 
1 and 2, the trap han- 
dier must examine 
the instruction 
addressed 
by 
fir. The trap handler 
should 
clear OF before 
re- 
turning. 
When 
Intovr causes 
a trap 
in dual-in- 
struction 
mode, 
the floating-point 
companion 
of 
the Intovr instruction 
is completely 
executed 
be- 
fore the trap is taken. 
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Table 2 10 Types of Traps 


Indication 
Caused by 
Type 
fsr 
Condition 
Instruction 
psr 
epsr 


Instruction 
IT 
OF 
Software traps 
trap 
Fault 
Intovr 
IL 
Missing unlock 
Any 
PT&PI 
Pipeline usage 
Any scalar or pipelined 
instruction that uses a 
pipeline 


Floating 
FT 
SE 
Floating-point source 
Any M- or A-unit except 
Point 
exception 
fmlow 
Fault 


. 
Floating-point result 
Any M- or A-unit except 
exception 
fmlow, pfgt, and pfeq. 
AO,MO 
overflow 
Reported on any F-P 
AU,MU 
underflow 
instruction, pst, fst, and 
AI,MI 
inexact result 
sometimes fld, pfld, and 
Ixfr 


Instruction 
IAT 
Address translation 
Any 
Access Fault 
exception during instruction 
fetch 


Data 
OAT 
Load/store address 
Any load/store 
Access 
translation exception 
Fault 
Misaligned operand address 
Any load/store 
Operand address matches 
Any load/store 
db register 


Parity 
IN 
PEF 
Parityerror on data pins during bus read operation 
Error Fault 
when PEN# pin active 


Bus Error Fault 
IN 
BEF 
External interrupt signal on BERR pin 


Interrupt 
IN 
INT 
External interrupt signal on INT pin 


Reset 
None 
PEF,BEF 
Hardware RESETsignal 


3. By violation of lock/unlock 
protocol, explained 
below. (Note that trap and Intovr should not be 
used within a locked sequence; otherwise, it 
would be difficult to distinguish between this and 
the prior cases.) 
4. By execution of an instruction that uses a pipeline 
when the PT bit of epsr is set. (Refer to section 
2.8.2.2.) 


2.8.2.1 Lock Protoclol 


The lock protocol requires the following sequence of 
activities: 
1. lock 
2. Any load or store instruction. For compatibility 
with future processor generations, this should be 
a load. 


3. unlock 
4. Any load or store instruction. For compatibility 
with future processor generations, this should be 
a store. 


There may be other instructions between any of 
these steps. The bus is locked after step 2, and re· 
mains locked until step 4. Step 4 must follow step 1 
by 30 instructions or less; otherwise, an instruction 
trap occurs. In case of a trap, IL is also set. If the 
load or store instruction of step 2 accesses a previ- 
ously unaccessed page (A=0), the bus is locked 
briefly while the A bit is set, unlocked, then locked 
again to satisfy the lock 
instruction and start the 
locked sequence. 


2.8.2.2 Using PT and PI Bits 


The PI and PT bits are provided to help the trap 
handler avoid unnecessarily saving and restoring the 
pipelines (refer to the section "Pipeline Preemption" 
in the i860 Microprocessor Family Programmer's 
Reference Manual). 


Trap handlers that use PI or PT must initially exam- 
ine fsr. If a pending trap exists-that 
is, if the FTE 
(floating-point trap enable) bit is set and any of the 
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floating-point exception bits (AI, so, AU, MI, MO, 
MU) is active-the 
trap handler must save the pipe- 
lines. The i860 XP microprocessor, like the i860 XR 
microprocessor, may set an fsr exception bit before 
the floating-point trap is generated, and this pending 
trap relies on information in the pipeline. For exam- 
ple, an external interrupt might invoke the trap han- 
dier between the scalar floating-point instruction that 
produces an overflow and the next floating-point op- 
eration-the 
one that would cause a branch to the 
trap handler for the floating-point trap. Refer to sec- 
tion 2.2.4. 


If no pending trap exists, the handler can follow ei- 
ther of the following two methods: 


• Using both PT and PI: Upon invocation, the trap 
handler saves the state of PI and PT (in epsr), 
but does not save the pipes. If PI is found set 
(which means that the interrupted code needs 
the state information currently in the floating- 
point pipelines), the handler sets PT and clears PI 
(with a single st.c to epsr instruction), then con- 
tinues with trap processing. If the pipes are used 
during trap handling (even by a scalar instruc- 
tion), a trap will be generated with IT and PI set 
by hardware. The trap handler may then check PI 
and PT, and if both are set, clear PT, PI, and IT, 
save the pipes, set an indication that they were 
saved, and restart execution from the instruction 
that caused the trap. At the end of trap handling, 
the trap handler restores the pipes if they were 
saved, and restores PI and PT to their values be- 
fore the trap. This method avoids both saving and 
restoring the pipes, assuming that most trap han- 
dling sequences do not alter the pipes, and there- 
fore a trap for PT= 1 will not happen very often. 
• Using only 
PI: Another approach is to leave 
PT=0, using only the PI bit, which the processor 
sets each time a pipelined instruction or pfld is 
encountered (even if the floating point instruction 
is suppressed due to KNF = 1). The trap handler 
saves PI, saves the pipes if.PI is set, sets an indi- 
cation that they were saved, and clears PI. At the 
end of trap handling, the trap handler restores the 
pipes if they were saved, and restores PI to its 
value before the trap. With this method, the pipes 
are sometimes saved and restored unnecessarily 
if the trap handler code does not use the pipes. 
This method is advised when it is known that the 
trap handler uses the pipes. 


2.8.3 FLOATING-POINT FAULT 


The floating-point fault is reported on floating-point 
instructions, pst, fst, and sometimes fld, pfld, and 
ixfr. The floating-point faults of the i860 XP micro- 
processor support the floating-point exceptions de- 
fined by the IEEE standard as well as some other 
useful classes of exceptions. The i860 XP micro- 
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processor divides these into two classes: source ex- 
ceptions and result exceptions. The numerics library 
supplied by Intel provides the IEEE standard default 
handling for all these exceptions. 


2.8.3.1 Source Exception Faults 


All exceptional operands, including infinities, denor- 
malized numbers and NaNs, cause a floating-point 
fault and set SE in the fsr. Source exceptions are 
reported on the instruction that initiates the opera- 
tion. For pipelined operations, the pipeline is not ad- 
vanced. 


SE is undefined for faults on fld, pfld, fst, pat, and 
Ixfr instructions under these conditions: 


• In single-instruction mode, always. 
• In dual-instruction mode, when the companion in- 
struction is not a multiplier or adder operation. 


2.8.3.2 Result Exception Faults 


The result exceptions include: 


• Overflow. The absolute value of the rounded true 
result would exceed the largest positive finite 
number in the destination format. 


• Underflow (when FZ is clear). The absolute value 
of the rounded true result would be smaller than 
the smallest positive finite number in the destina- 
tion format. 


• Inexact result (when TI is set). The result is not 
exactly representable in the destination format. 
For example, the fraction % cannot be precisely 
represented in binary form. This exception occurs 
frequently and indicates that some (generally ac- 
ceptable) accuracy has been lost. 


The point at which a result exception is reported de- 
pends upon whether pipelined operations are being 
used: 


• Scalar [nonplpellned) 
operations. 
Result ex- 
ceptions are reported on the next floating-point, 
fst.x, or pst.x(and 
sometimes fld, pfld, ixfr) in- 
struction after the scalar operation. When a trap 
occurs, the last-stage of the affected unit con- 
tains the result of the scalar operation. 


• Plpelined operations. 
Result exceptions are re- 
ported when the.result is in the last stage and the 
next floating-point (and sometimes fld, pfld, ixfr) 
instruction is executed. When a trap occurs, the 
pipeline is not advanced, and the last-stage re- 
sults (that caused the trap) remain unchanged. 


When no trap occurs (either because FTE is clear or 
because no exception occurred), the pipeline is ad- 
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vanced 
normally 
by the new floating-point 
operation. 
The result-status 
bits of the affected 
unit are unde- 
fined until the point that result exceptions 
are report- 
ed. At this point, the last-stage 
result-status 
bits (bits 
29 ..22 and 16..9 of the far) reflect 
the values 
in the 
last stages 
of both the adder and multiplier. 
For ex- 
ample, 
if the 
last-stage 
result 
in the 
multiplier 
has 
overflowed 
and a pfadd 
is started, a trap occurs and 
MO is set 


For scalar 
operations, 
the 
RR bits of far 
report 
in 
which 
register 
the result was stored. 
RR is updated 
when the scalar instruction 
is initiated. 
The result ex- 
ception 
trap, 
however, 
occurs 
on a subsequent 
in- 
struction. 
Programmers 
must 
prevent 
intervening 
stores to far from modifying 
the RR bits. Prevention 
may take one of the following 
forms: 


• 
Before 
any store to far when 
a result exception 
may be pending, 
execute 
a dummy floating-point 
operation 
to trigger the result-exception 
trap. 


• 
Always 
read 
from 
fsr 
before 
storing 
to it, and 
mask 
updates 
so 
that 
the 
RR 
bits 
are 
not 
changed. 


For pipelined 
operations, 
RR is cleared; 
the result is 
in the last stage 
of the pipeline 
of the appropriate 
unit. The trap handler 
must flush the pipeline, 
saving 
the results and the status bits. 


In either 
pipe lined or scalar 
mode, the trap handler 
must 
compute 
the 
result 
to be returned. 
In either 
case, the result delivered 
by the CPU has the same 
significand 
as the true result 
and has an exponent 
that is the low-order 
bits of the true result. The trap 
handler 
can 
inspect 
the delivered 
result, 
compute 
the result appropriate 
for that instruction 
(a NaN or 
an infinity, for example), 
and store the computed 
re- 
sult. If RR is nonzero, 
the trap 
handler 
must store 
the computed 
result in the register 
specified 
by RR; 
if RR is 'zero, it must load the last stage of the pipe- 
line with the computed 
result 
instead 
of the saved 
result. 


Result 
exceptions 
may be reported 
for both the ad- 
der and multiplier 
at the same time. In this case, the 
trap handler should fix up the last stage of both pipe- 
lines. 


2_8.4 
INSTRUCTION 
ACCESS 
FAULT 


This 
trap 
occurs 
during 
address 
translation 
for in- 
struction 
fetches 
in any of these cases: 


• 
The address 
fetched 
is in a page whose 
P (pres- 
ent) bit in the page table is clear (not present). 
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• 
The 
address 
fetched 
is in a supervisor 
mode 
page, but the processor 
is in user mode. 


• 
The address fetched 
is in a page whose 
PTE has 
A = 0, and the access 
occurs 
during 
a locked 
sequence 
(Le. between 
lock and unlock). 


Note 
that 
several 
instructions 
are fetched 
at one 
time, 
either 
due to instruction 
prefetching 
or to in- 
struction 
caching. 
Therefore, 
a trap 
handler 
can 
change 
from supervisor 
to user mode and continue 
to 
execute 
instructions 
fetched 
from 
a supervisor 
page. An instruction 
access 
trap occurs 
only when 
the next group of instructions 
is fetched 
from a su- 
pervisor page (up to eight instructions 
later). If, in the 
meantime, 
the handler 
branches 
to a user page, no 
instruction 
access 
trap occurs. 
No protection 
viola- 
tion results, 
because 
the processor 
does not permit 
data accesses 
to supervisor 
pages while 
running 
in 
user mode. 


2_8.5 
DATA 
ACCESS 
FAULT 


This trap results from an abnormal 
condition 
detect- 
ed during data operand 
fetch 
or store. Such an ex- 
ception can be due only to one of the following 
caus- 
es: 


• 
An 
attempt 
is being 
made 
to write 
to 
a page 
whose 
D (dirty) bit is clear. 


• 
A memory 
operand 
is misaligned 
(is not located 
at an address 
that 
is a multiple 
of the length 
of 
the data). 


• 
The address 
stored in the debug register 
is equal 
to one of the addresses 
spanned 
by the operand. 


• 
The operand 
is in a not-present 
page. 


• 
An attempt 
is being made from user level to write 
to a read-only 
page or to access a supervisor-lev- 
el page. 


• 
The operand 
is in a page whose 
PTE has A = 0, 
and the access occurs during a locked 
sequence 
(Le. between 
lock and unlock). 


• 
Write protection 
(determined 
by epsr bit WP = 1) 
is violated 
in supervisor 
mode. 


When 
a data 
access 
trap 
is taken 
on a pipelined 
floating-point 
instruction 
that occurs 
immediately 
af- 
ter the load or store instruction 
that causes the trap, 
the destination 
register of the pipelined 
floating-point 
instruction 
may be partially 
updated. 
Correct 
execu- 
tion will occur when the trap handler resumes 
execu- 


tion after 
handling 
the DAT, because 
the pipelined 
floating-point 
instruction 
will then correctly 
update its 
destination 
register. 
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2.8.6 PARITY ERROR TRAP 


If the PEN# pin is active and the bus unit detects a 
parity error during a bus read operation, the proces- 
sor sets PEF and IN, then generates a trap. Further 
parity error traps are masked as soon as PEF is set. 
To reenable such traps, software must clear PEF 
and unfreeze BEAR by executing Id.e bear, rdest. 


The interrupted program is not restartable. BS (bus 
or parity error trap in supervisor mode) is set by the 
i860 XP microprocessor when a parity error occurs 
while the processor is in supervisor mode. The oper- 
ating system can use this bit to decide, for example, 
whether to abort the process (user mode) or reboot 
the system (supervisor mode). 


2.8.7 BUS ERROR TRAP 


When external hardware asserts the BERR pin, the 
processor sets BEF (bus error flag) and IN (inter- 
rupt), and then traps. Further BERR traps are 
masked as soon as BEF is set by hardware. To 
reenable such traps, software must clear BEF and 
unfreeze BEAR by executing Id.e bear, rdest. 


BS (bus or parity error trap in supervisor mode) is set 
by the i860 XP microprocessor when a bus error oc- 
curs while the processor is in supervisor mode. The 
operating system can use this bit to decide, for ex- 
ample, whether to abort the process (user mode) or 
reboot the system (supervisor mode). 


2.8.8 INTERRUPT TRAP 


An interrupt is an event that is signaled from an ex- 
ternal source. If the processor is executing with in- 
terrupts enabled (IM set in the psr), the processor 
sets the interrupt bit IN in the psr and INT in the 
epsr, then generates an interrupt trap. 


Vectored interrupts are implemented by interrupt 
controllers and software. Software can use the Idlnt 
instruction to generate an interrupt acknowledge 
(INTA) cycle. This instruction generates a bus cycle 
with INTA cycle specifications, and places the data 
returned from the bus to the destination register. 
Tags are not checked in the data cache for hit, and 
the cycle is not burstable, 


The Intel 486 microprocessor generates two INTA 
cycles as a response to an interrupt and inserts four 
idle clocks in between. To generate an interrupt ac- 
knowledge sequence that is compatible with the 
Intel 486 microprocessor, the Idlnt instruction se- 
quence documented in section 5.1.4 should be exe- 
cuted. 
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2.8.9 RESET TRAP 


When the i860 XP microprocessor is reset, execu- 
tion begins in single-instruction mode at virtual ad- 
dress OxFFFFFFOO.This is the same address as for 
other traps. The reset trap can be distinguished from 
other traps by the fact that no trap bits are set. The 
instruction cache is flushed. The bits OPS, Bl, and 
ATE in dlrbase are cleared. CS8 is initialized by the 
value at the INT pin at the end of reset. The read- 
only fields of the epsr. are set to identify the proces- 
sor, while the Il, WP, and PBM bits are cleared. The 
bits U, IM, BR, and BW in psr are cleared, as are the 
trap bits FT, OAT, IAT, IN, and IT. All other bits of 
psr and all other register contents are undefined. 
Refer to Table 2.11 for a summary of these initial 
settings. 


The software must ensure that the control registers 
are properly initialized before performing operations 
that depend on the values of those registers. 


Reset code must initialize the floating-point pipeline 
state to zero with floating-point traps disabled to en- 
sure that no spurious floating-point traps are gener- 
ated. 


After a RESET the i860 XP microprocessor starts 
execution at supervisor level (U= 0). Before branch- 
ing to the first user-level instruction, the RESET trap 
handler or subsequent initialization code has to set 
PU and a trap bit so that an indirect branch instruc- 
tion will copy PU to U, thereby changing to user lev- 
el. 


2.9 Debugging 


The i860 XP microprocessor supports debugging 
with both data and instruction breakpoints. The fea- 
tures of the i860 XP microprocessor architecture 
that support debugging include: 
• db 
(data breakpoint register), which 
permits 
specification of a data address that the i860 XP 
microprocessor will monitor. 
• BR (break read) and BW (break write) bits of the 
psr, which enable trapping of either reads or 
writes (respectively) to the address in db. 


• OAT (data access trap) bit of the psr, which al- 
lows the trap handler to determine when a data 
breakpoint was the cause of the trap. 
• trap instruction that can be used to set break- 
points in code. Any number of code breakpoints 
can be set. The values of the isrc1 and 
isrc2 
fields help identify which breakpoint has oc- 
curred. 


• IT (instruction trap) bit of the psr, which allows 
the trap handler to determine when a trap in- 
struction was the cause of the trap. 
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Table 2.11. Register and Cache Values after Reset 


Registers 
Initial Vaule 


Integer Registers 
Undefined 
Floating-Point Registers 
Undefined 
psr 
U,IM, BR, BW, FT, DAT,IAT,IN,IT 
= 0; 


others are undefined 
epsr 
IL, WP, PBM, BE, PT = 0; BEF, PEF = 1; 
Processor Type, Stepping Number, DCS, 
SO are read only; others are undefined 
db 
Undefined 
dlrbase 
DPS, BL, LB, ATE = 0; others are undefined 
fir 
Undefined 
far 
Undefined 
bear 
Undefined 
p3-pO 
Undefined 
ccr 
CO, DO = 0; others are undefined 
KR, KI, T, MERGE 
Undefined 
NEWCURR 
Undefined 
STATUS 
InLoop, Nested, Detached = 0 


Caches 
Initial Value 


Instruction Cache 
All entries invalid 
Data Cache 
All entries invalid 
TLB 
All entries invalid 


3.0 
ON-CHIP CACHES· 


By holding data, instructions, and address transla- 
tion on-chip, the caches of the i860 XP microproces- 
sor provide the following advantages: 
1. Low chip count for the CPU subsystem. 
2. Wide processor-to-cache path: 16 bytes for data, 


8 bytes for instructions. 
3. Fast access without requiring much additional 
high-speed design 
in the 
system. The 
fast 
(50 MHz) cache-access circuitry is hidden on 
chip; the external bus can respond more slowly 
without significantly degrading performance. 


3.1 Address Translation Caches 


The i860 XP microprocessor allows both four Kbyte 
and four Mbyte page sizes, and a separate transla- 
tion look-aside buffer (TLB) is used to cache ad- 
dress translation information for each page size. The 
TLB for four-Kbyte pages (Figure 3.1) has 64 entries, 
and the TLB for four-Mbyte pages (Figure 3.2) has 
16 entries. Both are four-way set associative. The 
TLBs function when paging is enabled. When a page 
is first accessed, its translation information is saved 
in the appropriate TLB along with other page attri- 
butes, such as access rights and cacheability. Every 
address translation operation looks up the virtual ad- 
dress simultaneously in both TLBs. Only if the nec- 


essary paging information is not in either of the 
caches must the paging tables in memory be refer- 
enced. Both TLBs employ a random replacement al- 
gorithm to choose which of the four ways to replace. 


If an instruction's virtual address is found in the in- 
struction cache, the virtual address is not translated, 
and code access rights are not verified. However, 
when an instruction's virtual address is not found in 
the cache, address translation does occur, and all 
access rights are verified. The virtual addresses of 
data are always translated, and access rights are 
always verified. 


The i860 XP microprocessor requires simultaneous 
access to data and instruction caches, but the TLBs 
can service only one address translation at a time. 
Data address translation has higher priority in the 
TLBs than instruction address translation, if both are 
required at the same time. 


Any data or instruction access fault halts address 
translation at once, and the TLB is not updated. If a 
directory read causes an access fault, the page ta- 
ble is not read at all. 


If the paging unit generates a fault (in setting the D 
bit for the first write to a nondirty page, for example), 
the corresponding entry is deleted from the TLB. 
Therefore, software does not need to invalidate the 
TLB entry in response to DAT.or IAT faults. 
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NOTES: 
D 
Dirty 
CD 
Cache Disable 
WT 
Write-Through 
U 
User Mode 
W 
Writable 
V 
Validity 
PHYSICAL 
ADDRESS 
240874-22 


Figure 3.1. 4K TLB Organization 


NOTES: 
D 
Dirty 
CD 
Cache Disable 
WT 
Write-Through 
U 
UserMode 
W 
Writable 
V 
Validity 


VIRTUAL 
ADDRESS 
',$1.$0292827262524.'2.$22,'2120191617161514/.$1211109" 
iT 6.$ 
'" J.2 
'0 


TAG 
BYTE SELECT 


240874-23 
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PHYSICAL 
ADDRESS 


PHYSICAL ADDRESS 


PHYSICAL 
ADDRESS 


8543210 


PAGE rRAt.lE 
BYTE SELECT 


PHYSICAL 
ADDRESS 


Figure 3.2. 4M TLB Organization 


If TLB replacement is initiated during a locked se- 
quence generated by the lock instruction and if an- 
other locked sequence has to be executed to set the 
A-bit, the paging unit generates an access fault. This 
helps external hardware implement "locking by ad- 
dress" by preventing generation of nested lock se- 
quences. 


3.2 
Internal Instruction and Data 
Caches 


The i860 XP microprocessor has separate data and 
instruction caches on-chip. Having separate caches 
for instructions and data allows simultaneous cache 
look-up. Up to two instructions and 128 bits of data 
can be accessed simultaneously from these caches. 
The data and instruction caches hold 16 Kbytes 
each. A line can be filled from memory with a four- 
transfer burst. 


The caches are fully transparent to applications soft- 
ware. Snooping (address monitoring) is designed 
into both instruction and data caches, to maintain 
cache consistency in multiprocessor systems. 


Each cache has two sets of tags: virtual tags used 
for internal access, and physical 
tags used for 
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snooping. Figure 3.3 shows how the bits of both vir- 
tual and physical addresses are mapped for cach- 
ing. The presence of both virtual and physical tags 
supports aliasing, 
a situation in which the TLBs as- 
sociate a single physical address with two or more 
virtual addresses. 


Any area of memory can be cached, although both 
software and hardware can disallow certain areas 
from being cached-software 
by setting the CD bit in 
their page table entries; hardware by deasserting the 
KEN11 signal for bus cycles with addresses that fall 
in those areas. (Data reads from the two four-Kbyte 
pages pointed to by the CCUBASE field of ccr are 
not cached (and the CACHE11 signal is inactive), if 
the DCCUis activated by setting CO of the ccr regis- 
ter. This is independent of the value of KEN11.) 
When both software and hardware agree that a re- 
quested datum is cacheable, the i860 XP microproc- 
essor fetches an entire 32-byte line and places it 
into the appropriate cache. Cache line fills are gen- 
erated only for read misses, not for write misses. A 
store that misses the cache does not copy the 
missed line into cache from memory, but rather 
posts the datum in a write buffer, then sends it to the 
external bus when the bus is available. 


CACHE 
TAG 


240874-24 


SET 
SELECT 


EXTERNALLY 
GENERATED 
INQUIRY 
(SNOOP) 
ADDRESSES 


Figure 3.3. Cache Address 
Usage 


• 


2-41 


1860TM XP MICROPROCESSOR 


• 


3.2.1 
DATA CACHE 


Figure 3.4 shows the organization of the data cache. 
The data cache has two status bits per physical tag 
and one validity status bit for the virtual tag. A virtual 
tag hit is possible only when the validity bit of the 
virtual tag is set and the state of the physical tag is 
M, E, or S. 


Aliasing support is built into the cache look-up algo- 
rithm. Even though a physical line may be aliased, 
the processor never enters the line twice in the data 
cache. If a virtual address is not found among the 
virtual tags in the data cache, a bus cycle is initiated 
(except a read is not issued at this time if the bus 
pipeline is full) and, at the same time, the physical 
tags are searched for the physical address (which by 
this time has been retrieved from the paging unit). 
For reads, if the physical address is found, the data 
returned from the bus is ignored, on-chip data is 
used, and the virtual tag is replaced with the new 
one. For writes, if a virtual address is not found, the 
write is issued on the bus and memory is updated. If 
the physical address is found, the line in cache is 
updated, and the virtual tag is replaced with the new 
one. However, the cache state (M, E, or S) of the 
physical-address tag does not change when the vir- 
tual tag is overwritten. 


Note that the BE (big endian) bit of epsr has no 
influence on data cache behavior. Data items are 
kept in cache in exactly the same ordering as in ex- 
ternal memory. Byte-shifting operations invokedby 
the BE bit upon loads and stores occur at the input 
to the register files only. 


3.2.1.1 
Data Cache 
Update 
Policies 


To minimize bus traffic, a write-back policy is normal- 
ly used. The write-back policy (also called copy-back 
and deferred-write) 
reduces bus traffic by eliminating 


many unnecessary writes. Writes to a line in the 
cache are not immediately forwarded to main mem- 
ory; instead, they are accumulated in the cache. The 
modified cache line is written to main memory only 
when its cache space is needed for other data, 
when the modified data is needed by another proc- 
essor, or when a flush procedure is executed. 


Under the write-back policy, a write that hits the 
cache utilizes it for two cycles (one to check the 
virtual tags for hit, another to update the cache line). 
However, the cache pipeline allows successive 
store hits to operate at one per cycle. The proces- 
sor's internal write buffers can hold two successive 
stores, preventing a freeze upon store miss. 


Under a write-through 
policy, a write request to a line 
in the cache triggers updates to both cache and 
main memory. An address decoder, for example, 
can select the write-through policy for writes to video 
RAM, where it is necessary that writes be seen on 
the video display. Software, by setting the WT page- 
table bit, can select the write-through policy for spe- 
cific areas of memory-those 
that are used for inter- 


processor message queues, for example. 


A 
write-once 
policy combines write-through with 
write-back. Write-through is employed for the first 
write to a cache line, while subsequent writes to the 
same line follow the write-back policy. Write-once is 
valuable 
in multiprocessor systems to 
maintain 
cache consistency with the least possible bus traffic. 
The first write broadcasts to other processor nodes 
the fact that a line has been modified. Write-once is 
also used if a second-level cache is attached to the 
i860 XP microprocessor to maintain consistency be- 
tween the first- and second-level caches. 


The external system can dynamically change the up- 
date policy (write-back, write-through, write-once) of 
the i860 XP microprocessor with each cache line. 


NOTES: 
v 
Ox18 
Oxl0 
0 


M Modified 
'":::> 
V 
Ox18 
Oxl0 
0 
E Exclusive 
i 
S Shared 
v 
Ox18 
Oxl0 
8 
0 
I Invalid 
1---32-BYTE LlNES----l 
V Validity 
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Figure 3.4. Data Cache Organization 
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3.2.2 INSTRUCTION CACHE 


Figure 3.5 shows the organization of the instruction 
cache. The instruction cache has one validity bit that 
is common to both virtual and physical tags. Aliasing 
support for instructions consists not simply of chang- 
ing the virtual tag, but rather fetching a line whenev- 
er a virtual tag miss occurs. If the physical address 
already exists in the instruction cache, its line and its 
tags are overwritten. So, even though a physical line 
may be aliased, the processor never enters the line 
twice in the instruction cache. 


3.2.3 CACHE REPLACEMENT ALGORITHM 


The 
data, 
instruction, 
and 
address-translation 
caches all use similar algorithms to choose which of 
the four cache blocks will be overwritten when a 
miss causes a line fetch. 


First, the first invalid line (if any) in a set of four is 
replaced (in the order 0, 1, 2, 3). When there are no 
more invalid lines in a set, a pseudorandom replace- 
ment algorithm chooses which valid lines to replace. 
The algorithm is controlled by counters inside the 
chip. RESET initializes these counters to zero, so 
that the "randomness" 
is deterministic and two 
iB60 XP CPUs executing the same code on identical 
boards have exactly the same series of cache hits, 
misses, and replacements. 
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Setting ITI to invalidate the caches and TLBs also 
resets the counters used to select the set used for 
cache line replacement. This brings the iB60 XP mi- 
croprocessor cache-replacement mechanism to a 
known state without resetting the whole Chip. 


When the flush instruction is used to write back 
modified lines in the data cache, the flush routine 
must alter the RC (replacement control) field of dlr- 
base. Therefore, replacement is not random. In- 
stead, the block (or "way") replaced is the one se- 
lected by the RB (replacement block) field of dlr- 
base. 


3.2.4 CACHE CONSISTENCY PROTOCOL 


The iB60TMXP Microprocessor implements cache 
consistency via its use of a MESI (Modified, Exclu- 
sive, Shared, Invalid) protocol. 


3.2.4.1 Data Cache States 


Each line of the data cache of the iB60 XP micro- 
processor can be in one of the states defined in Ta- 
ble 3.1. Note that the instruction cache of the 
iB60 XP only implements the "SI" part of the MESI 
protocol, because the instruction cache is not writa- 
ble. 


Ox18 
Oxl0 
8 
0 


NOTE: 
V Validity 


Ox18 
Ox10 


Ox18 
Oxl0 
0 


1--32-BYTE 
lINES----l 
240874-26 


Figure 3.5. Instruction Cache Organization 


Table 3.1. MESI Cache Line States 


Cache Line State: 
M 
E 
S 
I 
Modified 
Exclusive 
Shared 
Invalid 


This cache line is valid? 
Yes 
Yes 
Yes 
No 


The memory copy is ... 
. .. out of date 
... valid 
... valid 
- 


Copies exist in other caches? 
No 
No 
Maybe 
Maybe 


A write to this line ... 
. .. does not go 
... does not go 
... goes to bus 
... goes 
to bus 
to bus 
and updates 
directly to bus 
the cache 
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Table 3.2. Internally Initiated Cache State Transitions 


State 
Next State after Read 
Next State after Write" 


I 
If WB/WT 11 = 1; E; else S 
Write-through 
Line fill 
I 
S 
S 
Write·through 
If WB/WTII = 1, E; else S 
E 
E 
M 
M 
M 
M 


NOTE: 
• "Write" 
does not include 
write· backs due to replacement. 
Those 
can only cause 
an M to I 
transition. 


The state of a cache line can change as the result of 
either internal or external activity related to that line. 
Table 3.2 presents the line state transitions that re- 
suit from internal activity of the i860 XP microproces- 
sor in the data cache. 


External 
cache-consistency 
support 
is 
provided 
through inquiry cycles. Inquiry cycles are initiated by 
other processors in a multiprocessor system to 
check whether an address is cached in the internal 
cache of the i860 XP microprocessor. Table 3.3 
shows the line state transitions initiated by inquiry 
cycles. 


Table 3.3. Inquiry-Initiated 
Cache State Transitions 


State 
INV=O 
INV=1 


I 
I 
I 
S 
S 
I 
E 
S 
I 
M 
S; write back the line 
I; write back the line 


3.2.4.2 Write-Once Policy 


A write-once cache policy can be implemented 
through use of the WB/WT 11 input pin. The signal 
on this pin is sampled in both read and write cycles. 
A read miss causes a line to enter either S or E after 
the line fill. If WB/WT 11 is sampled LOW at the time 
of NAil 
or the first BRDYII activation, the line en- 
ters S state, forcing the next write hit to this line to 
show up on the bus. If WB/WTII 
is sampled HIGH, 


the line enters E state. In write-through cycles, the 
state of a line is changed from S to E when WB/ 
WTII 
is sampled HIGH, so that subsequent writes 
will not be written through to the bus. Thus, if this 
signal is driven LOW on read cycles and HIGH on 
write cycles, a write-once cache policy is implement- 
ed. The easiest way to implement write-once (in sys- 
tems not using the 82495XP cache controller) is to 
tie this pin to the W/R# 
output of the processor. 
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If the WT bit in the page table entry is set, the 
i860 XP microprocessor ignores the WB/WT 11 sig- 
nal for the cycles that hit that page and always per- 
forms a write-through. In other words, hardware can- 
not 
override software's 
selection 
of 
the 
write- 
through policy. 


3.2.4.3 Locked Access 


Locked 
accesses 
are those data loads and stores \ 


that occur after a lock instruction up to and including 
the first load or store after the corresponding unlock 
instruction. 


State transitions for locked accesses differ from 
those in Table 3.2 in ways that guarantee that 
locked accesses are seen by all processors in the 
system. Any locked load or store generates both a 
cache look-up and an external bus cycle, regardless 
of cache hit or miss. 
1. In a locked read: 
a. If the required data is not found in the cache, 
the data from the bus is used. The data is 
placed in the cache if it is cacheable and 
KEN11 is also asserted. 


b. If the required data is found in an unmodified 
(E or S) state, the data from the bus is used. 


c. If the data is found in the cache in a modified 
(M) state, the cached data is used, and the 
bus data is ignored, as long as no inquiry 
write-back occurs before the BRDY11 of the 
bus cycle. If, however, an intervening inquiry 
write-back changes the line to S or I state, the 
bus data is used. 
2. A locked store is forced through the cache and 


issued on the bus. No more data accesses occur 
until the last BRDY11 for the store. If the store 
hits the internal cache, the cache update is done 
after the last BRDY11 from the bus. Note that the 
line written by a locked store remains in M state 
in spite of the write-through to the bus, because 
the length of the write-through is less than the 
line size of 32 bytes. 


Locked accesses are totally serializing 
in the sense 
that: 
1. All loads and stores that precede the lock in- 
struction are issued on the bus (if they miss the 
cache) before the first locked access is issued. 
The locked access can be issued before the last 
BRDY11 of the prior cycle if NA11 is activated in 
response to the prior cycle. 
2. No load or store after the last locked access is 
issued internally or on the bus until the final 
BRDY11 for all locked accesses. 


To maximize performance, instruction fetches during 
the locked sequence are not serializing. When NA11 
invokes pipelining, instruction fetches may be issued 
while locked data fetches or stores remain on the 
bus. 


3.3 
Internal Cache Consistency 


Both the instruction and the data caches can be 
snooped by externally generated inquiry cycles, and 
the result of the look-up is presented on the HIT11 
and HITMIt output pins. These inquiry cycles help 
maintain consistency with caches of other proces- 
sors. However, software must take care not to cre- 
ate inconsistencies such as the following among the 
internal caches (including the TLBs): 
1. Changing the address space while leaving virtual- 
address tags from the prior space in the instruc- 
tion or data cache. 
2. Changing instructions in memory (or in the data 
cache) without changing them in the instruction 
cache. 


3. Changing page table information in memory (or in 
the data cache) without changing the same infor- 
mation in the TLBs. 


Under certain circumstances, such as 1/0 
refer- 
ences, self-modifying code, page-table updates, or 
shared data in a multiprocessing system, it is neces- 
sary to bypass, to invalidate, or to flush the caches. 
The i860 XP microprocessor provides the following 
methods for doing this: 


• Bypassing Instruction 
and Data Caches. 
1. If deasserted during cache-miss processing, 
the KEN11 pin disables instruction and data 
caching of the referenced data. 
2. If the CD bit of the associated page table is 
set, caching of a page is disabled. The value of 
the CD bit is output on the PCD pin for use by 
external caches. 
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3. If the WT bit of the associated page table is 
set, caching is not disabled, but writes pass 
through the cache. The value of the WT bit is 
output on the PWT pin for use by external 
caches. (Note that WT does not affect policy 
for the instruction cache, because the instruc- 
tion cache is not writable. However, when an 
instruction from a page having the WT bit of 
the PTE set is placed in the data cache, the 
write-through policy applies just as for a data 
page.) 


• Invalidating 
Cache 
Entries. 
Storing 
to 
the 
dlrbase 
register with the ITI bit set invalidates 
each line of the instruction and address-transla- 
tion caches. In the data cache, it invalidates the 
virtual tags, but not the physical tags. 


• Flushing the Data Cache. The data cache is • 
flushed by a software routine that uses the flush 
instruction. The flush instruction speeds up write- 
backs. The same effect (writing back modified 
lines) can be achieved with the load instruction 
IdJ, but this would be more than twice as slow- 
the load must first do four bus transfers to get 
new data, then writel:lack the modified line. The 
flush instruction causes the write-backs without 
requiring a read from external memory to replace 
the modified line. 


3.3.1 ADDRESS SPACE CONSISTENCY 


In a multitasking virtual-address system, the operat- 
ing system may intentionally employ aliasing, where 
several processes use the same physical memory 
while accessing it with different virtual addresses. 
When the operating system switches control from 
one process to the next, it changes the DTB field of 
the dlrbase to point to a different page directory that 
defines the new address space. When this happens, 
all caches must be invalidated: the TLBs, so that the 
new page directory is read into the TLBs; the data 
and instruction caches, so that virtual addresses 
from the new space don't accidently match cached 
virtual addresses from the old space. 


The caches are invalidated by setting the ITI bit 
when writing to dlrbase. Invalidating the instruction 
cache invalidates both the physical and the virtual 
tags, because the instruction cache has one status 
(valid) bit, which is common to both physical and 
virtual tags. In the data cache, setting ITI does not 
invalidate physical tags. However, any modified lines 
will eventually be written back when their space is 
required for lines from the new address space or 
when external agents on the bus express a need for 
the modified data via inquiry cycles. 
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The caches are invalidated by setting the ITI bit 
when writing to dlrbase. Note, however, that the op- 
erating system code that flushes the caches must 
be present during the flushing. Typically this code 
has the same virtual address for all processes. 


NOTE: 
The mapping of the page(s) containing the cur- 
rently executing instruction, the next six in- 
structions, and any data referenced by these 
instructions should not be different in the new 
page tables when the DTB is changed. 


Enabling or disabling address translation (via the 
ATE bit) is similar to changing the OTB, in that the 
address mapping is changed. The virtual tags in the 
data and instruction cache must be invalidated prior 
to changing ATE. 


3.3.2 
INSTRUCTION 
CACHE 
CONSISTENCY 


When software modifies a page containing instruc- 
tions (as when a debugger replaces an instruction 
with the trap instruction to set a breakpoint). the in- 
struction cache can become inconsistent for any of 
the following reasons: 
• Because the data cache uses a write-back policy, 
changes to cached instruction pages do not im- 
mediately update memory. 


• Changes to instructions do not automatically up- 
date the instruction cache. 
• Instruction cache misses are not checked in the 
data cache. 


Software must ensure that modified lines containing 
instructions are written to main memory before the 
instruction cache tries to read them. There are two 


, methods for this: 


1. Flush the data cache using the flush instruction. 
Note that to make the instruction cache consist- 
ent with the data cache, the data cache must be 
flushed before invalidating the instruction cache. 
2. Mark all instruction pages as WT (write through) 
so that modifications to instructions are immedi- 
ately written to memory. This is the better alterna- 
tive. 


In either case, the instruction cache must be invali- 
dated (by a store to dlrbase 
with ITI set) after a 
code page has been modified, so that the updated 
instructions will be read from memory. 
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3.3.3 
PAGE TABLE 
CONSISTENCY 


When the operating system modifies page tables or 
directories, the TLBs can become inconsistent with 
the modifications for any of the following reasons: 
• Because the data cache uses a write-back policy, 
updates to cached page tables do not immediate- 
ly update memory. 


• Changes to page tables do not automatically up- 
date the TLB. 
• The i860 XP microprocessor searches only exter- 
nal memory for page directories and page tables 
in the translation process. The data cache is not 
searched. (Data is not transferred from the data 
cache to the TLBs during TLB replacement cy- 
cles.) 


Software must ensure that modified lines containing 
page table entries are written to main memory be- 
fore the paging unit tries to read them. There are two 
methods for this: 
1. Keep page tables and directories in noncachea- 
ble memory or write-through pages. 
2. Flush the data cache using the flush instruction. 


The processor itself invalidates the affected TLB en- 
try, when a trap is triggered by the need to set the A 
or D bit. In other cases, after a page table or directo- 
ry has been modified, software must invalidate the 
TLBs (by a store to dlrbase with ITI set) so that the 
updated entries will be read from memory. 


The data cache does not need flushing if the pro- 
gram is modifying only the P, U, W, A, or D bits of a 
PTE (as long as the page frame address is not 
changed and the PTE itself is not in the data cache.) 
The i860 XP CPU does not use the TLB for cache 
line write-backs; it writes to the address in the physi- 
cal tag. 


Thus, a trap handler can service a data access trap 
for D-bit zero merely by setting D= 1. When setting 
the P or A bits, there is no need to invalidate or flush 
any caches, because the processor does not load 
entries into the TLB that have P=O or A=O. 


Two potential TLB inconsistencies are avoided auto- 
matically by the i860 XP microprocessor. 
1. If the paging unit issues a write cycle (to set the A 
bit, for example), this cycle is snooped by the 
data cache for invalidation. 
2. Any TLB entry that causes a DAT or IAT is auto- 
matically invalidated. 
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3.3.4 
CONSISTENCY OF CACHEABILlTY 


Normally, an operating system ensures that the 
page attributes (CD and WT) of a memory access 
are consistent with the cache contents. However, 
the operating system can fail to maintain consisten- 
cy by the following actions: 
• Changing the CD or WT bits while related lines 
are in the cache. 


• Aliasing a physical address with virtual addresses 
that have differing CD or WT bits. 


In these situations, the i860 XP microprocessor 
gives priority to cache state. For example: 
1. If a read or write request is to a noncacheable 
page (CD=1), but the data (or code) is found in 
cache, the request is satisfied by the cache, and 
no external cycle is issued. 
2. If the physical address of a read or write request 
hits in the cache but the virtual address misses, 
the virtual tag is overwritten by the new virtual 
address, but the CD bit of the new virtual address 
is ignored. 
3. If a store to a write-through page (WT=1) hits a 
cache line in E or M state, no write-through cycle 
is issued; only the cache is updated. 


3.3.5 
LOAD PIPE CONSISTENCY 


The pfld (pipelined floating-point load) instruction fa- 
cilitates transfer of data from memory to registers, 
and avoids placing data in the data cache. When 
large amounts of data are used, pfld allows the pro- 
grammer to keep rarely-used data out of the cache. 
The i860 XP microprocessor ensures consistency 
between cached data and pfld references. It checks 
the data cache and, upon a data cache hit to a modi- 
fied line, forwards data from cache into the three- 
stage pfld pipeline. 


3.3.6 
SUMMARY 


Table 3.4 summarizes flush and invalidation require- 
ments, assuming that WT is set in the PTEs of in- 
struction and page-table pages: 
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Table 3.4. Summary of 
Cache Flushing And Invalidation 


Flush 
Invalidate 
Action 
Data 
Caches 
Cache 
(ITI) 


Setting A 
No 
No 
Setting P 
No 
No 
Clearing P 
No 
Yes 
Setting 0 
No 
NO(4) 
Changing protection (U,W) 
No 
Yes 
Setting CD or WT 
Yes 
Yes 
Changing PFA in a used(1)PTE 
No 
Yes 
Changing dlrbase DTB 
No 
Yes 
Changing dlrbase ATE 
No 
Yes 
Changing epsr WP 
No 
No 
Setting ccr DO and CO 
Yes(2) 
Yes(2) 
Modifying code 
No(3) 
Yes 


NOTES: 
1. ':Used" means a PTE that at some past time had P set. 
2. If data from either of the CCU pages could have been 
cached. 
3. Assuming all instructions and their page directories and 
page tables are in write-through or noncacheable pages. 
4. If one PTE is aliased to two or more different PDE's, the 
TLB must be invalidated on the first write to a non-modified 
page. The TLB can be invalidated by setting the ITI bit in 
the directory base register. 


4.0 
HARDWARE 
INTERFACE 


In the following description of hardware interface, 
the # symbol at the end of a signal name indicates 
that the active or asserted state occurs when the 
signal is at a low voltage. When no # is present after 
the signal mime, the signal is asserted when at the 
high voltage level. 


4.1 Pins Overview 


Figure 4.1 identifies functional groupings of the pins. 
Table 4.1 lists every pin by its identifier, gives a brief 
description of its function, and lists some of its char- 
acteristics. All output pins are tristate, except BREQ, 
HIT#, HITM#, HLDA, LOCK#, and PCHK#. 


•• 
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KBO,KB1 
Cache Block 
HIGH 
HLDA, BOFF# 


LEN 
Length 
HIGH 
HLDA, BOFF# 


LOCK # 
Address 
Lock 
LOW 
M/IO# 
Memory/la 
HLDA, BOFF# 
NENE# 
Next Near 
LOW 
HLDA, BOFF# 
PCD 
Page Cache Disable 
HIGH 
HLDA, BOFF# 
PCHK# 
Parity Check 
LOW 
PCYC 
Page Cycle 
HIGH 
HLDA, BOFF# 
PWT 
Page Write-Through 
HIGH 
HLDA, BOFF# 
TOO 
Test Output 
Nonscan 
Mode 
W/R# 
Write/Read 
HLDA, BOFF# 


Input/Output 
Pins 


A31-A3 
Address 
HIGH 
AHOLD, 
HLDA, BOFF# 
063-00 
Data 
HIGH 
HLDA, BOFF# 
DP7-DPO 
Data Parity 
HIGH 
HLDA, BOFF# 


Input Pins 


AHOLD 
Address 
Hold 
HIGH 
Synch 
BERR 
Bus Error 
HIGH 
Synch 
BOFF# 
Back-Off 
LOW 
Synch 
RSRVD# 
Intel Reserved 
BRDY# 
Burst Ready 
LOW 
Synch 
BYPASS # 
Intel Reserved 
LOW 
CLK 
Clock 
RESET 
Reset 
HIGH 
Asynch 
EADS# 
External Address 
Status 
LOW 
Synch 
EWBE# 
External Write Buffer Empty 
LOW 
Synch 
FLlNE# 
Flush Line 
LOW 
Synch 
HOLD 
Bus Hold 
HIGH 
Synch 
INT/CS8 
Interrupt/Code-Size 
8 
HIGH 
Asynch 
INV 
Invalidate 
HIGH 
Synch 
KEN# 
Cache Enable 
LOW 
Synch 
NA# 
Next Address 
LOW 
Synch 
PEN# 
Parity Enable 
LOW 
Synch 
TCK 
Test Clock 
TDI 
Test Data Input 
Synch 
PUll-up 
TMS 
Test Mode Select 
Synch 
Pull-up 
TRST# 
Test Reset 
LOW 
Asynch 
Pull-up 
WB/WT# 
Write-Back/Write- 
Through 
Synch 
SPARE 
Intel Reserved 
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The pins O/C#, 
W/R#, 
and M/IO# 
define 
bus cy- 
cle types. 
They 
are summarized 
in Table 
4.2. 
For 
data 
transfers 
to 
or from 
memory, 
two 
additional 
pins, CTYP 
and 
PCYC, provide 
further 
information 
regarding 
the type of transfer, 
as shown in Table 4.3. 


Table 
4.4 shows 
how the LEN and CACHE# 
pins 
determine 
cycle length. 


t1 
I\. 


A31-A3 
) 


DATA 
ADDRESS n 
~ 


BE7M 
BEOM 


I' 


.A 
I\. 


< 063-00 
~ 
" 


.A 
I\.~--~----~~l 
< DP7-DPO 


~NM 
I' 
"':":;:~-1------~U·:: 
ADSM 


,: LEN 


.:, PCHKM 
PARITY 
Table 4.2. ADS# 
Initiated 
Bus Cycle 
Definitions 


MIIO# 
D/C# 
W/R# 
Bus Cycle Initiated 


0 
0 
0 
Interrupt Acknowledge 
0 
0 
1 
Special Cycle 
0 
1 
0 
1/0 Read 
0 
1 
1 
I/O Write 
1 
0 
0 
Code Read 
1 
0 
1 
Reserved 
1 
1 
0 
Memory Read 
1 
1 
1 
Memory Write 


BRDYM 
NAM 
CYCLE 
CONTROL 


,: CACHEM 


::, LOCKM 


:,: NENEM 


:,: PWT 
KENM 
CACHE 
CONTROL 


,,: PCD 


:,: HITM 
WB WTM 


AHOLD 


:,: HITIAM 
EADSM 


INV 
CACHE 
CONSISTENCY 


,:, KBO. 


,:, KBl 


:" BREQ 


rUNEM 


HOLD 
BUS 
ARBITRAnON 
Table 
4.3. Memory 
Data Transfer 
Cycle 
Types 
:,: HLDA 


PCYC 
CTYP 
W/R# 
Data Transfer 
Type 


0 
0 
0 
Normal read 
0 
1 
0 
Pipelined load (pfld instruction) 
1 
0 
0 
Page directory read 
1 
1 
0 
Page table read 
0 
0 
1 
Write-through (S-state hit) 
0 
1 
1 
Store miss or write-back 
1 
0 
1 
Page directory update 
1 
1 
1 
Page table update 


:,: D/CM 
INT/cS8 


..:B;:::ER::.:;R:-"-MINTERRUPT 
CYCLE 
,:, WIRM 


DErlNITION •.•.":rP:::C:=::YC~_. 
", CTYP 


TCK 


TDI 
::' TOO 
BOUNDARY 
SCAN 
HoIS 


TRSTM 


RESET 
ClK 


EWBEM 


BYPASS M 
NOTE: 
PCYC and CTYP are defined only for memory data transfer 
cycles (O/C# 
= 1, MIIO# = 1) 
240874-27 


Figure 4.1. Signal 
Grouping 


Table 4.4. Cycle 
Length 
Definition 


W/R# 
LEN 
CACHE # 
KEN# 
Cycle 
Description 
Burst 
Length 


0 
0 
1 
- 
Noncacheable* 
* 64-bit (or less) read 
1 
0 
0 
- 
1 
Noncacheable 
64-bit (or less) read 
1 
1 
0 
1 
- 
64-bit (or less) write 
1 
- 
0 
1 
- 
1/0 and Special Cycles 
1 
0 
1 
1 
- 
Noncacheable 
128-bit read (p)fld.q 
2 
0 
1 
- 
1 
Noncacheable 
128-bit read (p)fld.q 
2 
1 
1 
1 
- 
128-bit write fst.q 
2 
0 
- 
0 
0 
Cache line fill 
4 
1 
- 
0 
- 
Cache write-back 
4 


NOTE: 
•* Includes CS8-mode code fetches, which may be cached by the processor. 
-Indicates 
"don't care" values. 
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4.2 Signal Description 


In this section descriptions of all pins are presented 
in alphabetical order. 


4.2.1 A31-A3 
(ADDRESS PINS) 


The 29-bit address bus (A31-A3) identifies address- 
es to a 64-bit location. Separate byte-enable signals 
(BE?#-BEO#) 
identify which bytes should be ac- 
cessed within the 64-bit location. 


The address lines are bidirectional. The i860 XP mi- 
croprocessor drives the address lines unless it is in a 
hold state. The system drives address lines A31-A5 
to perform cache line inquiries (refer to the EADS# 
signal description). 


4.2.2 ADS# (ADDRESS STATUS) 


The i860 XP microprocessor asserts ADS# to iden- 
tify the first clock period of each bus cycle, the clock 
period during which new values become valid on the 
address bus and cycle-definition pins. This signal is 
held active for one clock. 


If BOFF# is asserted, the processor floats ADS# 
two clocks after sampling BOFF# (and not, like all 
other pins, on the next clock). This is to ensure that 
ADS# is deasserted before it floats, and therefore is 
never left floating active. 


ADS# can be asserted while AHOLD is active to 
initiate a cache write-back cycle. 


4.2.3 AHOLD (ADDRESS HOLD) 


The external system asserts AHOLD to perform a 
cache inquiry. In response to assertion of AHOLD, 
the i860 XP microprocessor immediately (in the next 
Clock)stops driving the address bus (A31-A3 lines). 
The other buses remain active, and data can be 
transferred for previously issued read or write bus 
cycles during address hold. AHOLD is recognized 
even during RESET and LOCK#. The earliest that 
AHOLD can be deasserted is the clock after EADS# 
is asserted to start the inquiry. 


If HITM# 
has activated due to an inquiry, the 
i860 
XP 
microprocessor 
asserts 
ADS# 
while 
AHOLD is active to start the write-back of the modi- 
fied line that was the target of the inquiry. 


4.2.4 BEU-BEO# 
(BYTE ENABLES) 


The byte-enable pins are driven with the address. 
BE?# applies to D63-D56, BEO# applies to D?- 
DO. 
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In write cycles (noncacheable writes as well as 
cache line write-backs), the BEn# signals determine 
which bytes must be written into external memory 
for the current cycle. 


In read cycles, the BEn# values indicate which byte 
the load instruction has requested. In all noncachea- 
ble read cycles (CACHE# or KEN# deasserted), 
the byte enables match the length and address of 
the requested data. Cacheable read cycles (KEN# 
asserted), however, result in four 64-bit memory 
transfers to fill an entire 32-byte cache line. The 
BEn# pins activated are those that represent the 
operand of the load instruction that caused the line 
fill, and these same BEn# pins remain activated for 
as long as A31-A5. All 64 bits must be returned for 
each cacheable cycle without regard for the BEn# 
signals. 


While in CS8 mode, BE2#-BEO# 
serve as (active- 
high) lower-order address bits for instruction fetches 
(from the ROM). Data fetches and stores are not 
affected by CS8 mode, and BE2#-BEO# 
retain 
their normal byte-enable function for data. 


4.2.5 BERR (BUS ERROR) 


This is a nonmaskabJe interrupt input, which sup- 
ports bus error handling or other urgent circum- 
stances. BERR is not masked by the IM bit of the 
psr nor by lock cycles. When BERR is activated, the 
i860 XP microprocessor vectors to the trap handler 
and sets the bus error flag (BEF) in the epsr. Activa- 
tion of BERR causes the physical address of the 
current bus cycle to be latched into the Bus Error 
Address Register (BEAR). However, the BEAR will 
only latch valid information if BERR is asserted in 
the same clock as BRDY# or one clock after. In all 
other conditions the contents of the BEAR register, 
as well as the BS bit in the Extended Processor 
Status Register (ESPR) are undefined. BERR is 
rising-edge sensitive. Once the trap has occurred, 
further BEF traps cannot occur until software has 
cleared BEF and read BEAR. 


BERR does not terminate outstanding bus cycles. 
Therefore, the system must still activate BRDY# a 
sufficient number of times or activate BOFF# for 
those cycles. Even though activating BOFF# tem- 
porarily halts the erring cycles, the i860 XP micro- 
processor will retry them when BOFF# is deassert- 
ed, in spite of BERR. 


Timing of BERR is not influenced by late back-off 
mode. 


4.2.6 BOFF# (BACK-OFF) 


The system can assert this signal to abort all out- 
standing bus cycles that have not yet completed. In 
response to BOFF#, the i860 XP microprocessor 


intel~ 


immediately (in the next clock) floats its bus, except 
for ADS#, 
which is floated one clock later. The 
processor floats all the same pins normally floated 
during bus hold; however, unlike a bus hold, HLDA is 
not asserted. (HLDA is asserted only in response to 
HOLD; no acknowledgment is required for BOFF# .). 
Any data and BRDY# returned to the processor 
while BOFF# is asserted are ignored. The proces- 
sor remains in bus hold until BOFF# is deasserted, 
at which time it restarts the bus cycles by driving the 
address and cycle definition pins and asserting 
ADS#. When BOFF# deactivates, ADS# may be 
asserted the following clock. Thus a BOFF# dura- 
tion of one clock results in not floating ADS# at all. 
BOFF# cannot be used to force the pins to float 
during RESET; use HOLD for that purpose. 


4.2.7 BRDY# (BURST READY) 


The input BRDY# indicates either that the external 
system has driven valid data on the data pins in re- 
sponse to a read request or that the external system 
has latched the data in response to a write request. 
The CPU ignores this signal when no bus requests 
are outstanding. During a bus cycle, BRDY# is sam- 
pled at each clock, starting with the .clock after as- 
sertion of ADS# and continuing until all data for the 
cycle has been transferred. When BRDY# is sam- 
pled active in a read cycle, the data present on the 
pins is sampled. 


Configuring the processor inputs to CMOS or TTL 
levels. 
If the BRDY1# signals is sampled active for at least 
one clock during a three clock window; one clock 
before and two clocks after RESET deactivation, the 
input pins of the processor will respond to the CMOS 
levels. If it is sampled not active for at least 4 clocks; 
two clocks before RESET deactivates and two 
clocks after RESET deactivates the input pins will 
respond to TTL levels. 


4.2.8 BREQ (BUS REQUESn 


BREQ allows the i860 XP microprocessor to share 
the local bus with other bus masters. An external 
bus arbiter can use BREQ to implement an "on de- 
mand only" policy for granting the bus to the i860 XP 
microprocessor. The i860 XP microprocessor as- 
serts BREQ the clock after it realizes an internal re- 
quest for the bus. The system should sample this pin 
only when the i860 XP microprocessor is not in con- 
trol of the bus (that is, when HLDA, BOFF#, or 
AHOLD is active). BREQ is undefined when the 
i860 XP microprocessor is driving the bus. BREQ 
may be deasserted between assertions of ADS#, 
but this does not imply that the CPU does not need 
the bus. 
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4.2.9 BYPASS# (BYPASS) 


This pin is reserved by Intel Corporation and should 
be tied HIGH to vcc through a resistor. When LOW, 
the phase-locked loop that generates the internal 
clock is unused. In this case, the internal clock has 
more skew relative to the external CLK, and the A.C. 
timing parameters are not guaranteed. 
. 


4.2.10 CACHE# (CACHEABILlTY) 


This output signal indicates internal cacheability of a 
bus request. Its timing follows that of the address 
bus. 


The i860 XP microprocessor asserts CACHE# for 
cacheable reads and code fetches to announce its 
intention to cache the data. If CACHE# is asserted 2 
on a read cycle and if the KEN# input is active, the 
cycle is a burst line fill. If CACHE# is inactive in a 
read cycle, the i860 XP microprocessor does not 
cache the returned data, regardless of the KEN# 
pin. CACHE# is also asserted for cache line write- 
backs. 


CACHE# is inactive for noncacheable reads (for ex- 
ample, pfld, 
Idlo, Idlnt), TLB replacements, and 
store misses. 


Table 4.4 shows how cacheability determines the 
number of data transfers in a cycle. 


Note that the CACHE# output is always inactive for 
CS8 (Code-Size 8 bits) mode instruction fetches so 
that the instructions are fetched with single-transfer 
cycles. However, the code fetched may then be 
placed in the instruction cache, unless KEN# was 
inactive. 


4.2.11 ClK 
(CLOCK) 


The CLK input determines execution rate and timing 
of the i860 XP microprocessor. External timing pa- 
rameters are specified relative to the rising edge of 
this signal. The i860 XP microprocessor can utilize a 
clock rate of 50 Mhz. The internal operating frequen- 
cy is the same as the external clock. This signal re- 
quires TTL levels. 


4.2.12 CTYP (CYCLE TYPE) 


CTYP is one of the bus cycle definition signals. Ta- 
bles 4.2 and 4.3 show the types of bus cycle gener- 
ated. CTYP is defined only for data write and read 
requests. The value of this pin changes only when 
ADS# is asserted. 


2-51 


i860™ 
XP MICROPROCESSOR 


4.2.13 D/C# 
(DATA/CODE) 


D/C# 
specifies whether the current request is for 
data or instructions. The datal code line is one of the 
bus cycle definition pins. Tables 4.2 and 4.3 show 
the types of bus cycle generated. The value of this 
pin changes only when ADS# is asserted. 


4.2.14 D63-DO (DATA PINS) 


The bus interface has 64 bidirectional data pins 
(063-00) 
to transfer data in eight- to 64-bit quanti- 
ties. Pins 07 - DOtransfer the least significant byte; 
pins 063-056 
transfer the most significant byte. In 
read cycles, all 64 bits of the data bus are latched, 
even in CS8-mode instruction fetches when only the 
low-order eight bits are used. In write cycles, the 
i860 XP microprocessor does not drive 063-00 
in 
the clock of ADS#, but in the following clock. 


4.2.15 DP7-DPO (DATA PARITY) 


There is one parity signal for each byte of the data 
bus. They are driven by the i860 XP microprocessor 
with even parity information on writes with the same 
timing as write data. likewise, if parity checking is 
enabled by PEN#, the system must drive even pari- 
ty information on these pins with the same timing as 
read information to ensure that the correct parity 
check status is indicated by the i860 XP microproc- 
essor. "Even parity" means that the total number of 
set bits in a byte, including the parity bit,is 
even. 


Refer also to the PCHK# signal. 


4.2.16 EADS# (EXTERNAL ADDRESS STATUS) 


This signal indicates that a valid external address 
has been driven onto address pins A31-A5 
of the 
i860 XP microprocessor to be used for a cache in- 
quiry. This signal is recognized while the processor 
is in hold (HLDA is driven active), while forced off the 
bus with BOFF# input, or while AHOLD is asserted. 
The i860 XP microprocessor ignores EADS# at all 
other times. EADS# is not recognized if HITM# is 
active, nor during the clock after ADS#, nor during 
the clock after a valid assertion of EADS#. Table 
4.5 shows when EAOS is first sampled. It is then 
sampled in every clock as long as the hold remains 
active and HITM# remains inactive. 
Table 4.5. EADS# Sample Time 


Trigger 
EADS# First Sampled 


AHOLD 
Second clock after AHOLD asserted 
HOLD 
First clock after HLDA asserted 
BOFF# 
Second clock after BOFF# asserted 


INVand FLlNE# are sampled in the same clock pe- 
riod that EADS# 
is validly asserted. HIT# 
and 
HITM# may be asserted as the results of a cache 
inquiry. 
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4.2.17 EWBE# (EXTERNAL WRITE BUFFER 
EMPTY) 


Putting the processor in strong ordering mode. 
At reset, the value of EWBE# determines the order- 
ing mode.The processor enters strong ordering 
mode if EWBE# is sampled active for at least four 
clocks; Two clocks before RESET deactivates and 
two clocks after RESET deactivates. It enters weak 
ordering mode if EWBE# is sampled not active for 
at least one clock in the window "one clock before 
RESET deactivates and two clocks after RESET de- 
activates" . 


EWBE# in weak ordering mode does not affect 
processor operation. In strong ordering mode the 
processor is affected only during a store command. 


In strong ordering mode, the external system asserts 
EWBE# as long as all external write buffers are 
empty. If an external write buffer is not empty 
(EWBE# deasserted) or the internal write buffer is 
not empty, the processor delays data cache updates 
so as to keep the external order of writes the same 
as the programmed order. 


In systems that do not have external write buffers, 
EWBE# can be tied to Vss. if strong ordering is de- 
sired, or to vcc. if weak ordering is acceptable. Re- 
fer to sections 5.3.3 and 5.3.4 for more explanation 
and for other ways to control write ordering. 


4.2.18 FLlNE# (FLUSH LINE) 


The system asserts FLlNE# 
to request that the 
i860 XP microprocessor write back a modified cache 
line before other outstanding bus cycles are com- 
pleted, if the line is hit by an external inquiry. If this 
pin is active in the same clock that EADS# is assert- 
ed, the write-back cycle is initiated, and the i860 XP 
microprocessor expects BRDY#s for the write-back 
before outstanding cycles (if any) are returned. If 
data transfer for another cycle is currently in prog- 
ress when FLlNE# is asserted (i.e. first BRDY11 re- 
turned before HITM# asserted), the i860 XP micro- 
processor waits until the data transfers for that burst 
have completed, and only then does it assert the 
ADS# for the write-back. If the first BRDY# has not 
yet occurred for an outstanding cycle, NA# must be 
activated to trigger ADS# for the write-back. 


Putting the processor in the one-clock late back-off 
mode. 


At RESET,the FLlNE 11 pin functions as a configura- 
tion input. The processor enters one-clock 
late 
back-off mode if FLlNEII 
is sampled active for at 
least one clock during a three clock window; one 
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clock before and two clocks after RESET deactiva- 
tion. If it is sampled not active for at least four 
clocks; two clocks before RESET deactivates and 
two clocks after RESET deactivates it will stay in 
normal mode. 


4.2.19 HIT# (CACHE INQUIRY HIT) 


This pin is one output of inquiry cycles. If an inquiry 
cycle hits a valid line in the caches of the i860 XP 
microprocessor (either data or instruction), HIT# is 
asserted two clocks after EADS# is activated. If the 
inquiry cycle misses the caches, this pin is negated 
two clocks after EADS# activation. 


This pin changes its value only as a result of EADS# 
activation during AHOLD, HOLD, or BOFF# and re- 
tains its value until two clocks after the next valid 
activation of EADS#. 


HIT# can be used to control the WB/WT # pin of 
other processors in a multiprocessor system. Activa- 
tion of HIT# indicates that the inquiring processors 
should cache the line as S-state, not E-state. 


4.2.20 HITM# (HIT MODIFIED LINE) 


This pin is an output of inquiry cycles. When an' in- 
quiry hits a modified line in the internal data cache, 
the i860 XP microprocessor asserts HITM# two 
clocks after EADS# is activated. (Refer also to the 
EADS# signaL)The HITM# signal stays active until 
the last BRDY# for the corresponding write-back 
cycle. At all other times, HITM# is inactive. HIT# is 
also asserted when HITM# is asserted (except for 
the special case of an inquiry after the ADS# of a 
write-back). 


4.2.21 HLDA (BUS HOLD ACKNOWLEDGE) 


The i860 XP microprocessor activates HLDA in re- 
sponse to a hold request presented on the HOLD 
pin. Assertion of HLDA indicates that the i860 XP 
microprocessor has given the bus to another local 
bus master. It is driven active in the same clock that 
the i860 XP microprocessor floats its bus. All output 
pins are floated except LOCK#, 
BREQ, HLDA, 
PCHK#, HIT#, and HITM#. 


The time required to acknowledge a hold request is 
one clock plus the number of clocks needed to finish 
any outstanding bus cycles (maximum of four out- 
standing cycles of four burst transfers each for total 
of 16 transfers). If this hold latency is too long for a 
given application, BOFF# can be used instead. 


When leaving a bus hold, the i860 XP microproces- 
sor deactivates HLDA and, in the same clock period, 
initiates a pending bus cycle, if any. 
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4.2.22 HOLD (BUS HOLD) 


This pin, along with the output signal HLDA, is used 
for local bus arbitration. At some time after the 
HOLD signal is asserted, the i860 XP microproces- 
sor releases control of the local bus and puts most 
bus interface outputs in floating state, then asserts 
HLDA-all 
during the same clock period. It main- 
tains this state until HOLD is deasserted. Instruction 
execution stops only if required instructions or data 
cannot be read from the on-chip instruction and data 
caches. The i860 XP microprocessor ignores HOLD 
until all outstanding bus cycles are complete (until 
the last BRDY#). The i860 XP microprocessor rec- 
ognizes HOLD even during RESET and LOCK# . 
HOLD cannot be used when the 82495XP cache 
controller is attached. 


4.2.23 INV (INVALIDATE) 


The external system asserts this signal to invalidate 
the cache-line state in the case of an inquiry cycle 
hit. It is sampled together with A31-A5 in the clock 
EADS# is active. 


4.2.24 INT/Csa (INTERRUPT/CODE-SIZE 
EIGHT BITS) 


This input, like the BERR input, allows interruption of 
the current instruction stream. The processor sam- 
ples INT at instruction boundaries. If interrupts are 
enabled (IM set in par) when INT is sampled active, 
the i860 XP microprocessor fetches the next instruc- 
tion from virtual address OxFFFFFFOO.INT is level 
triggered. To assure that an interrupt is recognized, 
INT should remain asserted until the software ac- 
knowledges the interrupt (by executing an interrupt- 
acknowledge cycle, for example). The interrupt may 
be ignored by the processor if the INT signal does 
not remain active. 


Interrupt latency (the maximum time between asser- 
tion of INT and execution of the first instruction of 
the trap handler) depends both on the internal con- 
text and on the external system. After INT is assert- 
ed, the i860 XP microprocessor finishes all instruc- 
tions currently being executed, including any out- 
standing bus cycles, before starting the trap handler. 
The following instruction sequence is an example of 
the worst case: 


pfld.q 
pfld.q 
ld.l 
br 
ld.l 
st.l 


• 
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If INT is asserted during the execution stage of the 
last Id.l instruction, the execution of the trap handler 
may have to wait for: 


• Two 2-transfer bursts (the pfld instructions) 
• Two data cache line fills (misses by the Id.l in- 
structions) 


• Two data cache line write-backs (eliminating 
modified lines to open space for the fills) 


• Two instruction cache line fills (the target of the 
br and the first instruction of the trap handler) 


• Three TLB miss sequences of up to six nonpipe- 
lined accesses each (the br, the last Id.l, and the 
trap handler) 


The time to finish the above bus activities can be 
extended by inquiry cycles and associated write- 
backs initiated by an external cache or bus control- 
ler. 


Besides the bus-related delays, the i860 XP micro- 
processor has internal freeze conditions that can de- 
lay interrupt response by up to 10 additional clocks. 


During a locked sequence, the INT pin is ignored, 
and the INT bit of epsr reflects the value on the INT 
pin. To limit the time that INT is ignored, the lock 
instruction can assert LOCK# for only 30-33 
in- 
structions before trapping. 


This input is asynchronous, but appropriate setup 
and hold times must be met to insure recognition on 
any specific clock. 


Configuring the processor in CS8 mode. 
The processor enters the code size eight mode 
(CS8) if the INT signal is sampled active for at least 
one clock during a three clock window; two clocks 
before and one clock after RESET deactivation. If it 
is sampled not active for at least 4 clocks; two 
clocks before RESETdeactivates and two clocks af- 
ter RESET deactivates the i860XP microprocessor 
enters normal code size mode. 


4.2.25 KBD, KB1 (CACHE BLOCK) 


For reads, these output signals define which cache 
block (line) is going to receive the data. For write- 
backs, these lines specify which block is being 
flushed. They are driven together with cycle defini- 
tion for cacheable data reads, TLB replacement, 
code fetch cycles, and write-backs. External hard- 
ware can use these signals to observe changes to 
cache blocks. 
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4.2.26 KEN# (CACHE ENABLE) 


The i860 XP microprocessor samples KEN# to de- 
termine whether the data being read for the current 
cache-miss cycle is to be cached. When the i860 XP 
microprocessor generates a read cycle that can be 
cached (CACHE# output active) and KEN# is ac- 
tive, the cycle is transformed into a burst line fill. By 
activating KEN#, the memory system commits to a 
four-transfer burst. The entire 64 bits of the data bus 
are used for the read, regardless of the state of the 
byte-enable pins. 


If KEN# is sampled inactive, code fetches are not 
transferred in bursts, but 128-bit data items may still 
be transferred with a burst length of two. 


KEN# is sampled together with NA# or BRDY#, 
whichever comes first. It is sampled only with the 
first BRDY# of a burst; its value at any other time 
has no effect. 


4.2.27 LEN (DATA LENGTH) 


The LEN output pin specifies the number of burst 
transfers for each cycle. This pin and the CACHE# 
output pin are used by the system to determine the 
burst length for each cycle (refer to Table 4.4). The 
i860 XP microprocessor an generate 1, 2, or 4-trans- 
fer bursts for reads and writes. 


LEN is inactive if the internal request is for 64 bits or 
less. If LEN is active, the internal request is for 128 
bits or more, and the cycle should be returned as a 
two- or four-transfer burst. LEN is always active for 
128-bit data accesses. LEN is always inactive for 
code accesses. 


A cacheable read (CACHE# active) can be auto- 
matically converted to a four-transfer burst regard- 
less of LEN by assertion of KEN# . 


Table 4.4 summarizes different cycle lengths as they 
are calculated from the LEN and CACHE# signals. 
LEN has the same timing as the address. 


4.2.28 LOCK # (ADDRESS LOCK) 


This signal is used to provide atomic (indivisible) 
read-modify-write sequences in multiprocessor sys- 
tems. The address to be locked is the one being 
driven on A31-A3 when LOCK# is activated. A mul- 
tiprocessor bus arbiter must permit only one proces- 
sor a locked read, locked write, or unlocked write to 
that address and must maintain the lock of that loca- 
tion across cycle boundaries until LOCK# deacti- 
vates. The simplest arbitration hardware can just 
lock the entire bus against all other accesses during 
LOCK# assertion; however, software must never 
assume that this implementation is being used. 
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Tile i860 XP microprocessor coordinates the exter- 
nal LOCK# signal with the lock and unlock instruc- 
tions. Programmers do not have to be concerned 
about the fact that bus activity is not always synchro- 
nous with instruction execution. LOCK# is asserted 
with ADS# for the address operand of the first load 
or store instruction executed after the lock 
instruc- 
tion. 


After an unlock instruction, LOCK# is deasserted 
with the next load or store. The i860 XP microproc- 
essor deactivates LOCK# one clock after ADS# for 
the last locked bus cycle. Unlike the i860 XR micro- 
processor, the i860 XP microprocessor does not 
deassert LOCK# immediately when a trap occurs. 
Instead, the trap handler must execute a load or 
store instruction to deassert LOCK#. (The handler 
does not have to execute an unlock 
instruction, 


however. The unlocking function is performed by the 
processor's trap logic.) 


The i860 XP microprocessor also asserts LOCK# 
during TLB miss processing for updates of the ac- 
cessed bit in page-directory and page-table entries. 
The maximum time that LOCK# can be asserted in 
this case is the time required to perform a nonpipe- 
lined, four-byte, read-modify-write sequence. 


Between locked sequences, at least one cycle of no 
LOCK# is guaranteed by the behavior of the unlock 
instruction. 


Between lock and unlock instructions, the INT pin is 
ignored. 


Instruction fetches do not alter the LOCK# signal. 


4_2.29 M/IO# 
(MEMORY-I/O) 


M/IO# specifies whether the current cycle is for the 
memory address space or for the I/O 
address 
space. M/IO# is one of the bus cycle definition pins. 
Tables 4.2 and 4.3 show the types of bus cycle gen- 
erated. The value of this pin changes only when 
ADS# is asserted. 


4.2.30 NA# (NEXT ADDRESS REQUEST) 


NA# makes address pipelining possible. The sys- 
tem asserts NA# for at least one clock to indicate 
that it is ready to accept the next address from the 
i860 XP microprocessor. (If the system does not im- 
plement pipelining, NA# must not be activated.) The 
i860 XP microprocessor samples NA# every clock, 
starting one clock after the activation of ADS#. If 
the i860 XP microprocessor has a new cycle pend- 
ing internally when NA{I is activated, it initiates that 
cycle in the clock after NA# is asserted. Up to three 
bus cycles can be outstanding simultaneously. 
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NA# is latched internally; the i860 XP microproces- 
sor remembers that NA# was asserted until it has 
an internal request to send to the bus; so, assertion 
of NA# for a single clock can trigger an ADS# sev- 
eral clocks later. NA# 
is ignored in the clock of 
ADS#. 


KEN# and WB/WT # inputs for the current cycle 
are sampled with NA#, if NA# is asserted before 
the first BRDY# of the current cycle. 


NA# is also used in conjunction with FLlNE# 
to 
invoke write-back of a modified line during outstand- 
ing bus cycles. 


4.2.31 NENE# (NEXT NEAR) 


The i860 XP microprocessor asserts NENE# when 
the current address is in the same DRAM page as 
the previous bus cycle. This signal allows higher- 
speed reads and writes in the case of consecutive 
accesses to static column or page-mode DRAMs. 
The i860 XP microprocessor determines the DRAM 
page size by inspecting the software-controlled DPS 
field in the dlrbase 
register. The page size can 
range from 29 to 216 64-bit words, supporting DRAM 
sizes from 256K x 1 to 4G x n. The value of this 
pin changes only when ADS# is asserted. NENE# 
is never asserted for the next bus cycle after the 
address 
bus 
has been 
floating 
(after AHOLD, 


BOFF#, or HLDA is deasserted). 


4.2.32 PCD (PAGE CACHE DISABLE) 


PCD provides a cacheability indication on a page by 
page basis. This signal, together with PWT, is set to 
an attribute bit in the page table entry for the current 
cycle. When paging is enabled, PCD corresponds to 
the CD bit (bit 4) of the page table entry. The i860 XP 
microprocessor does not perform a cache fill to any 
page for which CD of the page table entry is set. 
When paging is disabled, or for any cycle that is not 
paged (Idlo, stlo, Idlnt, scyc), the i860 XP micro- 
processor drives PCD inactive. 


During TLB miss processing, PCD is inactive while 
the address translation hardware is accessing the 
first level page directory. During accesses to the 
second-level page-table entry, PCD reflects the CD 
values taken from the first level page-table entry. 


The value of this pin changes only when ADS# is 
asserted. 


4.2.33 PCHK# (PARITY CHECK) 


This output shows the result of the parity check on 
data pins in the previous clock of a read cycle. It is 


2-55 


1860TM XP MICROPROCESSOR 


asserted for one clock when incorrect parity has 
been detected. It reflects the parity status for the 
entire data bus. If the processor detects a parity er- 
ror on the data bus it drives the PCHK# signal irre- 
spective of the status on the PEN# signal for that 
cycle. Therefore, PCHK# should only be sampled 
for those cycles which are known to have valid parity 
attached. In addition, PCHK# should only be sam- 
pled during the clock after BRDY# is activated for 
reads. PCHK# is not active in CS8 mode. 


PCHK# does not terminate outstanding bus cycles, 
so the system must still activate BRDY# a sufficient 
number of times or activate BOFF# for those cy- 
cles. PCHK# is always inactive after any code fetch 
in CS8 mode. 


4_2.34 PCYC (PAGE CYCLE) 
The page cycle line is active during memory read or 
write cycles to distinguish page-table accesses from 
other accesses. The types of bus cycle generated 
are indicated in Tables 4.2 and 4.3. The value of this 
pin changes only when ADS# is asserted. 


4.2.35 PEN# (PARITY ENABLE) 
The i860 XP microprocessor samples this signal for 
read cycles on the same clock edge at which 
BRDY# is found asserted. If sampled active, the 
i860 XP microprocessor feeds the parity check re- 
sult into the interrupt logic. If a parity error is encoun- 
tered, the i860 XP microprocessor vectors to the 
trap handler. The BEAR register latches the offend- 
ing address, as described with the BERR signal. 
This interrupt is not masked by the IM bit of the PSR, 
nor is it masked during lock cycles. 


The system should deassert PEN# any time the 
DP7-DPO pins are known not to reflect the parity of 
the full eight-byte bus (for example, reads from 1/0 
devices or ROMs that are not parity protected). 


The system should deassert PEN# during code 
fetches in CS8 mode. 


Selecting large or small output buffers on the ad- 
dress and other control signals. 
At RESET,the value of the PEN# signal determines 
the output buffer configuration for ADS#, A21-A3, 
BE7#-BEO#, 
W/R# 
and HITM# 
signals. These 
processor pins are configured as normal (small out- 
put buffers) if the PEN# signal is sampled active for 
at least one clock during a three clock window; one 
clock before and two clocks after RESET deactiva- 
tion. If it is sampled not active for at least four 
clocks; two clocks before RESET deactivates and 
two clocks after RESET deactivates these pins will 
configured in high current mode (large output buff- 
ers). 
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4.2.36 PWT (PAGE WRITE-THROUGH) 


PWT provides a write-back/write-through indication 
on a page by page basis. This signal, together with 
PCD, is set to an attribute bit in the page table entry 
for the current cycle. When paging is enabled, PWT 
corresponds to the WT bit (bit 3), and write-back 
caching is implemented for this page only if WT is 
clear. When paging is disabled, or for any cycle that 
is not paged (Id ID, stlo, 
Idlnt, 
SCYC), the i860 XP 
microprocessor drives PWT inactive. 


During TlB miss processing, PWT is inactive while 
the address translation hardware is accessing the 
first level page directory. During accesses to the 
second-level page-table entry, PWT reflects the WT 
value taken from the first level page-table entry. 


The value of this pin changes only when ADS# is 
asserted. 


4.2.37 ~ESET (SYSTEM RESET) 


Asserting RESET for at least ten ClK periods caus- 
es initialization of the i860 XP microprocessor. On 
power up, RESET should remain active at least one 
millisecond after vcc and ClK have reached their 
proper DC and AC specs. 


After the RESET signal goes inactive the processor 
remains in the RESET state for three more clocks. 
Applications that use the HOLD signal to float the 
bus during RESET should keep HOLD active for 
three more clocks after the RESET signal is deacti- 
vated. 


4.2.38 RSRVD, SPARE 
The RSRVD input is reserved by Intel Corporation 
and must be tied HIGH to vcc through a resistor 
(5 KO). The spare input should be left unconnected. 


4.2.39 TCK (TEST CLOCK) 


This is the clock input for the TAP (test access port). 
If the TAP is to be used, this signal must be connect- 
ed to a clock synchronous to ClK. If the TAP is not 
used, TCK can be tied low. TCK does not need to be 
kept running when boundary scan is not active. 


The rising edge of TCK must be externally synchro- 
nized to ClK. The boundary scan latches retain their 
state when TCK is stopped at either logic zero or 
one. 


4.2.40 TDI (TEST DATA INPUT) 


TDI is the input for test instructions and data to the 
TAP. TDI is sampled on the rising edge of TCK. It is 
provided with an internal pull-up resistor, so that an 
open circuit at TDI produces a result equivalent to 
driving continuous HIGH signals. 


4.2.41 TDO (TEST DATA OUTPUT) 


This is the serial output of the TAP. The contents of 
TAP registers are shifted out through TOO on the 
falling edge of TCK. The data is moved from TOI to 
TOO without inversion, which allows easy serial cas- 
cading of different components for scanning. 


TOO is held in high-impedance state, except while 
scanning is in progress. This allows parallel connec- 
tion of these outputs for several components. 


4.2.42 TMS (TEST MODE SELECT) 


This input is decoded by the TAP to select the oper- 
ation of the TAP. It is sampled at the rising edge of 
TCK. It is provided with an internal pull-up resistor to 
assure deterministic behavior for open-circuit failure 
at this pin. If boundary scan is not used, TMS can be 
tied high or left unconnected. 


4.2.43 TRST 11 (TEST RESET) 


This input resets the TAP. If the TAP is not used, 
TRSTII should be tied lOW. To ensure determinist- 
ic behavior of the test logic, TMS should be held 
HIGH while TRST 11 changes from lOW to HIGH. 


4.2.44 Vcc (SYSTEM POWER) AND Vss 
(GROUND) 


The i860 XP microprocessor has 54 pins for power 
and 56 for ground. All pins must be connected-to the 
appropriate low-inductance power and ground sig- 
nals in the system. 
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4.2.45 VccCLK (CLOCK POWER) 


This is the power supply for the internal ClK buffer. 
It should be connected to the same vcc plane as 
the other vcc pins. 


4.2.46 WB/WT 11 (WRITE-BACK/WRITE- 
THROUGH) 


This input signal defines cache policy for the line 
being accessed in the current bus cycle. The proc- 
essor samples WB/WT 11 for both reads and writes 
on the same clock edge at which it finds NAil or the 
first BROY11 asserted, whichever comes first. If this 
signal is sampled low, the write-through policy is ap- 
plied to the cache line-if 
an internal write hits this 
line, it causes a write-through cycle. If this signal is 
sampled high, the write-back policy is applied-fu- 
ture write hits to this line do not show up on the bus. 


4.2.47 W/RII 
(WRITE/READ) 


This pin specifies whether a bus cycle is a read 
(lOW) or write (HIGH) cycle. Tables 4.2 and 4.3 
show the types of bus cycle generated. The value of 
this pin changes only when AOSII is asserted. 


5.0 
BUS OPERATION 


The interaction among signals is illustrated by timing 
diagrams. Figure 5.1 shows the conventions used in 
the timing diagrams. 
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NOTES: 
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Figure 5.1. Timing Diagram Conventions 
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5.1 Bus Cycles 


A bus cycle begins when the i860 XP microproces- 
sor activates ADS# and ends when the system acti- 
vates the last of a predetermined number of BRDY# 
signals. Figure 4.4 shows how the i860 XP micro- 
processor and the external system cooperate to de- 
termine the number of BRDY# activations in each 
cycle. The processor starts sampling BRDY# one 
clock after assertion of ADS# and continues sam- 
pling in every clock until the last BRDY# becomes 
active. 


The i860 XP microprocessor supports several differ- 
ent types of bus cycle. These are introduced in order 
of complexity: 
1. Single-transfer cycles 
2. Multiple-transfer (burst) cycles 
3. Pipelined cycles 
4. Cache inquiry cycles 


5.1.1 SINGLE-TRANSFER CYCLE 


The simplest bus cycle is the single-transfer, non- 
cacheable, 64-bit cycle either with or without wait 
states. The shortest bus cycle is two clock periods 
long. Read and write cycles of this type are shown in 
Figure 5.2. 


A wait state is any clock in which the i860 XP micro- 
processor samples BRDY# but the-system does not 
assert it. The system can add wait states to any cy- 
cle. Figure 5.3 shows cycles with two wait states 
added. Any number of wait states can be added to 
i860 XP microprocessor bus cycles by maintaining 
BRDY# inactive. 
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Figure 5.2. Fastest Single-Transfer Cycles 
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5.1.2 
BURST 
CYCLES 


Figure 5.3. Single-Transfer 
Cycles with Wait States 


When a bus request requires more than a single 
data transfer (refer to Table 4.4), the i860 XP micro- 
processor requires that the memory system perform 
a burst data transfer. Burst cycles allow the maxi- 
mum bus transfer rate by eliminating unnecessary 
driving of the address bus. The addresses of the 
data items in burst cycles all fall within the same 32- 
byte aligned area (corresponding to an internal 
i860 XP microprocessor cache line). Given the ad- 
dress of the first transfer, external hardware can cal- 
culate the addresses of subsequent transfers. With 
these addresses eliminated from the bus, a new 
data item can be sampled into the i860 XP micro- 
processor every clock period. 


The fastest possible burst cycle requires two clock 
periods for the first data item: one clock for ADS# 
and one clock for BRDY#; subsequent data items 
are transferred every clock period. One such bus 
cycle is shown in Figure 5.4. Note that, in this case, 


the initial cycle generated by the i860 XP microproc- 
essor could be satisfied by a ~ingle data transfer, but 
the system transforms it into a multiple-transfer 
cache line fill by activating KEN# in the clock period 
of the first BRDY#. KEN# has this effect only if the 
CACHE# pin is active, which means the cycle is in- 
ternally cacheable in the i860 XP microprocessor. 


Read data is sampled only in the clock period in 
which BRDY# is returned, which means that data 
need not be sent to the i860 XP microprocessor ev- 
ery clock period in the burst cycle. Figure 5.5 shows 
an example of a burst cycle in which two clock peri- 
ods are required for every burst item. 


The burst length attributes LEN and CACHE# are 
driven with the address. Figure 5.6 illustrates two 
consecutive burst cycles with differing length attri- 
butes: the first one is a noncacheable 128-bit read, 
.and the second one is a cache line fill initiated by a 
cacheable 64-bit read. 
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Figure 5.4. Basic Burst Cycle 
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Figure 5.5. Slow Burst Cycle 
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Figure 5.6. Different Lengths of Burst Cycles 


The timing of write bursts is similar to that of read 
bursts. The i860 XP microprocessor does not put 
data on 063-00 
for writes until the clock period af- 
ter AOS#. 


When initiating any read, the i860 XP microproces- 
sor presents the address for the data item request- 
ed. When the cycle is converted into a cache fill the 
first data item returned corresponds to the add~ess 
sent out by the i860 XP microprocessor. The remain- 
ing items must be returned in the order shown in 
Table 5.1. This ordering is optimized for two-bank 
memories, but works equally well with noninter- 
leaved memories. 


In i860 XP microprocessor systems, memory must 
support the burst order as defined in Table 5.1 for 
reads. For writes, the burst addresses are always 
increasing, so writes with four transfers match the 
first line of the table. In CS8 (code-size 8 bits) mode 
instructions are not fetched in bursts. 
' 


Note that the i860 XP microprocessor drives only 
the first address of a burst cycle; the memory sys- 
tem is responsible for calculating subsequent ad- 
dresses as shown in the table. The addresses can 
be derived by complementing A3 after every trans- 
fer, and complementing A4 after two transfers. 


Table 5.1. Burst Order for Cache Line Transfers 


1st 
2nd 
3rd 
4th 
Address 
Address 
Address 
Address 


0 
8 
Ox10 
Ox18 
8 
0 
Ox18 
Ox10 
Ox10 
Ox18 
0 
8 
Ox18 
Ox10 
8 
0 


5.1.3 PIPELlNED CYCLES 


A pipelined 
cycle is one that starts while one or two 
other bus cycles are outstanding. A cycle is consid- 
ered outstanding 
until the last BROY# is asserted to 
terminate that cycle. A nonpipelinoo 
cycle is one 
that starts when no other bus cycles are outstand- 
ing. Both types of cycle can be either read or write 
cycles. To allow high transfer rates in large memory 
systems, the i860 XP microprocessor supports two- 
level pipelining. New cycles can start as often as 
every other clock until three cycles are outstanding. 


The system asserts NA# 
to 
indicate that the 
i860 XP microprocessor can start another cycle be- 
fore the current one is completed. (NA# can even 
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be asserted while BRDY# is active.) The i860 XP 
microprocessor begins sampling NA# in the next 
clock after ADS# is asserted. If the following condi- 
tions are met, a new (pipelined) cycle begins: 
1. NA# having been active 
2. An internal request pending 
3. Compatibility between the pending request and 
the outstanding requests (refer to Table 5.2) 
4. HOLD, BOFF#, and AHOLD not ac;tive 
5. Fewer than three cycles outstanding 


• Reads can be pipelined into TLB miss writes. TLB 
misses for instructions can be pipelined into data 
accesses, and vice versa. 


• No data cycle is ever pipelined while LOCK# is 
active. 


• 
1/0 cycles, special cycles, and Idlnt cycles never 
begin when any cycle is outstanding. 


NA# may be asserted before, simultaneously with, 
or after the first BRDY# of the current cycle. If NA# 
is asserted before the first BRDY#, the cacheability 
(KEN#) and cache policy ry.JB/WT#) 
indicators for 
the current cycle are sampled during the same clock 
period as NA# is sampled active; otherwise, they 
are sampled with the first BRDY#. Figure 5.7 shows 
an example of four-transfer, pipelined, back-to-back 
reads. Note the timing of KEN#. Because NA# is 
asserted before the first BRDY# of the cycle A, 
KEN# is sampled with the NA# for cycle B. 


Table 5.2. Pipeline Cycle Compatibility 


The following "compatibility" rules determine when 
the processor does not issue a pipelined ADS# 
(they are the source of Table 5.2): 
• Data cache line fills are pipelined into each other 
only in the case of an aliasing virtual tag miss with 
a physical tag hit. 


B 
If A Is Outstanding, can B be Plpellned Into It? 


Data 
Data Cache 
Data Cache 
Cache 
Store Miss, 
Read Miss 
Wrlte- 
Instruction 
TLB 
Idlo, stlo, 
LOCK # 
A 
Back" 
Fetch 
pfld 
Miss 
Idlnt, scyc 
Active 
Line Fill 
Wrlte-Thru 
KEN#=1 


Data 
Cache 
YES' 
YES' 
YES' 
YES 
YES 
YES' 
YES 
NO 
YES 
Line Fill 


Data Cache 
Store Miss, 
YES 
YES 
YES 
YES 
YES 
YES 
YES 
NO 
YES 
Wrlte·Thru 


Data Cache 
Read Miss 
YES' 
YES' 
YES' 
YES 
YES 
YES 
YES 
NO 
YES 
w 
..J 
KEN# = 1 
CJ>- 
CJ 
Write-Back 
YES 
YES 
YES 
NO 
YES 
YES 
YES 
NO 
YES 
III:) 
Instruction 
0 
YES 
YES 
YES 
YES 
YES 
s 
Fetch 
YES 
YES 
NO 
YES 
w 
a:: 
Do. 
pfld 
YES 
YES 
YES 
YES 
YES 
YES 
YES 
NO 
YES 


TLBMlss 
YES 
YES 
YES 
YES 
YES 
YES 
YES 
NO 
YES 


stlo 
YES 
YES 
YES 
YES 
YES 
YES 
YES 
NO 
YES 
scyc 


Idlo 
NO 
NO 
Idlnt 
NO 
NO 
YES 
NO 
YES 
NO 
NO 


LOCK # 
NO 
NO 
Active 
NO 
NO 
YES 
NO 
YES 
NO 
NO 


NOTE: 
, Pipelining can occur if the first ADS# is for an aliasing virtual tag miss with a physical tag hit. 
"Inquiry 
write-backs are not pipelined into prior cycle unless FLlNE# is asserted. 
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Figure 5.7. Plpellned Cache Line Fills 


Write cycles can be pipelined into read cycles and 
vice versa, but, in both cases, the processor will 
leave one clock between bursts to allow bus turn- 
over, and will ignore any BRDY*' given to it at that 
time. Pipelined back-to-back read and write cycles 
are shown in Figure 5.8.On writes, assertion of NA*, 
does not cause the values on the data bus to 
change; it just enables new address and cycle speci- 
fication outputs. 


5.1.4 INTERRUPT ACKNOWLEDGE CYCLES 


In response to a trap caused by assertion of the INT 
pin, trap-handling software can generate interrupt 
acknowledge cycles by executing a procedure simi- 
lar to the following. 


lIThe 
following 
lock 
ldint.b src2, 
or 
rdest, 


unlock 


Ilnop 
//nop 


ldint.b rO, 


lock instruction must be on a 32-byte boundary: 
II Lock the bus 
rdest 
II First INTA cycle. Src2 contains 8. 
rO, rdest 
II Won't proceed until rdest loaded. 
II Unlock the bus after the next ldint 
II Insert 4 + <number 
of NOPs> 
idle 
II 
clocks for 8259A recovery. 


rdest 
II 
Second INTA cycle 
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Figure 5_8_ Plpellned 
Back-to-Back 
Read and Write Cycles 


Figure 5.9 shows the interrupt acknowledge cycles 
generated by the code sequence. Interrupt acknowl- 
edge cycles are generated in locked pairs. The inter- 
rupt vector is returned during the second cycle. Each 
of the interrupt acknowledge cycles is terminated 
when the external system responds by asserting 
BRDY11. Wait states can be added by withholding 
BRDY11. There must be a number of idle clocks be- 
tween the first and second cycles to allow for 8259A 
recovery time. The software controls the number of 
intervening clocks via the number of nop instruc- 
tions in the interrupt acknowledge routine. 


5_1_5 
SPECIAL 
BUS CYCLES 


The i860 XP microprocessor provides a special cy- 
cle to indicate to the external system that certain 
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internal conditions have occurred. The special bus 
cycle (indicated by M/lOll 
= 0, D/CII 
= 0, and 
W/RII 
= 1) is generated by the i860 XP microproc- 
essor as a response to scyc 
instruction execution. 
This cycle (defined in Table 5.3) is used to flush or 
invalidate a secondary cache. The defined value of 
byte enables can be generated by using an appropri- 
ate address operand in the scyc 
instruction. The 
scyc instruction does not have any effect on the 
internal caches. External hardware must acknowl- 
edge a special bus cycle by asserting BRDY11 once. 
The data driven on the data bus with BRDY11 is un- 
defined. The effect of scyc is determined by decod- 
ers in external hardware. 


f\r1ut:LJ;\r 
I:i~t-t-#, the processor 
activates 
BREQ 
in the clock after an internal 
bus request 
is generat- 
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HLDA may be asserted as soon as the clock period 
after the one in which HOLD is asserted. HLDA may 
be deasserted as soon as the clock after the one in 
which HOLD is deasserted. 


An example HOLD/HLDA transaction is shown in 
Figure 5.10. The i860 XP microprocessor recognizes 
HOLD even while RESET is asserted, and it drives 
HLDA in this case as well. 


HOLD is recognized even when BOFF# is active, 
and the i860 XP microprocessor responds with 
HLDA the same as when the bus is idle. 


5.2.2 
BUS CYCLE 
BACK-oFF 
AND RESTART 


The i860 XP microprocessor provides the ability to 
abort bus cycles and restart them again. It is neces- 
sary to abort cycles for reasons such as the follow- 
ing: 
1. Retry after an error is detected by ECC or parity 
logic. 
2. Escape from a deadlock; for example, when the 
i860 XP microprocessor is using A31-A3 to load 
a new cache line, but the 82495XP cache con- 
troller needs A31-A5 
to invalidate a line in the 
CPU cache which the 82495XP cache contr.oller 
is replacing in its cache in order to satisfy the 
CPU's line-fill request. 
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3. Maintain cache consistency; for example, the 
i860 XP microprocessor is attempting to read or 
write to a line that has been modified in the cache 
of another CPU. 
4. Prevent illegal access to an address already 
locked by another CPU in a multiprocessor sys- 
tem. 


5.2.2.1 
Cycle Back-Off 


Bus cycles are aborted when the system asserts 
BOFF#. The i860 XP microprocessor samples this 
pin in every clock period that it is driving the bus. 
When BOFF# is asserted, the i860 XP microproces- 
sor immediately (in the next clock period) floats the 
bus. It floats the ADS# pin one clock period later, 
thereby giving time for ADS# to be deasserted so 
that it is not left floating active. The i860 XP micro- 
processor floats the same pins as for HOLD, but 
HLDA is not asserted. If a bus cycle is in progress at 
the time BOFF# is asserted, the cycle is aborted, 
and, in a read cycle, any data returned to the proc- 
essor while BOFF# is active is ignored. BOFF# 
overrides BRDY#; so, if both are sampled active in 
the same clock, BRDY# is ignored. BOFF# aborts 
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FIgure 5.10. HOLD/HLDA 
Handshake 
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a burst cycle even if it arrives with the last BRDYIf 
of the cycle. However, for read bursts, data transfers 
completed before assertion of BOFFIf are used by 
the processor if they satisfy an internal request. 
Cacheable data is cached in spite of BOFFIf; how- 
ever, the cached data is overwritten when the cycle 
is restarted. 


The bus remains in the high-impedance state until 
BOFFIf is deasserted. If cycles need to be restarted 
or if a new internal request has been generated, the 
BREQ signal is asserted within four clock periods 
after the assertion of BOFFIf . 


5.2.2.2 
Cycle Restart 


When the system deasserts BOFFIf, the i860 XP 
microprocessor restarts aborted bus cycles from the 
beginning by driving the address and status (A31- 
A3, W/RIf, 
D/CIf, 
etc.) and asserting ADSIf. 
If 
more than one cycle was outstanding when BOFFIf 
was asserted, the i860 XP microprocessor restarts 
all outstanding cycles in the same order. If HITMIf is 
active due to an inquiry, the write-back for it will be 
the first cycle after deassertion of BOFF#. BOFF# 
restarts all aborted cycles except: 


• The stale cycles mentioned in section 5.3.5. 
• The read that may have been generated by an 
alias hit (virtual tag miss, but physical tag hit). 


• The read that may have been generated by a 
pfld that hit the data cache. 


If the processor's KENIf pin was active (with NAif 
or first BRDYIf) before the cycle was aborted, exter- 
nal hardware must activate it again after the cycle is 
restarted. In other words, the system cannot use 
BOFFIf to change the cacheability of a cycle via 
KENIf. 


The LOCKIf signal is not affected by restarted cy- 
cles; it retains its state in spite of BOFFIf assertion. 


5.2.2.3 
Late Back-Off 
Modes 


In some cases the 
logic that needs to assert 
BOFFIf cannot make the necessary decision in time 
to cancel the relevant cycle or data transfer. For ex- 
ample: 
1. The result of checking ECC or parity may not be 
available until one or two cycles after the BRDYIf 
to which it corresponds. 
. 


2. When the i860 XP microprocessor is attempting 
to read or write to a line that might be modified in 
the cache of another processor on the same bus, 
it may be advantageous to let part of a burst run 
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in parallel with inquiries to the other processors, 
rather than delay the entire burst until the inquir- 
ies are finished. 


For such situations, the i860 XP microprocessor pro- 
vides late back-off mode. For a read cycle in this 
mode, the processor employs a buffer to internally 
delay data and BRDYIf, which allows BOFFIf as- 
sertion 
to 
be delayed 
relative to 
the 
external 
BRDYIf. 
Likewise, for a write cycle in this mode, 


BOFFIf 
assertion 
can 
be 
delayed 
relative 
to 
BRDY#. However, data for a write cycle is not de- 
layed. 


Two f1avorsof late back-off mode are provided: 
1. One allows BOFF# to be delayed by one clock 
period relative to the data transfer. The proces- 2 
sor enters one-clock late back-off mode when 
the FLlNEIf pin has been sampled active for at 
least three clock periods when RESET deacti- 
vates. 
2. The other allows BOFFIf to be delayed by up to 
two clock periods relative to the data transfer. 
The i860 XP microprocessor enters this mode 
when software sets the LB bit of the dlrbase reg- 
ister. 


If the processor enters one-clock late back-off mode 
during RESET, it is impossible to enter two-clock 
late back-off mode. The LB bit has no effect. Fur- 
thermore, software cannot exit two-clock late back- 
off mode once it is activated, and the LB bit cannot 
be cleared except by resetting the processor. 


Figures 5.12-5.17 illustrate variations on late back- 
off mode cycles. BOFFIf can be (and usually'is) as- 
serted longer than one clock period, as Figure 5.11 
shows; the remaining figures show an active time of 
only one clock. 


5.2.2.4 
One-Clock 
Late Back-Off 
Mode 


In one-clock late back-off mode the data is delayed 
internally by one clock before it is used. 


In this mode, data and BRDYIf are seen by internal 
logic one clock period later than they appear on the 
bus, which is equivalent to adding an extra wait state 
to reads on the external bus (Figure 5.13). All re- 
sponses to BRDYIf (assertion of the ADSIf for the 
next cycle, assertion of HLDA in response to a 
HOLD request, and deassertion of HITMIf) are de- 
layed by one clock period compared to the normal 
mode of operation. Not delayed, however, are write 
data on D63-DO and sampling of KEN# and WB/ 
WTIf. KENIf and WB/WT If must be valid with the 
first BRDYIf assertion. Also, the response to NA# 
(assertion of ADSIf) 
is not delayed if fewer than 
three pipelined cycles are outstanding. 
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NOTES: 
A Noncacheable. 
64-bit 
cycle 
(one transfer) 
B Next cycle 
(any type) 
1. BOFF # cancels 
cycle 
and data transfer 
2. Cycle A restarts 
one clock 
after BOFF # is deasserted 
3. Earliest 
ADS# 
assertion 
for next cycle 


Figure 5.11. Normal Back-Off 
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NOTES: 
A Noncacheable. 
64-bit cycle 
(one transfer) 
B Next cycle 
(any type) 
1. BOFF # cancels 
cycle 
and data transfer 
2. Cycle A restarts 
one clock 
after BOFF # is deasserted 
3. Earliest 
ADS# 
assertion 
for next cycle 


Figure 5.12. One-Clock 
Normal Back-Off 
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Figure 5.13. Fastest Nonplpellned Cycles In One-Clock Late Back-Off Mode 


If BOFF11 is asserted as late as the second BRDY11 
(Figure 5.14), it cancels the entire cycle, ignores 
data latched with the first BRD'?11, and ignores the 
data being driven with the second BRDY11. This is 
true of a two-transfer burst (shown) as well as a four- 
transfer burst (not shown). 


In a two-transfer burst, if BOFF11 is asserted in the 
clock after the second BRDY11 (Figure 5.15), it still 
cancels the cycle. 


In a four-transfer burst, if BOFF11 is asserted within 
one clock' after the last BRDY11 (Figure 5.16), it still 
forces a retry of the cycle, but previously transferred 
read data is used by the processor if it satisfies the 
read request 


5.2.2.5 Two-Clock Late Back-Qff Mode 


Two-clock late back-off mode gives external logic 
even more time to decide to use BOFF11. In this 


mode,'data delivery is delayed by either one or two 
clock periods, depending on external activity. For 
any BRDY11, the data is delayed by one clock peri- 
od. If in the next clock period BRDY11 is again as- 
serted, the previous data is used. However, if in that 
next clock period BRDY11 remains inactive, the data 
is delayed for one extra clock period before it is 
used. The responses to BRDY11 (assertion of the 
ADSII 
for the next cycle, assertion of HLDA, and 
deassertion of HITMII) are delayed by one or two 
clock periods, depending on the value of BRDY11 in 
the next clock. The response to NA 11 (assertion of 
ADS 11) is not delayed if fewer than three pipelined 
cycles are outstanding. 


The st.c dlrbase 
instruction that sets the LB bit 
must be aligned on a 32-byte boundary and must be 
followed by seven nop instructions. Software must 
not enable late back-off mode when the processor is 
used with the 82495XP external cache controller. 
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NOTES: 
A Noncacheable, 
128-bit 
cycle 
(two transfers) 
B Next cycle 
(any type) 
1. BOFF'" 
cancels 
both transfers 
(A1 in buffer, 
A2 on 063-00) 
2. Cycle A restarts 
one clock 
after BOFF'" 
is deasserted 
3. Earliest 
ADS'" 
assertion 
for next cycle 


Figure 5_14_ One-Clock 
Late Back-Qff 
Mode (Case 1) 
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NOTES: 
A Noncacheable, 
128-bit 
cycle 
(two transfers) 
B Next cycle 
(noncacheable) 
1. BOFF'" 
cancels 
both transfers 
(A2 in buffer 
is needed 
to satisfy 
request) 
2. Cycle A restarts 
one clock 
after BOFF'" 
is deasserted 
3. Earliest 
ADS'" 
assertion 
for next cycle 


Figure 5.15. One-Clock 
Late Back-Qff 
Mode (Case 2) 
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NOTES: 
A Cacheable 64-bit (or less) cycle (four transfers) 
B Next cycle (any type) 
1. BOFF# cancels A2 and A3 transfers. but A1 transfer has already satisfied request 
2. Cycle A restarts one clock after BOFF# is deasserted 
3. Earliest ADS# assertion for next cycle 


Figure 
5.16. One-Clock 
Late Back-Off 
Mode (Case 3) 
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Figure 
5.17. Two-Clock 
Late Back-Off 
Mode 


5.3 Cache Inquiry Cycles (Snooping) 
dress bus of the i860 XP microprocessor 
is bidirec- 
tional 
in order to allow the address 
of inquiry to be 
driven by the system. An inquiry cycle can begin dur- 
ing any hold state: 


1. While HOLD and HLDA are asserted. 


2. While BOFF # is asserted. 


3. While AHOLD 
(address 
hold) is asserted. 


Another 
processor 
initiates an inquiry cycle to check 
whether 
an address 
is cached 
in the internal 
data or 
instruction 
cache of the i860 XP microprocessor. 
An 
inquiry cycle differs 
from any other cycle in that it is 


initiated 
externally 
to the 
i860 XP microprocessor, 


and the signal for beginning 
the cycle is EADS# 
(Ex- 
ternal 
Address 
Status) 
instead 
of ADS #. The 
ad- 


• 
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If neither a HOLD nor a BOFF11 is in effect, the sys- 
tem can assert AHOLD to interrupt the current bus 
activity. 


EADS11 is first sampled two clocks after BOFF11 or 
AHOLD assertion, or one clock after HLDA. This al- 
lows time for the processor to float A31-A5 and for 
the system to stabilize the inquiry address there. 


In the clock in which EADS11 is asserted, the 
i860 
XP microprocessor 
samples these 
inputs, 
which qualify the type of inquiry: 
INV 
Specifies whether the line (if found) must 
be invalidated (that is, changed to I-state). 
FLINEII 
Specifies whether the line (if found in M- 
state) must be written back immediately or 
after outstanding bus cycles are complet- 
ed. 


The i860 XP microprocessor compares the address 
of the inquiry request with addresses of lines in 
cache and of any line in the write-back buffer waiting 
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to be transferred on the bus. It does not, however, 
compare with the address of write-miss data in the 
write buffers. Two clock periods after sampling 
EADS11, the i860 XP drives the results of the inquiry 
look-up on these output pins: 
HITII 
Specifies whether the address was found 
(active) or not found (inactive). 
HITM 11 If active, the line found was in the M-state; 
if inactive, the line was in E- or S-state, or 
was not found. 


Figure 5.18 shows an inquiry with AHOLD that miss- 
es the cache. When the system asserts AHOLD, the 
i860 XP microprocessor floats A31-A3 
in the next 
clock period. It does not, however, assert HLDA; no 
acknowledge is required. Once the address pins are 
floating, external logic drives the address for the in- 
quiry on A31-A5 and starts the inquiry cycle by acti- 
vating EADS11. The i860 XP microprocessor does 
not begin sampling EADSII until the second clock 
after AHOLD is activated. EADSII activation may be 
delayed any number of clocks. 
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Figure 5.18. Inquiry Miss Cycle 
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The earliest that AHOLD can be deasserted is the 
clock after EADS# assertion. However, by maintain- 
ing AHOLD active, multiple inquiry cycles can be ex- 
ecuted in one AHOLD session (Figure 5.19). The 
i860 XP microprocessor can accept inquiry cycles at 
a rate of one every other clock period, unless a 
write-back is required. The earliest that ADS# can 
be asserted for the next cycle is the clock after 
AHOLD deassertion. 


The second inquiry in Figure 5.19 hits an unmodified 
line in the cache. When a cache line with matching 
address is found and the INV input signal is asserted 
(as in this case), that line is invalidated (changed to 
l-state). If the INV signal is inactive, the line enters 
S-state. 


i860™ XP MICROPROCESSOR 


5.3.1 INQUIRY WRITE·BACK CYCLES 


If an inquiry cycle hits a dirty (M-state) line in the 
i860 XP microprocessor cache, the i860 XP micro- 
processor asserts the HITM# signal to indicate that 
the line will be written on the bus. The HITM# output 
becomes valid in the same clock period as HIT#. In 
this case the modified line is written out, and the 
cache entry is changed to either I or S state accord- 
ing to INV. The HITM# signal stays active through 
the last BRDY# for the corresponding write-back 
cycle. 


An inquiry write-back cycle is similar to ordinary 
write-back cycles. It is initiated by assertion of 
ADS#. ADS# is asserted even when the AHOLD• 


ClK 


I 
I 
I 
I 


___ 
FR_O_W_C_P_U:_J} - -:--< TO 
CPU Xt:t::::ttX TO 
cPU} 
- 
-:- 
- 
{\, 
__ 
: 
F••:R_OW_C_PU•••• 
_ 


I 
I 
I 
I 
:tttt:t:tt::tt::t:::t::t::::t::::t:tt::tu.A:f::f'JJ:::~:::fffffifffffffff:JJJ:ffffffffff:JJJ:: 


AHOLD 
,. 


INV 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
::t::f:tt::t!tttt:::ttt::tttt:t:t:8::tt:::tt:::t:::mtttttt:::t:t:t:tt::tt::t:::::::!:::::ttt:t:::::i:t:t:::::::t::t: 


I 
I 
I 
I 
I 
I 


I 


HITII' 


ADS*' 


________________ 
~I • 
'tI 
I 


VV 


240874-46 


NOTES: 
A Outstanding 
cycle 
(for example, 
a single-transfer 
read) finishes 
during the inquiry 


B Earliest 
inquiry, 
no invalidation 


C Earliest 
successive 
inquiry, with invalidation 


1. EADS #0 is not sampled 
in the clock 
after its assertion 


2. Inquiry B misses 
cache 


3. Earliest 
deassertion 
of AHOLD 
is one clock 
after last assertion 
of EADS#o 
4. Inquiry C hits cache, 
invalidates 
line 


5. Earliest 
assertion 
of ADS #0for next cycle 
is one clock 
after deassertion 
of AHOLD 
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Figure 5.19. Fastest Inquiry Cycles (Miss and Hit) 


2-73 


ADDRESS 


CACHE/; 


-I 
-, 
-, 
-I 
-, 
I 
I 
I 
I 
I 


AHOLD 


EADS/; 


INV 


I 
I 
I 
I 


t??(?????::::(::::::(??r:--t~??:::~}:: 
:::~::::::)::~:::{:::~:?==::::~:::::::::~: 
::::(~((?~f(::~::::::::::~:::t::::~:.t:(~(((~:}: 
:::~::}}:~::::::}~::}!:::l~::}:(~:::}::::~}: 
(~(:}::~:::?::::~:}:(:~}}}:! 


I 
I 
I 
I 
I 
\~~~------------_:--~I 
• 


HITIoI/; 


HIT/; 
\ 


LEN/; 


ADS/; 


BRDY/; 


DATA 


240874-47 


NOTES: 
A Outstanding cycle (for example, a single-transfer read) 
W Write-back cycle 
1. EADS# is not sampled while HITM# is active 
2. Earliest ADS# assertion if not delayed by outstanding cycle 
3. ADS# for write-back delayed by outstanding cycle 
4. HITM# deactivates after last BRDY# of write-back 
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Figure 5.20. Inquiry Hit Cycle with Write-Back 


I 


The fact that a write-back cycle is initiated while ad- 
dress lines are floating supports multiple inquiries 
(with write-backs) during a single AHOLD session. 
This is especially useful during secondary cache re- 
placement processing, when the secondary-cache 
line is larger than that of the i860 XP microproces- 
sor. 
. 


Note that EADS# is ignored as long as HITM# is 
active. If the system is executing a series of inquir- 
ies, it might happen that the HITM# assertion for 
one inquiry masks the EADS# for a subsequent in- 
quiry. In that case the system must reassert EADS# 
to restart the masked inquiry. 


Inquiries can occur during a hold due to HOLD/ 
HLDA or BOFF#. However, in these cases, the cy- 
cle definition pins and ADS# are floating. If an in- 
quiry requires a write-back, the HOLD' or BOFF# 
must be deasserted so that the cycle definition pins 
and ADS# can be driven to start the write-back cy- 
cle. If HITM# is active at the time of ADS#, the first 
ADS# 
issued after HOLD is deasserted corre- 
sponds to the write-back of the modified line which 
was snooped. 


5.3.2 
SNOOPING 
RESPONSIBILITY 
LIMITS 


The i860 XP microprocessor takes responsibility for 
responding to inquiry cycles for a cache line only 
during the time that the line is actually in the cache 
or in a write-back buffer. There are times during the 
cache line fill cycle and during the cache replace- 
ment cycle when the line is "in transit", and inquiry 
(snooping) responsibility must be taken by other sys- 
tem components. 


Systems designers should consider the possibility 
that an inquiry cycle may arrive at the same time as 
a cache line fill or replacement for the same ad- 
dress. This situation can occur: 
' 


• In multiprocessor systems that have external 
(secondary) caches 
with 
separate CPU and 
memory busses, thereby allowing concurrent ac- 
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tivity on the two busses. In such systems, it is 
desirable to run invalidation cycles concurrently 
with other i860 XP microprocessor bus activity. It 
can happen that writes on the memory bus cause 
invalidation requests to the i860 XP microproces- 
sor at the same time that the i860 XP microproc- 
essor fetches data from the. secondary cache. 
Such events can occur at any time relative to 
each other. 
• 
In multiprocessor systems with no secondary 
cache, if memory is dual-ported. In such systems, 
two processors can simultaneously read the 
same line, each sending an inquiry to the other. 


The simultaneous activities considered here may be 
for different data items in the same cache line. Un- 
less the inquiry request is timed carefully with re- 2 
spect to the cache fill cycle, the cache-consistency 
mechanism may be subverted, and data inconsist- 
encies may result (for example, both CPUs may get 
the line in E-state on a read). If the 82495XP and 
82490XP cache is being used, the timing with re- 
spect to the i860 XP microprocessor is handled cor- 
rectly by the cache controller; however, the same 
problem may arise between the memory system and 
the secondary cache. 


There are two cases to consider: 
1. Inquiry for a line that is being cached. 
2. Inquiry for a line that is being replaced. 


5.3.2.1 
Inquiry for a Line Being Cached 


The i860 XP microprocessor accepts an inquiry cy- 
cle at any time, even if it hits the line being cached at 
that time. Regardless of the timing of the cycle, the 
i860 XP microprocessor delivers the read data to the 
load instruction that initiated the read request. How- 
ever, the timing of the invalidation cycle determines 
whether the line is placed in the cache and what 
value the i860 XP microprocessor drives on HIT#. 
Table 5.4 summarizes the different cases. 


Table 5.4. Inquiry for a Line being Cached 


EADS # before 
EADS# 
after 
or with NA# 
NA# 
or 
or 1st BRDY# 
1stBRDY# 


Line is cached? 
YES 
NO 


HIT#= 
Inactive 
Active 


Data/Instruction 
YES 
YES 
used by CPU? 
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If EADS# is asserted before or with the sampling of 
KEN#, the processor cannot match the address of 
the line being cached with an invalidation request. 
Thus, the processor does not assert HIT#. The ex- 
ternal system must satisfy the inquiry with the cor- 
rect data and WB/WT # status. If invalidation of that 
line is required, the system must do one of the fol- 
lowing: 
• Delay assertion of EADS# until one clock after 
assertion of KEN# . 


• Reassert EADS# after KEN#. 


• Make KEN# inactive at the first BRDY# or NA#, 
thereby preventing the line from being cached. 


Figures 5.21 and 5.22 show when the i860 XP micro- 
processor picks up responsibility for inquiries for a 
line that it is caching. Figure 5.21 shows the earliest 
EADS# 
assertion that invalidates the line being 
cached relative to the first BRDY# for nonpipelined 
cycles. Figure 5.22 shows the earliest EADS# as- 
sertion that invalidates the line being cached relative 
to the first NA# for pipelined cycles. These timings 
hold for normal and late back-off modes. 
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NOTES: 
A Cache 
line fill cycle 
S Snoop 
(inquiry) 
cycle 
R Addresses 
of cache 
line fill and snoop 
are the same 
1. Earliest 
EADS'" 
assertion 
that can invalidate 
line being filled 


Figure 5.21. Snoop Responsibility 
Pickup (Nonpipelined Cycle) 
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NOTES: 
A Cache 
line fill cycle 
B Next cycle 
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cycle 
R Addresses 
of cycles 
A and S are the same 
1. Earliest 
EADS# 
assertion 
that can invalidate 
line being filled 
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Figure 5.22. Snoop Responsibility 
Plckup (Pipellned 
Cycle) 


5.3.2.2 
Inquiry 
for a Line Being Replaced 


When the i860 XP microprocessor is replacing a line, 
there are two cases: 
1. If the replacement does not require write-back, 
the address being replaced can be matched by 
an inquiry until assertion of NA# or first BRDY# 
of the line-fill cycle. From that point on, the in- 
quiry has no effect. 


2. If the replacement requires a write-back, the ad- 
dress being replaced can be matched by an in- 
quiry until assertion of the last BRDY# for the 
write-back. An EADS# as late as two clocks be- 
fore the last BRDY# can cause HITM# to be 
asserted. 


Figures 5.23 through 5.25 show when the i860 XP 
microprocessor drops responsibility for recognizing 
inquiries for a line that it is writing back. They show 
the latest EADS# assertion that can cause HITM# 
assertion. In late back-off mode, EADS# can be as- 
serted later, because BRDY# is internally delayed 
(Figures 5.24 and 5.25). 
\ 


In all these cases, HITM# remains active for only 
one clock period. HITM#, as always, remains active 
through the last BRDY# of the corresponding write- 
back; in these cases the write-back has already 
completed. 


If an inquiry cycle hits the write-back address after 
its ADS# has been issued, the i860 XP microproc- 
essor asserts HITM#; however, HIT# is deassert- 
ed. This unique combination of values on HIT# and 
HITM# indicates that the write-back cycle corre- 
sponding to the HITM# has already been issued. 


•• 
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NOTES: 
A Write-back 
cycle 
S Snoop 
(inquiry) 
cycle 
R Addresses 
of cycles 
A and S are the same 


Figure 5.23. Latest Snooping of Write-Back (Not Late Back-Off Mode) 
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NOTES: 
A Write-back 
cycle 
S Snoop 
(inquiry) 
cycle 
R Addresses 
of cycles 
A and S are the same 


Figure 5.24. Latest Snooping of Write-Back (One-Clock Late Back-Off Mode) 
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Figure 5_25.Latest Snooping of Write-Back (Two-Clock Late Back-Off Mode) 


5.3.3 WRITE CYCLE REORDERING DUE TO 
BUFFERING 


The MESI cache protocol and the ability to perform 
and respond to inquiry cycles guarantee that writes 
to the cache are logically equivalent to writes that go 
to memory. In particular, the order of read and write 
operations on cached data is the same as if the op- 
erations were on data in memory. Even uncached 
memory read and write requests usually occur on 
the external bus in the same order that they are is- 
sued in the program. For example, when a write miss 
is followed by a read miss, the write data goes onto 
the bus before the read request is put on the bus. 
However, the posting of writes in write buffers cou- 
pled with inquiry cycles may cause the order of 
writes seen on the external bus to differ from the 
order they appear in the program. Consider the fol- 
lowing example, which is illustrated in Figure 5.26: 
1. Three bus cycles are outstanding. 
2. Processor 1 executes a store to address A, which 
misses the cache. This store is posted; that is, 
the data is latched in the write buffer while the 
processor continues execution without waiting for 
the store to be completed on the bus. In this case 
the store is not even put on the bus because 
there are already three outstanding cycles. 


3. Processor 1 executes a store to address B, which 
hits the cache. 
4. Processor 2 executes an inquiry for address B. 
Processor 1 looks in its cache, finds the modified 
line, asserts HIT# and HITM#, and executes a 
write-back cycle to address B, while the data for 
address A is still in the write buffer. . 


5. Processor 1 issues the write to address A on the 
bus. 
. 


In this example, the original order of the writes has 
been changed. In most cases it is not necessary that 
the ordering of writes be strictly maintained. But 
there are cases (for example, semaphore updates in 
a multiprocessor system) that require stores to be 
observed externally in the same order as pro- 
grammed. There are several ways to ensure seriali- 
zation of stores: 
1. Bracket one of the stores with the lock and un- 
lock 
instructions. That forces serialization of the 
stores (refer to section 5.4). In the above exam- 
p~eof a store-miss followed by store-hit, locking 
either store would ensure that the internal store- 
hit does not update the cache until the miss gets 
to the external bus. 


2. Apply the write-through policy to the critical data, 


b~ ~etting WT=1 in the page table entries or by 
driVingthe WB/WT # pin low. 


• 
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Figure 5.26. Write Reordering due to Buffering 


3. Configure the 
processor for Strong Ordering 


Mode by asserting EWBE*" during RESET. 


Option 1 is implementable by user-level programs, 
while option 2 is an operating-system level solution, 
not directly implementable by user-level code. Op- 
tion 3, the hardware solution, is discussed in greater 
detail in section 5.3.4. 


5.3.4 STRONG ORDERING MODE 


In strong ordering mode, the processor delays up- 
dates to its internal data cache in either of these 
conditions: 
1. The internal write buffer is not empty. 


2. An external write buffer is not empty (the external 
system signals this condition by deactivating the 
EWBE*" signal). 


By delaying the cache update until all write buffers 
are empty, the i860 XP microprocessor avoids the 
out-of-order sequence shown in section 5.3.3. 


In strong ordering mode, EWBE*" can be deassert- 
ed only between the ADS*" and the last BRDY*" of 
a store. The earliest deassertion is the clock after 
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ADS*"; the latest deassertion is together with the 
last BRDY*". EWBE*" can be reasserted at any 
time, except when the processor is performing an 
inquiry write-back. In other words, EWBE*" must not 
activate while HITM*" is active. FSDWhen EWBE*" 
is asserted, indicating that the external write buffer is 
empty, the processor can complete any cache up- 
date that may have been delayed by its deassertion 
(buffer full indication). 


Figure 5.27 shows how an external cache can use 
EWBE*" when a store miss in the i860 XP micro- 
processor is also a miss in the external cache. 


An external cache controller should also refrain from 
updating the external cache while EWBE*" is active. 


In strong ordering mode cache hit store cycles may 
appear on the data bus. 
In strong ordering mode the processor delays up- 
dates to its internal data cache if the internal write 
buffers are full. These writes may in fact appear on 
the external data bus. 
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NOTE: 
1. Assumes the external cache needs four cycles to write the data to memory. 
2. Pending internal data cache updates are delayed until the clock in which EWBE# 
is sampled LOW. 


Figure 5.27. Timing of EWBE# 


5.3.5 
SCHEDULING 
INQUIRY 
WRITE-BACK 
CYCLES 


In order to preserve 
system-wide 
ordering 
of memo- 
ry transactions 
in multiprocessor 
systems 
that have 
a pipelined 
or split-transaction 
memory 
bus, it may 
be necessary 
to get the data corresponding 
to an 
inquiry 
hit before 
outstanding 
bus cycles 
are com- 
pleted. 
Another 
bus master 
can always 
request 
an 
inquiry while the i860 XP microprocessor 
has cycles 
outstanding 
on the bus. However, 
when AHOLD 
is 
asserted, 
the i860 XP microprocessor 
normally com- 
pletes 
outstanding 
cycles 
before 
it performs 
any 
write-back 
that may be required. 
The i860 XP micro- 
processor 
provides 
two methods 
for causing 
the in- 
quiry write-back 
before 
outstanding 
cycles 
are com- 
pleted: 


FLlNE# 
When 
FLlNE# 
is 
asserted 
during 
the 
EADS # of an inquiry that hits an M-state 
line, the i860 XP microprocessor 
issues a 
write-back 
cycle and writes the dirty line to 


. memory 
before the outstanding 
bus cycles 
are completed. 


BOFF # 
If there are outstanding 
cycles on the bus, 
asserting 
BOFF # clears 
the bus pipeline. 
If an inquiry causes HITM # to be asserted, 
then the first cycle 
issued 
by the i860 XP 
microprocessor 
after 
deassertion 
of 
BOFF # is the inquiry write-back 
cycle. Af- 
ter the 
inquiry 
write-back, 
it reissues 
the 
aborted_cycles. 


5.3.5.1 
Choosing 
between 
FLlNE# 
and BOFF# 


FLlNE #, although 
the more efficient 
choice, 
cannot 
handle a" situations. 
Under certain 
circumstances, 
it 
can happen 
that outstanding 
stores 
on the bus cor- 


respond 
to data that is obsolete 
relative 
to the data 
in the cache, 
because 
a subsequent 
store 
has up- 
dated the cache after the ADS# 
for the outstanding 
store has occurred. 
For example: 


• 
An aliasing 
store hit, in which 
a cache virtual-tag 
miss occurs and the ADS# 
is issued at the same 
time as a physical-tag 
hit. Then the cached 
data 
would be updated 
before external 
memory, 
and a 
subsequent 
store 
to 
the 
new 
virtual 
address 
could 
also update 
cache 
before 
the outstanding 
bus store completed. 


• 
Back-ta-back 
writes to the same line can also up- 
date the cache 
more recently 
than the bus when 
the 
write-once 
update 
policy 
is employed. 
The 
first write updates the cache and generates 
a bus 
write request, 
but the second 
write only updates 
the cache. 


In both of these examples 
the outstanding 
stores on 
the bus are obsolete 
relative to the data in the cache 
line. 
If an inquiry 
cycle 
hits a line and 
this 
line is 
written 
back out of order (that is, before outstanding 
stores are completed), 
special care should be taken 
to discard 
the outstanding 
stores. 


The easiest 
way to avoid this situation 
is not to as- 
sert FLlNE# 
when 
stores 
are outstanding, 
but use 
BOFF # instead. 
If out-of-order 
write-back 
is imple- 
mented 
with 
BOFF #, 
the 
i860 XP microprocessor 
does not restart 
the outstanding 
store to that line if 
such a store has been obsoleted 
by a later cache hit 
store. 
That 
is, the i860 XP microprocessor 
detects 
this condition 
and kills the obsolete 
data. However, 


lock-bracketed 
stores 
(including 
the last store in the 
lock sequence) 
are restarted 
by the i860 XP micro- 
processor, 
because 
lock-bracketed 
stores 
update 
the cache only after BRDY # is returned. 


• 
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If, on the other hand, out-of-order write-back is im- 
plemented by using only the FLlNE# pin, the exter- 
nal system must return BRDY#s 
for outstanding 
stores, but the data must be ignored if it has already 
been written out by an inquiry write-back. 


Note that if a replacement write-back is in progress 
(ADS# has been issued, but last BRDY# has not 
occurred) and an inquiry hits the same line that is 
being written back, the FLlNE# pin is ignored. The 
system can recognize this special case by the fact 
that HITM# is asserted while HIT# is deasserted. If 
other cycles are outstanding and it is necessary to 
write the line back before the other cycles, BOFF# 
can be used. 


5.3.5.2 Reordering Write-Backs with FLINE# 


FLlNE# must be active during the EADS# that initi- 
ates an inquiry. BRDY# must not be asserted forthe 
previously issued cycles while HITM# is active. If 
HITM# is asserted while the data transfer of the 
outstanding cycle is in progress (Le. first BRDY# 
has been asserted, but the entire transfer has not 


yet been completed), the i860 XP microprocessor 
waits for the current cycle to complete, and only 
then issues the write-back. After the last BRDY# for 
the ongoing burst (if any), BRDY# is ignored until 
the clock period after ADS# 
is asserted for the 
write-back. 


From the viewpoint of the i860 XP microprocessor, 
an inquiry write-back cycle is just another bus cycle; 
so, if there is an outstanding cycle at the time of 
FLlNE# and HITM# activation, the system must as- 
sert NA# to initiate the write-back. 


Figure 5.28 illustrates simple cycle reordering, when 
FLlNE# is not asserted during the data transfer of 
another cycle. The outstanding request could be ei- 
ther a read or write. 


Figure 5.29 shows the case in which FLlNE# is as- 
serted after data transfer for the outstanding cycle 
has already started. In this case, the i860 XP micro- 
processor does not issue a write-back until the out- 
standing transfer is completed. NA# is needed in 
this example only if other outstanding cycles remain. 
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Figure 5.28. Cycle Reordering via FLlNE# (No Ongoing Burst) 
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NOTES: 
1. BRDY# is ignored by CPU from end of ongoing burst through ADS# of write-back, even if other cycles remain 
outstanding 
2. NA# required only if another cycle is outstanding 
3. If the first BRDY# is asserted here or sooner (relative to HITM#), the outstanding cycle completes before the 
FLlNE# write·back. 


Figure 5.29. Cycle Reordering 
via FLlNE'" 
(Ongoing 
Burst) 
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5.3.5.3 Reordering Write-Backs with BOFF11 


Back-eft 
cycles 
are discussed 
in general 
in Section 
5.2.2. Figure 5.30 shows 
how BOFF 11 can be used 
to cancel outstanding 
cycles so that an inquiry write- 
back can take place immediately. 


5.4 The LOCK# Cycle Attribute 


The processor 
asserts the LOCK 11 signal when sev- 
eral accesses 
to a single 
memory 
location 
must be 
effectively 
uninterruptible. 
By causing 
LOCK 11 to be 
asserted. 
a programmer 
can. for example. 
increment 
the contents 
of a memory 
variable 
and be assured 
that the variable 
will not be accessed 
between 
the 
read and the update of that variable. 


NOTES: 
. 


A Outstanding 
cycle 
(for example. 
noncacheable 
128-bit 
read) W Write-back 
cycle 
1. AHOLD 
begins 
an inquiry while one cycle 
is outstanding. 


2. Earliest 
assertion 
of EADS# 
is two clocks 
after assertion 
of AHOLD 
3. Inquiry hits modified 
line. 


4. Assertion 
of BOFF # aborts 
the outstanding 
cycle. 


5. BRDY# 
asserted 
during 
BOFF# 
is ignored 
by CPU. 


6. Write-back 
begins 
after deassertion 
of BOFF # . 
7. Earliest 
assertion 
of ADS# 
for restart 
of cycle A (assuming 
no pipelining). 


elK 


10 
12 
13 
15 
is 
11 


AHOLD 


. 
\.V 


!lI':::f '.'.'.'.'.'."'.",.".,.,.,.,.,.,.""""""" 
r~~rrr~._'_ .•kI~~~':~:~I::::::.:.:.,.,.,.,.,.,.,.,.,.,.,.,.,.,.,.,.,.,.,.'.'.'.',',',",',',' ~II:f~'~'::~:m~'~:::'i':" 
""""""':':~tf~~:~~~trr~I~:~~~~~:~r~:r"'I~r:~rrrrrrr~~rrirrr~I~r:':~:::t:::~:'rr~'r! 


, 
\~~'----~~--------------~I 
i 
\ 
; 


: 
: 
\--:--<,,-.----------_ 
y 


: 
i 
\ __:__,,,~:=_---------_ I 


• 
• • 
• 
~~--+-~--~-----r--~\JLJ 
~ 


, 
• r!!~~mm=!\ 
BRDY" 
~rf:':':::""""""'·'·'·'·'······ 
,A2 
W 1 
, 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
~----~----~----~----:..----:--~-~-----:--~--:--r;:rv-. 
II"'I~II~I~ 


CACHE- 


DATA 


240874-57 


Figure 5.30. Cycle Reordering via BOFF11 (Ongoing Burst) 


2-84 


The memory location to be locked is the one whose 
address is driven during the cycle in which LOCK# 
is first activated. In multiprocessor systems, external 
hardware should guarantee that no other processor 
is granted a locked read, locked write, or unlocked 
write to the same location until LOCK# is deassert- 
ed. The i860 XP microprocessor has no hardware 
provision to prevent another master from also lock- 
ing the variable; this responsibility falls on the bus 
arbiter. In the simplest implementation, the arbiter 
can globally prevent other masters from accessing 
the bus. 


Not all cycles affect the value of LOCK#. Code 
fetches, write-backs due to replacement or inquiry, 
and cycles restarted due to BOFF# do not affect 
LOCK# . Any other type of cycle can be used to initi- 
ate or terminate LOCK#, including cache line fills, 
interrupt acknowledge, 1/0, and special cycles. 


Data accesses with LOCK# asserted are not pipe- 
lined, and other data cycles are not pipelined while a 
LOCK# 
cycle 
remains 
outstanding. 
Instruction 
fetches, however, may be pipelined during lock. 


The i860 XP microprocessor can run very long lock 
sequences; therefore, to guarantee reasonable bus 
turnover latency in multimaster systems, the i860 XP 
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microprocessor recognizes bus hold (HOLD), ad- 
dress hold (AHOLD), and back-off (BOFF#) while 
the LOCK# signal is active. In spite of such inter- 
vening conditions, the arbiter should prevent any 
other bus master from also locking or updating the 
variable the i860 XP microprocessor locked. In sim- 
ple systems the HOLD input can be masked by the 
LOCK# output (that is, the external logic that gener- 
ates HOLD can AND the LOCK# signal with other 
hold conditions). More sophisticated systems, how- 
ever, may allow the bus to be turned over while 
LOCK# is asserted. 


Whatever the lock implementation, arbiter design 
must, in one case, allow another processor to write 
the locked variable. That case is when another 
i860 XP microprocessor or master asserts HITM# in 
response to the inquiry generated by the locking 
processor's initial read. That other master must write 
back the locked variable before the i860 XP micro- 
processor can read it. This HITM# write-back must 
always be allowed. 


The timing of LOCK# is shown in Figure 5.31. Note 
that LOCK# is asserted in the same clock period as 
ADS# for the locked address, but is deasserted in 
the clock period after ADS# for the unlocking load 
or store. 


ADS- 


ADDRESS 
x 
o 
x 


BRDY- 


DATA 


LOCK- 


NOTES: 
L Locking 
access 
U Unlocking 
access 
1. This address 
is to be locked 
2. LOCK'" 
is asserted 
with ADS'" 
3. LOCK'" 
is deasserted 
one clock 
after ADS'" 
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Figure 5.31. LOCK# 
Timing 
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5.5 
RESET Initialization 
bus during RESET should keep HOLD active for 
three more clocksafter the RESET signal is deacti- 
vated. 
Initialization of the i860 XP microprocessor is caused 
when the system asserts the RESET signal for at 
least ten clocks. Table 5.5 shows the status of out- 
put pins during the time that RESET is asserted. 
Note that the bidirectional data pins (D63-DO and 
DP7-DPO) are floated during RESET,though the bi- 
directional A31-A3 pins are not. If the i860 XP mi- 
croprocessor is used with 82495~P and 82496XP 
cache, however, the latter do float the bidirectional 
pins they share with i860 XP microprocessor during 
RESET. Note that HOLD requests are honored dur- 
ing RESET and that the HLDA output signal may 
also become active. The status of output pins de- 
pends on whether a HOLD request is being acknowl- 
edged. Note also that the test logic may be active 
during RESET and that the EXTEST instruction may 
drive other values on the output pins. 


Some aspects of processor configuration are deter- 
mined by asserting input signals during RESET. To 
select a given option, the corresponding input must 
be asserted for at least the last three clocks before 
the falling edge of RESET; to deselect, the corre- 
sponding input must be deasserted for at least the 
last three clocks before the falling edge of RESET: 
EWBE# 
Enter strong ordering mode. 
FLINE# 
Enter one clock late back-off mode. 


INTleS8 Enter eight-bit code-size mode. 
PEN# 
Enter normal (small output buffers) cur- 
rent mode. 


Figure 5.32 shows how configuration pins are sam- 
pled during the three clock periods just before the 
After the RESET signal goes inactive the processor 
falling edge of RESET. No inputs besides EWBE#, 
remains in the RESET state for three more clocks. 
HOLD, FLlNE#, INT/CS8, and PEN# are sampled 
Applications that use the HOLD signal to float the 
during RESET. 


Table 5_5_Output Pin Status during Reset 


Pin Value 
Pin Name 
HOLD 
HOLD 
Not Acknowledged 
Acknowledged 


BREQ 
LOW 
LOW 
HLDA 
LOW 
HIGH 
W/R#, 
PWT, PCD 
LOW 
Tristate OFF 
ADS# 
HIGH 
Tristate OFF 
063-00, 
DP7-DPO 
Tristate OFF 
Tristate OFF 
A31-A3, BE7#O-BEO#, NENE# CACHE#, CTYP, D/C#, 
Undefined 
Tristate OFF 
KBO,KB1, LEN, M/IO#, PCYC 
PCHK#, HIT# 
Undefined 
Undefined 
HITM#, LOCK# 
HIGH 
HIGH 


NOTE: 
This table does not apply 
if the test logic is running 
the EXTEST 
instruction. 
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HDlD 


HLOA 


240874-59 
NOTE: 
1. The CPU samples 
these 
inputs 
in the clocks 
preceding 
the falling 
edge of reset. 


2. This pin should 
be active two clocks 
prior to the falling 
edge of reset. 


Figure 5.32. Reset Activities 
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While in eight-bit code-size mode, instruction cache 
misses are one-byte reads (transferred on 07-00 
of 
the data bus) instead of eight-byte reads. This allows 
the i860 XP microprocessor to be bootstrapped from 
an eight-bit ROM. For these code reads, byte en- 
ables BE2#'-BEO#' are redefined to be the lowor- 
der three bits of the address, so that a complete 
byte address is available. The entire eight-byte data 
bus continues to be parity-checked by the i860 XP 
microprocessor during CS8-mode instruction fetch- 
es; therefore, external hardware must either gener- 
ate good parity on all eight bytes or disable parity 
traps by deasserting PEN#' during CS8 mode. 


While in this mode, instructions must reside in an 
eight-bit wide memory, while data must reside in a 
separate 64-bit wide memory. After the code has 
been loaded into 64-bit memory, initialization code 
can initiate 64-bit code fetches by clearing the CS8 
bit of the dlrbase 
register (refer to section 2). Once 
eight-bit code-size mode is disabled by software, it 
cannot be reenabled except by resetting the i860 XP 
microprocessor. 


Instruction fetches in CS8 mode update the instruc- 
tion cache if KEN#' is asserted during NA#' or all of 
the first eight BROY#'s (refer to section 4.2.26). 
They are pipelined if NA#' is asserted. When used 
with the 82495XP and 82496XP cache, CS8 mode 
works only if the ROM locations are made non- 
cacheable. 


'6.0 
TESTABILITY 


The i860 XP microprocessor provides testability fea- 
tures compatible with the proposed Standard 
Test 
Access 
Port and Boundary-Scan 
Architecture 
(IEEE 
Std. P1149.1/06). The subset of the standard test 
logic implemented in the i860 XP microprocessor 
provides for testing the interconnections between 
the i860 XP microprocessor and other integrated cir- 
cuits once they have been assembled onto a printed 
circuit board. 


The test logic consists of a boundary-scan register 
and other building blocks that are accessed through 
a test access port (TAP).The TAP provides a simple 
serial interface that makes it possible to test all sig- 
nal traces with only a few probes. 


The TAP can be controlled by a bus master. The bus 
master can be either automatic test equipment or a 
component that interfaces to a four-pin test bus: 


i860TM XP MICROPROCESSOR 


6.1 
Te$t Architecture 


The test logic contains the following elements: 


• Test access port (TAP), which consists of input 
pins TMS, TCK, TOI, and TRST#'; and output pin 
TOO. 


• TAP controller, which receives the dedicated test 
clock (TCK) and interprets the signals on the test 
mode select (TMS) line. The TAP controller gen- 
erates clock and control signals for the instruc- 
tion and test data registers and for other parts of 
the test logic. 


• Instruction register (IR), which allows instruction 
codes to be shifted into the test logic. The in- 
struction codes are used to select the test to be 
performed or the test data register to be ac- 
cessed. 
• Test data registers: Bypass Register (BPR), De- 
vice Identification Register (DID), and Boundary- 
Scan Register (BSR). 


The instruction and test data registers are separate 
shift-register paths connected in parallel and having 
a common serial data input and a common serial 
data output connected to the TAP TOI and TOO sig- 
nals respectively. 


6.2 
Test Data Registers 


The test logic contains the following data registers: 


• 
Bypass 
Register 
(BPR): BPR is a one-bit shift 
register that provides a minimum-length path be- 
tween TOI and TOO when no test operation of 
the component is required. This allows more rap- 
id movement of test data to and from other board 
components that are required to perform test op- 
erations. While running through BPR, the data is 
transferred without inversion from TOI to TOO. 


• 
Device 
Identification 
Register 
(DID): This reg- 
ister contains the manufacturer's identification 
code, part number code, and version code in the 
format shown by Figure 6.1. The values are: man- 
ufacturer's identification code (9), part number 
code (61AO),version code (8), entire 32-bit value 
(Ox861A0013). 


• 
Boundary 
Scan Register 
(BSR): The BSR is a 
single shift-register path containing 150 cells that 
are connected to all input and output pins of the 
i860 XP microprocessor. Figure 6.2 shows the 
logical structure of the BSR. Input cells only cap- 
ture data; they do not affect operation of the 
i860 XP microprocessor. Data is transferred with- 
out inversion from TOI to TOO through the BSR 
during scanning. The BSR can be operated by 
the EXTEST and SAMPLE instructions. 


• 
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Figure 6.1. Format of DID Register 
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Figure 6.2. Logical Structure of BSR Register 


6.3 Instruction Register 


The Instruction Register (IR) selects the test to be 
performed and the test data register to be accessed. 
It is four bits wide, with no parity bit. Table 6.1 shows 
the encoding of the instructions supported by the 
TAP controller of the i860 XP microprocessor. The 
rightmost bit is the least significant and is the first 
shifted out on TOO. 


Table 6.1. TAP Instruction 
Encoding 


Instruction 
Code 
Instruction 


0000 
EXTEST Boundary Scan 
0001 
SAMPLE Boundary Scan 
0010 
IOCOOE 
0011 ... 1110 
Intel reserved 
CAUTION' 
1111 
BYPASS 


• CAUTION: Operation of these private instructions may 
cause damage to the component. 
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EXTEST The BSR cells associated with output pins 
drive the output pins of the i860 XP micro- 
processor. Values scanned into the BSR 


. cells become the output values. The BSR 
cells associated with input pins sample 
the inputs of the i860 XP microprocessor. 
Note that 1/0 pins can be input or output 
for this test, depending on their control 
setting. The values shifted to the input 
latches are not used by the internal logic 
of the i860 XP microprocessor. After use 
of the EXTEST command, the i860 XP mi- 
croprocessor must be reset (with the RE- 
SET signal) before normal use. 
SAMPLE The BSR cells associated with output pins 
sample the value driven by the i860 XP 
microprocessor. 
BSR cells 
associated 
with input pins sample on the rising edge 
of TCK the values driven to the i860 XP 


microprocessor. 
BSR cells 
associated 
with 1/0 pins sample the value on the re- 
spective pin. The 1/0 pin can be driven by 
the i860 XP microprocessor or by external 
hardware. The values shifted to the input 
latches are not used by the internal logic 
of the i860 XP microprocessor. 


IDCODE 
The identification code of the i860 XP mi- 
croprocessor from the 
010 register is 
passed to TOO. The 010 register is not 
altered by data shifted in on TOI. 


BYPASS 
Test data is passed from TOI to TOO via 
the single-bit BPR, effectively bypassing 
the test logic of the i860 XP microproces- 
sor. Because of its special encoding, this 
instruction can be entered by holding TOI 
HIGH while completing an instruction- 
scan cycle. This reduces the demands on 
the host test system in cases where ac- 
cess is required, for example, only to chip 
57 on a 100-chip board. 
Note that an open circuit fault in the 
board-level test data path causes the 
BPR register to be selected following an 
instruction-scan cycle, because the TOI 
input has a pull-up resistor. Therefore, no 
unwanted interference with the operation 
of the on-chip system logic can occur. 


Table 6.2 defines which registers are active during 
execution of each instruction. 


6.4 
TAP Controller 


The TAP Controller is a synchronous, finite state 
machine. It controls the sequence of operations of 
the test logic. The TAP Controller changes state 
only in response to the following events: 
1. A rising edge of TCK. 
2. A transition to logic zero at the TRST# input. 


3. Power-up. 
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The value of the TMS input signal at a rising edge of 
TCK controls the sequence of state changes. The 
state diagram for the TAP controller is shown in Fig- 
ure 6.3. Test designers must consider the operation 
of the state machine in order to design the correct 
sequence of values to drive on TMS. 


6.4.1 
TEST-LOGIC-RESET 
STATE 


In this state, the test logic is disabled so that normal 
operation of the i860 XP microprocessor can contin- 
ue unhindered. This is achieved by initializing the in- 
struction register such that the 10COOEinstruction 
is loaded. No matter what the original state of the 
controller, the controller enters 
Test-Logic-Reset 
when the TMS input is held HIGH for at least five 
rising edges of TCK. The controller remains in this 
state while TMS is HIGH. 


If the controller leaves the Test-Logic-Reset 
state as 
a result of an erroneous LOW signal on the TMS line 
at the time of a rising edge of TCK (for example, a 
glitch due to external interference), it returns to the 
Test-Logic-Reset 
state following three rising edges 
of TCK while the TMS signal at the intended HIGH 
logic level. The operation of the test logic is such 
that no disturbance is caused to on-chip system log- 
ic operation as the result of such an error. On leav- 
ing the Test-Logic-Reset 
state, the controller moves 
into the Run-Test/Idle 
state, where no action occurs 
because the current instruction has been set to se- 
lect operation of the 010 register. The test logic is 
also inactive in the Select-OR-Scan 
and 
Select-fR- 
Scan states. 


The TAP controller is also forced to the Test-Logic- 
Reset 
state by applying a LOW logic level to the 
TRST# input and at power-up. 


Table 6.2. Registers 
Active by Instruction 


Register 
Mode 
BSR 
DID 
BPR 


EXTEST 
TOI - 
BSR - 
TOO 
Inactive 
Inactive 
SAMPLE 
TOI - 
BSR - 
TOO 
Inactive 
Inactive 
10COOE 
Inactive 
OIO-TOO 
Inactive 
BYPASS 
Inactive 
Inactive 
TOI - 
BPR - 
TOO 
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NOTE: 
0,1 The values 
present 
on TMS at the time of a rising edge on TCK. 


Figure 6.3. TAP Controller State Diagram 


6.4.2 RUN-TEST/IDLE STATE 
6.4.3 SELECT-OR-SCAN STATE 


The controller enters this state between scan opera- 
tions. Once in this state, the controller remains in 
this state as long as TMS is held LOW. No activity 
occurs in the test logic. The instruction register and 
all test data registers retain their previous state. 
When TMS is HIGH and a rising edge is applied to 
TCK. the controller moves to the Select-OR-Scan 
state. 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held LOW and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-OR 
state, and a scan se- 
quence for the selected test data register is initiated. 
If TMS is held HIGH and a rising edge is applied to 
TCK, the controller moves to the Select-IR-Scan 
state. 


The instruction does not change in this state. 
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6.4.4 
SELECT-IR-SCAN 
STATE 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held LOW and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-IR 
state, and a scan se- 
quence tor the instruction register is initiated. If TMS 
is held HIGH and a rising edge is applied to TCK, the 
controller moves to the Test-Logic-Reset 
state. 


The instruction does not change in this state. 


6.4.5 
CAPTURE-DR 
STATE 


In this state, the BSR captures input pin data if the 
current instruction is EXTESTor SAMPLE.The other 
test data registers, which do not have parallel input, 
are not changed. 


The instruction does not change in this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-0R 
state if TMS is HIGH or the Shift-OR state 


if TMS is LOW. 


6.4.6 
SHIFT-DR 
STATE 


In this controller state, the test data register con- 
nected between TOI and TOO as a result of the cur- 
rent instruction shifts data one stage toward its serial 
output on each rising edge of TCK. 


The instruction does not change in this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-0R 
state if TMS is HIGH or remains in the 
Shift-OR state if TMS is LOW. 


6.4.7 
EXIT1-DR 
STATE 


This is a temporary state. If TMS is held HIGH, a 
rising edge applied to TCK while in this state causes 
the controller to enter the Update-OR 
state, which 
terminates the scanning process. If TMS is held low 
and a rising edge is applied to TCK, the controller 
enters the Pause-OR 
state. 


The test data register selected by the current in- 
struction retains its previous state unchanged. The 
instruction does not change in this state. 
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6.4.8 
PAUSE-DR 
STATE 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the test data 
register in the serial path between TOI and TOO. 
This might be necessary, for example, to allow the 
tester to reload its pin memory from disk during ap- 
plication of a long test sequence. 


The test data register selected by the current in- 
struction retains its previous state. The instruction 
does not change in this state. 


The controller remains in this state as long as TMS 
is LOW. When TMS goes HIGH and a rising edge is 
applied to TCK, the controller moves to the Exit2-0R 
state. 


6.4.9 
EXIT2-DR 
STATE 


This is a temporary state. If TMS is held HIGH and a 
rising edge is applied to TCK, the scanning process 
terminates, 
and the 
TAP 
controller 
enters 
the 
Update-OR 
state. If TMS is held LOW and a rising 
edge is applied to TCK, the controller enters the 
Shift-OR state. 


The test data register selected by the current in- 
struction retains its previous state unchanged. The 
instruction does not change in this state. 


6.4.10 
UPDATE-DR 
STATE 


The BSR register is provided with a latched parallel 
output to prevent changes at the parallel output 
while data is shifted in response to the EXTEST and 
SAMPLE instructions. When the TAP controller is in 
this state and the BSR register is selected, data is 
latched onto the parallel output of this register from 
the shift-register path on the falling edge of TCK. 
The data held at the latched parallel output does not 
change other than in this state. 


All shift-register stages in test data registers select- 
ed by the current instruction retain their previous 
state unchanged. The instruction does not change in 
this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Select-OR-Scan 
state if TMS is held HIGH or the 
Run-Test/Idle 
state if TMS is held LOW. 


6.4.11 
CAPTURE-IR 
STATE 


In this controller state the shift register contained in 
the instruction register loads the fixed value 0001 on 
the rising edge of TCK. 


• 
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The test data register selected by the current in- 
struction retains its previous state. The instruction 
does not change in this state. 


When the controller is in this state and a rising edge 
is applied to TCK, the controller enters the Exit1-IR 
state if TMS is held HIGH or the Shift-IR state if TMS 
is held LOW. 


6.4.12 SHIFT-IR STATE 


In this state, the shift register contained in the in- 
struction register is connected between TOI and 
TOO and shifts data one stage towards its serial out- 
put on each rising edge of TCK. 


The test data register selected by the current in- 
struction retains its previous state. The instruction 
does not change 'in this state. 


When the controller is in this state and a rising edge 
is applied to TCK, the controller enters the Exitt-IR 
state if TMS is held HIGH or remains in the Shift-/R 
state if TMS is held LOW. 


6.4.13 EXIT1-IR STATE 


This is a temporary state. If TMS is held HIGH, a 
rising edge applied to TCK while in this state causes 
the controller to enter the Update-IR 
state, which 
terminates the scanning process. If TMS is held low 
and a rising edge is applied to TCK, the controller 
enters the Pause-IR state. 


The test data register selected by the current in- 
struction retains its previous state unchanged. The 
instruction does not change in this state, and the 
instruction register retains its state. 


6.4.14 PAUSE-IR STATE 


This state allows the shifting of the instruction regis- 
ter to be temporarily halted. 


The test data register selected by the current in- 
struction retains its previous state. The instruction 
does not change in this state, and the instruction 
register retains its state. 


The controller remains in this state as long as TMS 
is LOW. When TMS goes HIGH and a rising edge is 
applied to TCK, the controller moves to the Exit2-IR 
state. 


6.4.15 EXIT2-IR STATE 


This is a temporary state. If TMS is held HIGH and a 
rising edge is applied to TCK, the scanning process 
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terminates, 
and the 
TAP controller 
enters 
the 
Update·/R 
state. If TMS is held LOW and a rising 
edge is applied to TCK, the controller enters the 
Shift-IR state. 


The test data register selected by the current in- 
struction retains its previous state unchanged. The 
instruction does not change in this state, and the 
instruction register retains its state. 


6.4.16 UPDATE-IR STATE 


The instruction shifted into the instruction register is 
latched onto the parallel output from the shift-regis- 
ter path on the falling edge of TCK. Once the new 
instruction has been latched, it becomes the current 
instruction. 


Test data registers selected by the current instruc- 
tion retain the previous state. 


6.5 
Boundary Scan Register Cell 
Ordering 


Figure 6.4 shows the order of cells in the BSR. 
There are 150 cells including TOO. TOI is not a BSR 
cell. 


The OCTL,ACTL, TCTL, and OCTL cells do not cor- 
respond to pins of the i860 XP microprocessor; rath- 
er, they control the bidirectional and tristate pins: 
DCTL 063-00,OP7-0PO 
ACTL A31-A3 
TCTL Tristate 
outputs: 
AOS#, 
BE7#-BEO#, 


CACHE#, 
CTYP, O/C#, 
KBO, KB1, LEN, 
M/IO#, 
NENE#, PCO, PCYC, PWT, W/R# 
OCTL Outputs not floated 
in normal operation: 
BREQ, 
HIT#, 
HITM#, 
HLOA, 
LOCK#, 


PCHK# 


If a value of one is loaded into any of these control 
latches, the associated pins will not drive the exter- 
nal bus while running EXTEST. 


The values of OCTL, ACTL, TCTL, and OCTL are 
undefined 
during the SAMPLE instruction. 


The values and direction of I/O and outputs do not 
change during the scanning process (that is, during 
Shift-DR states). They only change after scanning is 
completed (in the Update-DR 
state). 


The decision table, Table 6.3, defines how the 
boundary scan instructions EXTEST and SAMPLE/ 
PRELOAD utilize BSR. 
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Figure 6.4. Boundary Scan Register Ordering 
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- 
BROY" 
KEN" 
NA" 
WB/WT" 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
VcC 
Vee 
Vcc 
055 
051 
O~~ 
~O 
- 


U 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


W/R" 
LEN 
PWT 
PCYC 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
0511 
~9 
0~2 
039 
T 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


A3 
RESET 
lOCK" 
W/IO" 
[ADS" 
INT/CS8 
BERR 
FUNE" 
HlOA 
KBl 
NENE" 
HIT" 
TRST" 
TDI 
0112 
058 
OU 
D52 
D37 
S 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
u 
Vss 
Vcc 
BOFF" 
D/C" 
PCD 
INV 
PEN" 
BREQ 
TOO 
KBO 
HOLD 
TWS 
DII3 
DIIO 
057 
Vcc 
D33 
D35 
R 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


TCK 
Vss 
Vcc 
CACHE" 
AHOLD 
0111 
D5~ 
Vcc 
Vss 
OP~ 
Q 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


VccClK 
Vcc 
Vss 
RSRVD 
CTYP 
0511 
ope 
Vss 
Vcc 
D3~ 
P 
0 
0 
0 
0, 
0 
0 
0 
0 
0 
0 
,- 


Vcc 
Vcc 
Vss 
ADS" 
HITW" 
- 
DP7 
050 
Vss 
Vcc 
0311 
N 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


Vcc 
Vss 
Vss 
ClK 
A5 
D53 
D~7 
Vss 
Vss 
D31 
M 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


Vcc 
Vcc 
Vss 
SPARE 
All 
~II 
D~l 
Vss 
Vex; 
Vcc 
L 
0 
0 
0 
0 
0 
PINOUT 
0 
0 
0 
0 
0 


Vcc 
Vss 
Vss 
AID 
All 
D45 
~3 
Vss 
Vss 
Vcc 
K 
0 
0 
0 
0 
0 
PIN SIDE VIEW 
0 
0 
0 
0 
0 


Vex; 
Vcc 
Vss 
1.12 
Al~ 
DP5 
D311 
Vss 
Vcc 
Vcc 
J 
0 
0 
- 0 
0 
0 
0 
0 
0 
0 
0 


Vcc 
Vss 
Vss 
All 
1.20 
032 
PCHK" 
Vss 
Vss 
Vex; 
H 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


Vcc 
Vex; 
Vss 
1.22 
A2S 
0211 
030 
Vss 
Vcc 
028 
G 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


1.7 
Vcc 
Vss 
1.211 
1.30 
02~ 
026 
Vss 
Vcc 
027 
F 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


At 
Vss 
Vcc 
1.27 
BED" 
D21 
023 
025 
Vss 
Vcc 
E 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


All 
Vss 
Vcc 
1.29 
8El" 
BE2" 
8EII" 
EWBE" 
01 
05 
010 
014 
OP2 
017 
0111 
020 
Vex; 
OP3 
022 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


1.13 
AlII 
1.18 
1.31 
8E4" 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
012 
Oil 
07 
016 
018 
C 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


1.15 
1.21 
1.24 
80" 
Vss 
Vcc 
Vcc 
Vss 
Vcc 
Vss 
Vcc 
Vss 
Vcc 
Vcc 
Vss 
09 
011 
013 
015 
B 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


1.17 
1.23 
1.25 
BE5" 
8[7" 
BYPASSII 
DO 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
02 
Vcc 
DPO 
03 
04 
OS 
DPl 
A 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


1 
2 
3 
4 
5 
,6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
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I\) 
cO 
m 
~O 
~4 
D51 
D55 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
wB/WT. 
HA. 
KEH. 
BRDY- 
U 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


D39 
~2 
D49 
D58 
Vss 
Vss 
VSS 
Vss 
VSS 
Vss 
Vss 
Vss 
Vss 
VSS 
Vss 
PCYC 
PWT 
LEH 
W/R. 
T 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


037 
D52 
~8 
D58 
082 
TOI 
TRST. 
HIT. 
HEHE. 
KBl 
HLOA 
RIHE· 
BERR 
INTICS8 
£AOS. 
••/10. 
LOCK. 
RESET 
A3 
S 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


035 
D33 
Vcc 
OS7 
080 
D83 
T••S 
HOLD 
K80 
TOO 
BREQ 
PEN. 
INV 
PCD 
O/C. 
aoFF. 
Vcc 
Vss 
A4 
R 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


OP4 
Vss 
Vcc 
OS4 
081 
AHOLD 
CACHE. 
Vcc 
Vss 
TCK 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


, D34 
Vcc 
Vss 
OP8 
050 
CTYP 
RSRVO 
Vss 
Vcc 
VccCLK 
P 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
- 


Vcc 
038 
Vcc 
Vss 
OSO 
OP7 
HIT•••• ADS" 
Vss 
Vcc 
N 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


031 
Vss 
Vss 
047 
OS3 
AS 
ClK 
Vss 
Vss 
Vcc 


••• 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


Vcc 
Vcc 
Vss 
041 
~e 
AI 
SPARE 
Vss 
Vcc 
Vcc 
L 
0 
0 
0 
0 
0 
PINOUT 
0 
0 
0 
0 
0 


Vcc 
Vss 
Vss 
043 
~S 
All 
Al0 
Vss 
Vss 
Vcc 
K 
0 
0 
0 
0 
0 
TOP SIDE VIEW 
0 
0 
0 
0 
0 


Vcc 
Vcc 
Vss 
038 
OP5 
, 
A14 
A12 
Vss 
Vcc 
Vcc 
J 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


Vcc 
Vss 
Vss 
PCHK_ 
D32 
A20 
A16 
Vss 
Vss 
Vee 
H 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


D20 
Vcc 
Vss 
030 
D211 
A26 
A22 
VSS 
Vcc 
Vcc 
G 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


027 
Vcc 
Vss 
026 
024 
A30 
A28 
VSS 
Vcc 
A7 
r 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


Vcc 
Vss 
025 
023 
021 
BEO. 
U7 
Vcc 
Vss 
AI 
E 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


D22 
OP3 
Vcc 
D20 
018 
D17 
DP2 
014 
Dl0 
D5 
01 
EWBE· 
BE6. 
BE2- 
BEl" 
A29 
Vcc 
Vss 
All 
D 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


DIe 
016 
D7 
oe 
012 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
BE4- 
A31 
A18 
A18 
A13 
C 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


D15 
DU 
011 
09 
Vss 
Vcc 
Vcc 
Vss 
Vcc 
Vss 
Vcc 
Vss 
Vcc 
Vcc 
Vss 
BE3. 
A24 
A21 
A15· 


B 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


OPl 
06 
D4 
D3 
OPO 
Vcc 
D2 
Vcc 
Vcc 
Vcc 
Vcc 
Vcc 
DO 
BYPASS- 
BE7- 
BE5. 
A25 
A23 
A17 
A 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


19 
18 
17 
16 
15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
240874-65 


::!! 
la 
C; 
""~ 


~ 
>< 
'Vi:nalI 
.. 
'VS' 
gaec;:: 
o! 
;--a 
'\ID 
3 
:?2J 
-t 
Iiiiil 
0 
F : 
~i 
=> 
~ 
~ 


~- 


(i 
G) 


~ 
>< 
'Ui:(; 
::uo 
'U 
::uoo 
m 
~o 
::u 


--l 


@ 


intet~ 


i860TM XP MICROPROCESSOR 


Location 
Signal 


Table 7.1. Pin Cross Reference by Location 


Location 
Signal 
Location 
Signal 
Location 
Signal 


A01 
A17 
A02 
A23 
A03 
A25 
A04 
8E5# 
A05 
8E7# 
A06 
8YPASS# 
A07 
00 
A08 
vcc 
A09 
vcc 
A10 
vcc 
A11 
vcc 
A12 
vcc 
A13 
02 
A14 
VCC 
A15 
OPO 
A16 
03 
A17 
04 
A18 
06 
A19 
OP1 
801 
A15 
802 
A21 
803 
A24 
804 
8E3# 
805 
Vss 
806 
VCC 
807 
VCC 
808 
VSS 
809 
vcc 
810 
VSS 
811 
· 
VCC 
812 
VSS 
813 
VCC 
814 
VCC 
815 
VSS 
816 
09 
817 
011 
818 
013 
819 
015 
C01 
A13 
C02 
A19 
C03 
A18 
C04 
A31 
C05 
8E4# 
C06 
VSS 
C07 
VSS 
C08 
VSS 
C09 
VSS 
C10 
VSS 
C11 
Vss 
C12 
VSS 
C13 
VSS 
C14 
VSS 


C15 
012 
C16 
08 
C17 
07 
C18 
016 
C19 
018 
001 
A11 
002 
VSS 
003 
VCC 
004 
A29 
005 
8E1# 
006 
8EU 
007 
8E6# 
008 
EW8E# 
009 
01 
010 
05 
011 
010 
012 
014 
013 
OP2 
014 
017 
015 
019 
016 
020 
017 
VCC 
018 
OP3 
019 
022 
E01 
A9 
E02 
VSS 
E03 
VCC 
E04 
A27 
E05 
8EO# 
E15 
021 
E16 
023 
E17 
025 
E18 
VSS 
E19 
VCC 
F01 
A7 
F02 
VCC 
F03 
VSS 
F04 
:A28 
F05 
A30 
F15 
024 
F16 
026 
F17 
VSS 
F18 
VCC 
F19 
027 
G01 
VCC 
G02 
VCC 
G03 
VSS 
G04 
A22 
G05 
A26 
G15 
028 
G16 
030 
G17 
VSS 


G18 
VCC 
G19 
029 
H01 
VCC 
H02 
VSS 
H03 
VSS 
H04 
A16 
H05 
A20 
H15 
032 
H16 
PCHK# 
H17 
VSS 
H18 
VSS 
H19 
VCC 
J01 
vcc 
J02 
VCC 
J03 
VSS 
J04 
A12 
J05 
A14 
J15 
OP5 


• J16 
038 
J17 
VSS 
J18 
VCC 
J19 
VCC 
K01 
VCC 
K02 
VSS 
K03 
VSS 
K04 
A10 
K05 
A8 
K15 
045 
K16 
043 
K17 
VSS 
K18 
VSS 
K19 
VCC 
L01 
VCC 
L02 
VCC 
L03 
VSS 
L04 
SPARE 
L05 
A6 
L15 
048 
L16 
041 
L17 
VSS 
L18 
VCC 
L19 
VCC 
M01 
VCC 
M02 
VSS 
M03 
VSS 
M04 
CLK 
M05 
A5 
M15 
053 
M16 
047 
M17 
VSS 
M18 
VSS 
M19 
031 


N01 
VCC 
N02 
VCC 
N03 
VSS 
N04 
AOS# 


N05 
HITM# 
N15 
OP7 
N16 
050 
N,17 
VSS 
N18 
VCC 
N19 
036 
P01 
VCCCLK 
P02 
VCC 
P03 
VSS 
P04 
RSRVO 
P05 
CTYP 
P15 
059 
P16 .........•.. 
OP6 
P17 
VSS 
P18 
VCC 
P19 
034 
001 
TCK 
002 
VSS 
003 
VCC 
004 
CACHE# 
005 
AHOLO 
015 
061 
016 
054 
017 
VCC 
018 
VSS 
019 
OP4 
R01 
A4 
R02 
VSS 
R03 
VCC 
R04 
80FF# 
R05 
O/C# 
R06 
PCO 
R07 
INV 
R08 
PEN# 
R09 
8REO 
R10 
TOO 
R11 
K80 
R12 
HOLO 
R13 
TMS 
R14 
063 
R15 
060 
R16 
057 
R17 
VCC 
R18 
033 
R19 
035 
S01 
A3 
S02 
RESET 


S03 
LOCK# 
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Location 
Signal 
Location 
Signal 
Location 
Signal 
Location 
Signal 


504 ......... 
M/IO# 
518 ............ 
052 
T13 ............ 
VSS 
U08 ............ 
vcc 
505 ........ 
EA05# 
519 ............ 
037 
T14 ............ 
VSS 
U09 ............ 
vcc 
506 ....... 
INT/Csa 
T01 ......... 
W/R# 
T15 ............ 
Vss 
U10 ............ 
VCC 
507 .......... 
BERR 
T02............ 
lEN 
T16 ............ 
056 
U11 ............ 
VCC 
508 ........ 
FLlNE# 
T03 ........... 
PWT 
T17 ............ 
049 
U12 ............ 
vcc 
509 .......... 
HlOA 
T04 .......... 
PCYC 
T18 ............ 
042 
U13 ............ 
VCC 
510 ............ 
KB1 
T05 ............ 
VSS 
T19 ............ 
039 
U14 ............ 
VCC 
511 ........ 
NENE# 
T06 ............ 
VSS 
U01 ........ 
BROY# 
U15 ............ 
VCC 
512........... 
HIT# 
T07 ............ 
VSS 
U02 ......... 
KEN# 
U16 ............ 
055 
513......... 
TR5T# 
T08 ............ 
VSS 
U03 ........... 
NA# 
U17 ............ 
051 
514 ............ 
TOI 
T09 ............ 
VSS 
U04 ...... 
WB/WT# 
U18 ............ 
044 
515 ............ 
062 
T10 ............ 
VSS 
U05 ............ 
VCC 
U19 ............ 
040 
516 ............ 
058 
T11 ............ 
VSS 
U06 ............ 
VCC 


517 ............ 
046 
T12 ............ 
VSS 
U07............ 
VCC 


Table 7.1.Pin Cross Reference 
by Location 
(Continued) 


Signal 
Location 
Signal 
Location 
Signal 
Location 
Signal 
Location 


A3 
501 
AHOlO 
005 
013 
B18 
043 
K16 
A4 
R01 
BEO# 
E05 
014 
012 
044 
U18 
A5 
M05 
BE1# 
005 
015 
B19 
045 
K15 
A6 
l05 
BEU 
D06 
016 
C18 
046 
517 
A7 
F01 
BE3# 
B04 
017 
014 
047 
M16 
A8 
K05 
BE4# 
C05 
018 
C19 
048 
U5 
A9 
E01 
BE5# 
A04 
019 
015 
049 
T17 
A10 
K04 
BE6# 
007 
020 
016 
050 
N16 
A11 
001 
BE7# 
A05 
021 
E15 
051 
U17 
A12 
J04 
BERR 
507 
022 
019 
052 
518 
A13 
C01 
BOFF# 
R04 
023 
E16 
053 
M15 
A14 
J05 
R5RVO 
P04 
024 
F15 
054 
016 
A15 
801 
BROY# 
U01 
025 
E17 
055 
U16 
A16 
H04 
BREO 
R09 
026 
F16 
056 
T16 
A17 
A01 
CACHE # 
004 
027 
F19 
057 
R16 
A18 
C03 
ClK 
M04 
028 
G15 
058 
516 
A19 
CO2 
CTYP 
P05 
029 
G19 
059 
P15 
A20 
H05 
00 
A07 
030 
G16 
060 
R15 
A21 
B02 
01 
D09 
031 
M19 
061 
015 
A22 
G04 
02 
A13 
032 
H15 
062 
515 
A23 
A02 
03 
A16 
033 
R18 
063 
R14 
A24 
B03 
04 
A17 
034 
P19 
O/C# 
R05 
A25 
A03 
05 
010 
035 
R19 
OPO 
A15 
A26 
G05 
06 
A18 
036 
N19 
OP1 
A19 
A27 
E04 
07 
C17 
037 
519 
OP2 
013 
A28 
F04 
08 
C16 
038 
J16 
OP3 
018 
A29 
004 
09 
B16 
039 
T19 
OP4 
019 
A30 
F05 
010 
011 
040 
U19 
OP5 
J15 
A31 
C04 
011 
B17 
041 
l16 
OP6 
P16 
A05# 
N04 
012 
C15 
042 
T18 
OP7 
N15 


Table 7.2.Pin Cross Reference 
by Pin Name 
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Table 7.2. Pin Cross Reference 
by Pin Name (Continued) 


Signal 
Location 
Signal 
Location 
Signal 
Location 
Signal 
Location 


EADS# 
S05 
Vcc 
B06 
Vcc 
R17 
Vss 
H17 
FLlNE# 
S08 
Vcc 
B07 
Vcc 
U05 
Vss 
H18 
HIT# 
S12 
Vcc 
B09 
Vcc 
U06 
Vss 
J03 
HITM# 
N05 
Vcc 
B11 
Vcc 
U07 
Vss 
J17 
HLDA 
S09 
Vcc 
B13 
Vcc 
U08 
Vss 
K02 
HOLD 
R12 
Vcc 
B14 
Vcc 
.U09 
Vss 
K03 
INJ/CS8 
S06 
Vcc 
D03 
Vcc 
U10 
Vss 
K17 
INV 
R07 
Vcc 
D17 
Vcc 
U11 
Vss 
K18 
KBO 
R11 
Vcc 
E03 
Vcc 
U12 
Vss 
L03 
KB1 
S10 
Vcc 
E19 
Vcc 
U13 
Vss 
L17 
KEN# 
U02 
Vcc 
F02 
Vcc 
U14 
Vss 
M02 
LEN 
T02 
Vcc 
F18 
Vcc 
U15 
Vss 
M03 
LOCK# 
S03 
Vcc 
G01 
VccCLK 
P01 
Vss 
M17 
M/IO# 
S04 
Vcc 
G02 
Vss 
B05 
Vss 
M18 
NA# 
U03 
Vcc 
G18 
Vss 
B08 
Vss 
N03 
NENE# 
S11 
Vcc 
H01 
Vss 
B10 
Vss 
N17 
PCD 
R06 
Vcc 
H19 
Vss 
B12 
Vss 
P03 
PCHK# 
H16 
Vcc 
J01 
Vss 
B15 
Vss 
P17 
PCYC 
T04 
Vcc 
J02 
Vss 
C06 
Vss 
002 
PEN# 
R08 
Vcc 
J18 
Vss 
C07 
Vss 
018 
PWT 
T03 
Vcc 
J19 
Vss 
C08 
Vss 
R02 
RESET 
S02 
Vcc 
K01 
Vss 
C09 
Vss 
T05 
SPARE 
L04 
Vcc 
K19 
Vss 
C10 
Vss 
T06 


EWBE# 
D08 
Vcc 
L01 
Vss 
C11 
Vss 
T07 
BYPASS# 
A06 
Vcc 
L02 
Vss 
C12 
Vss 
T08 


TCK 
001 
Vcc 
L18 
Vss 
C13 
Vss 
T09 


TOI 
S14 
Vcc 
L19 
Vss 
C14 
Vss 
T10 
TOO 
R10 
Vcc 
M01 
Vss 
D02 
Vss 
T11 
TMS 
R13 
Vcc 
N01 
Vss 
E02 
Vss 
T12 


TRST# 
S13 
Vcc 
N02 
Vss 
E18 
Vss 
T13 
Vcc 
A08 
Vcc 
N18 
Vss 
F03 
Vss 
T14 
Vcc 
A09 
Vcc 
P02 
Vss 
F17 
Vss 
T15 
Vcc 
A10 
Vcc 
P18 
Vss 
G03 
W/R# 
T01 
Vcc 
A11 
Vcc 
003 
Vss 
G17 
WB/WT# 
U04 
Vcc 
A12 
Vcc 
017 
Vss 
H02 
Vcc 
A14 
Vcc 
R03 
Vss 
H03 
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Table 7.3. Ceramic 
PGA Package 
Dimension 
Symbols 


LeHeror 
Description 
of Dimensions 
Symbol 


A 
Distance 
from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


A2 
Distance 
from base plane to highest point of body 


A3 
Distance from seating plane to bottom of body 


B 
Diameter 
of terminal 
lead pin 


D 
Largest overall package 
dimension 
of length 


01 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing between 
true lead position 
centerlines 


L 
Distance 
from seating plane to end of lead 


81 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. ContrOlling 
dimension: 
millimeter. 


2. Dimension 
"e-" 
("e") 
is noncumulative. 
3. Seating 
plane 
(standoff) 
is defined 
by P.C. board 
hole size: 0.0415-0.0430 
inch. 


4. Dimensions 
"S", 
"Bj" 
and "C" 
are nominal. 


5. Details 
of Pin 1 identifier 
are optional. 
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SWAGGED 
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PIN~ 
45° 
CHAMFER 
(4 Pl) 
(INDEX CORNER) 
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SEATING- 


PLANE 
~ 


1.3- HII 


L+ 
L __ 
I- 
r= 
.1 
••• 


~ 


~::-- - 
AI-I- 
-1.2- 
BASE PLANE- 


S~~:~l 


ea (ALL PINS) hi 


~ 
~ 


PIN 
DETAIL 


240874-66 


Family: 
Ceramic 
Pin Grid Array 
Package 


Symbol 
Mllllmeters 
Inches 


Mln 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
.140 
.180 


A1 
0.64 
1.14 
Solid Lid 
.025 
.045 
Solid Lid 


A2 
2.79 
3.56 
Solid Lid 
.110 
.140 
Solid Lid 


A3 
1.14 
1.40 
.045 
.055 


B 
0.43 
0.51 
.017 
.020 


0 
49.28 
49.96 
1.940 
1.967 


01 
45.59 
45.85 
1.795 
1.805 


e1 
2.29 
2.79 
.090 
.110 


L 
2.54 
3.30 
.100 
.130 


N 
240 
280 
240 
280 


S1 
1.52 
2.54' 
.060 
.100 


ISSUE 
9/90 


Figure 
7.3. 262-Lead 
Ceramic 
PGA Package 
Dimensions 
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8.0 
PACKAGE THERMAL 
SPECIFICATIONS 


For this section, 
let: 


P = 
maximum 
power consumption 


TC = 
case temperature 


TA = 
ambient 
air temperature 


8CA = thermal 
resistance 
from case to ambient 
air 


8JC = thermal 
resistance 
from junction 
to case 


8JA = thermal 
resistance 
from junction 
to ambient 
air 


The i860 XP microprocessor 
is specified 
for opera- 
tion when TC is within the range of 0·C-85·C. 
TC may 
be 
measured 
in 
any 
environment 
to 
determine 
whether 
the i860 XP microprocessor 
is within speci- 
fied operating 
range. The case temperature 
should 
be measured 
at the center 
of the top surface 
oppo- 
site the pins. 


TA can 
be calculated 
from 
8CA with .the following 
equation: 
-, 


TA = Tc - 
P " 8CA 


intel~ 


Typical values for 8CA at various airflows and for 8JC 
are given in Table 
8.1 for the 1.95 sq. in., 262 pin, 
ceramic 
PGA. 8JC is shown 
so that 8JA can be cal- 
culated 
by: 


Note that 8JC with a heatsink 
differs 
from 8JC with- 
out a heatsink 
because 
case 
temperature 
is mea- 
sured 
differently. 
Case 
temperature 
for 
8JC with 
heatsink 
is measured 
at the center 
of the heat fin 
base. Case temperature 
for 8JC without 
heatsink 
is 
measured 
at the center of the package 
top surface. 


Table 8.2 shows the maximum TA allowable 
(without 
exceeding 
Tcl at various 
airflows. 


Note that TA is greatly 
improved 
by attaching 
"fins" 
or a "heat 
sink" 
to the package. 
P (the maximum 
power consumption) 
is calculated 
by using the maxi- 
mum tee at 5V as tabulated 
in the D.C. Characteris- 
tics of section 
9. 


Figure 8.1 gives typical 
Ice derating 
with case tem- 
perature. 
For more information 
on heat sinks, mea- 
surement 
techniques, 
or package 
characteristics, 
re- 
fer 
to 
Intel Packaging Handbook, 
order 
number 
240800. 


Table 
8.1. Thermal 
Resistance-In 
·C/Watt 


8CA as a Function 
of Airflow 
- 
ft/mln 
(m/sec) 


8JC 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 
I 
With Heat Sink" 
1.6 
10.1 
6.3 
4.3 
3.2 
2.5 
2.2 
I 
Without 
Heat Sink 
1.0 
13.5 
11.0 
8.0 
6.5 
5.5 
5.0 


NOTE: 
" Nine-fin, 
unidirectional 
heat sink (fin dimensions: 
0.250' 
height. 0.040' 
fin width. 0.100' 
center-to-center 
spacing. 
1.730' 


length) 
- 


Table 8.2. Maximum 
TA at Various 
Airflows-In 
·C 


Airflow 
- 
ft/m In (m/sec) 


fClK 
0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


TA with 
50 
24 
47 
59 
66 
70 
72 
Heat Sink" 


TA without 
50 
4 
I 19 
37 
46 
52 
55 
Heat Sink 


TA with 
40 
34.5 
53.5 
63.5 
69 
72.5 
74 
Heat Sink" 


TA without 
40 
17.5 
30 
45 
52.5 
57.5 
60 
Heat Sink 


NOTE: 
" Nine-fin. 
unidirectional 
heat sink (fin dimensions: 
0.250' 
height. 0.040' 
fin width. 0.100' 
center-to-center 
spacing. 
1.730' 
length) 
• 
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Figure 8.1. Ice Derating 
with Case Temperature 


9.0 
ELECTRICAL DATA 


All input and output timings 
are specified 
relative 
to 
the 1.5V level of the rising edge ofCLK 
and refer to 
the point that the signal reaches 
1.5V. 
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9.1 
Absolute Maximum Ratings 


Case Temperature 
TC under Bias 
O°C to 85°C 


Storage Temperature 
- 65°C to + 150°C 


Voltage 
on Any Pin with 
Respect 
to Ground 
-0.5 
to VCC+0.5V 


NOTICE: This data sheet contains preliminary infor- 
mation on new products in production. The specifica- 
tions are subject to change without notice. Verify with 
your local Intel Sales office that you have the latest 
data sheet before finalizing a design. 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause 
permanent 
damage. 


These are stress ratings 
only. Operation 
beyond 
the 
"Operating 
Conditions" 
is not recommended 
and ex- 
tended exposure 
beyond 
the 
"Operating 
Conditions" 
may affect device reliability. 


9.2 
D.C. Characteristics 


Table 9.1. D.C. Characteristics 
Operating 
Conditions: 
VCC = 5V ± 5%; Tc = O°C to 85°C 


Symbol 
Parameter 
Min 
Max 
Units 
Notes 


Vll 
Input lOW 
voltage 
(TIl) 
-0.3 
+0.8 
V 


VIH 
Input HIGH voltage 
(TIl) 
2.0 
Vcc+0.3 
V 


VIHC 
ClK 
Input HIGH (TIl) 
2.5 
vcc + 0.3 
V 


VOl 
Output lOW 
voltage 
(TIl) 
0.45 
V 
1 


VOH 
Output HIGH voltage 
(TIl) 
2.4 
V 
2 


Icc 
Power supply current (@ 50 MHz) 
1.2 
Amp 
3 


Icc 
Power supply current (@40 MHz) 
1.0 
Amp 
3 


III 
Input leakage current 
±15 
/-LA 
4 


ILlP 
Input leakage current (pull-up) 
-400 
/-LA 
5 


ILO 
Output leakage current 
±15 
/-LA, 
6 


CIN 
Input capacitance 
11.5 
pF 
7 


Co 
l/O or output capacitance 
14 
pF 
7 


NOTES: 
1. This parameter is measured with current load of 5 mA. 
2. This parameter is measured with current load of 1 mA. Typical value is vcc - 
0.45V. 


3. Measured at 50 MHz and vo: = 5V. 
4. This parameter is for inputs without pullups. vcc is on, and OV s VIN s: vcc. 
5. This parameter is for inputs with pullups and VIL = 0.45V. Note that if the pull-ups are put in high-impedance state via the 
DCTl boundary scan cell that also tri-states the data outputs, then the leakage is ± 15 /LA. 
6. 0.45V s: VIN s: vcc - 
0.45V. 
7. These parameters are not tested; they are guaranteed by design characterization. 
8. TRST# is tested only at VIN = vcc 
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9.3 
A.C. Characteristics 


Table 9.2. A.C. Characteristics 
CL = 0 pF Unless Otherwise 
Specified; vcc = 5V ± 5%; TC = O°C to 85°C 


40MHz 
50MHz 


Symbol 
Parameter 
Fig 
Mln 
Max 
Mln 
Max 
Notes 


(ns) 
(n5) 
(ns) 
(ns) 


tc 
ClK 
Period 
9.1 
25 
40 
20 
40 


ttc 
TCK Period 
9.2 
40 
1000 
40 
1000 


ClKStability 
9.1 
0.1% 
0.1% 


tch 
ClK 
High Time 
9.1 
7 
7 


tcl 
ClKlowTime 
9.1 
7 
7 


tr 
ClK 
Rise Time 
9.1 
3 
3 
h 


tf 
ClKFallTime 
9.1 
3 
3 
h 


ts 
TCK to ClK 
Skew 
9.3 
±1 
±1 
i 


ttch 
TCK High Time 
9.2 
10 
10 


ttcl 
TCKlowTime 
9.2 
10 
10 


ttcr 
TCK Rise Time 
9.2 
4 
4 


ttcf 
TCKFallTime 
9.2 
4 
4 


tsu.1 
RESET, HOlO, 
BERR, FLlNE#, 
9.1 
8 
7 
PEN #, INT ICS8 Setup Time 


tsu.2a 
BOFF #, AHOlO, 
INV, 
9.1 
8 
7 
WB/WT 
# Setup Time 


tsu.2b 
KEN#, 
NA# 
9.1 
8 
8 


tsu.3 
EAOS # Setup Time 
9.1 
9 
8 


tsu.4 
EWBE # Setup Time 
9.1 
8.5 
7.5 


tsu.5 
BROY # Setup Time 
9.1 
8.5 
7.5 


tsu.6a 
OP7-DPO 
Setup Time 
9.1 
8.5 
7.5 


tsu.6b 
063-00 
Setup Time 
9.1 
8.5 
8 


tsu.7 
063-00, 
OP7-0PO 
Setup Time 
9.1 
5.5 
4.5 
(late 
Backoff 
Mode) 


tsu.8 
A31-A5 
Setup Time 
9.1 
11 
10 


tllU.9 
EWBE# 
Setup Time 
9.1 
8.5 
7.5 


ttsu 
TOI, TMS, TRST # Setup Time 
9.2 
8 
8 


tth 
TOI, TMS, TRST # Hold Time 
9.2 
2 
1 
b 


th.1 
Hold Time, All Inputs 
9.1 
2 
1 
c 
except 063-00, 
OP7-00 


th.2 
063-00, 
OP7-0PO 
Hold Time 
9.1 
3 
2 
c 
(Normal and late 
Back-Off 
Mode) 


ttco 
TOO Valid Oelay and All Outputs 
9.2 
1.5 
17.5 
1.5 
16.5 
a, f 
Valid Oelay in EXTEST Mode 
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40MHz 
50MHz 


Symbol 
Parameter 
Fig 
Mln 
Max 
Mln 
Max 
Notes 


(ns) 
(ns) 
(ns) 
(ns) 


tco.1 
A31-A22 
Valid Delay 
9.1 
1.5 
12 
1.5 
11 
a,j 


tco.2a 
A21-A3 
Valid Delay 
9.1 
1.5 
11.5 
1.5 
10.5 
a, g,j 
(High Current Mode) 


tco.2b 
A21-A3 
Valid Delay 
9.1 
1.5 
12 
1.5 , 
11 
a,j 
(Normal Current Mode) 


tco.3 
063-00, 
DP7-DPO 
Valid Delay 
9.1 
2.5 
14 
2.5 
13 
a,d 


. tco.4 
BREQ, HLDA, PCHK#, 
9.1 
1.5 
13 
1.5 
12 
a 
NENE#, 
KBO, KB1 Valid Delay 


tco.5a 
ADS# 
Valid Delay 
9.1 
1.5 
10 
1.5 
9 
a,g 
(High Current Mode) 


tco.5b 
ADS# 
Valid Delay 
9.1 
1.5 
11 
1.5 
10 
a 
(Normal Current Mode) 


tco.6a 
W/R# 
Valid Delay 
9.1 
1.5 
11 
1.5 
10 
a,g 
(High Current Mode) 


tco.6b 
W/R# 
Valid Delay 
9.1 
1.5 
. 12 
1.5 
11 
a 
(Normal Current Mode) 


tco.7a 
HITM# 
Valid Delay 
9.1 
1.5 
12 
1.5 
11 
a,g 
(High Current Mode) 


tco.7b 
HITM# 
Valid Delay 
9.1 
1.5 
13 
1.5 
12 
a 
(Normal Current Mode) 


tco.8 
PWT, PCD, HIT#, 
CTYP, D/C# 
MIIO#, 
9.1 
1.5 
12 
1.5 
11 
a 
PCYC, LOCK#, 
CACHE#, 
LEN Valid Delay 


tco.9a 
BEO#-BE7# 
Valid Delay 
9.1 
1.5 
12 
1.5 
11 
a,g 
(High Current Mode) 


tco.9b 
BEO#-BE7# 
Valid Delay 
9.1 
1.5 
13 
1.5 
12 
a 
(Normal Current Mode) 


tz.1 
Float Time All Outputs 
9.1 
2 
19 
2 
18 
e 
except 
063-00, 
DP7-DPO 


tz.2 
Float Time 063-00, 
DP7 -DPO 
9.1 
3 
19 
3 
18 
e 


tzt 
Float Time during Boundary 
Scan EXTEST 
9.1 
20 
20 
f 


Table 
9.2. A.C. Characteristics 
(Continued) 
CL = 0 pF Unless Otherwise 
Specified; 
VCC = 5V ±5%; TC = O°C to 85°C 


NOTES: 
a. Minimum and maximum delays are for OpFload. 
b. These hold times are referenced to the falling edge of TCK. 
c. These hold times are referenced to the rising edge of CLK. 
d. Output delay for 063-00, 
DP7-DPO is from the CLK after ADS# activation. 
e. Float time = delay until maximum output current is less than ± ILO. Float time is not tested. 
f. Delay from falling edge of TCK. 
g. These pins can be configured as normal or high-current buffers. When they are configured as high-current buffers for 
interface with cache memory or other large loads, use the derating curves in Figure 9.3. Otherwise, all normal buffers use 
the derating curves in Figure 9.4. 
h. tr and If should be measured between 0.8V and 2.5V. 
i. Assumes TCK and ClK both at 25 MHz. 


j. 1.5 ns @ TCASE= 20'C to 85'C Minimum Address Valid Delay 10 ns @ TCASE= O'C to 20'C Minimum Address Valid 
Delay. 
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Figure 9.1. elK, Input, and Output Timings 


/+----------lIe---------~ 
/+-----lIeh------o-l-----lIel----.j 


2.0V. - - - 1--------------~------------------}~-. 
::::;::::-::::::-:::: 
- \:::::::::::::/_--- 


TCK 


lIet1--_ 
IlerI----- 


, 
W' 


TRST" 
~ 
- - - - - - 
- - -/~ 
- - - - - - - - - - - - - 
- - - - - - - - -. 
1.5V 
- 
TOI, 
, 
\11 


TWS, 
~,-, 
• 
• -_-_-+--_- J,-J+lI\-~-_-_-_-_- • • -_-_-_-_-_- r.-_-_-_-_-_- 
_-.. 


11 
TOO 
• - - - - - - - - - - - - - - 
- - - - - - - - - -. 
1.5V 
1\ 


240874-69 


Figure 9.2. TAP Signal Timings 
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NOTES: 
Graphs 
are not linear outside 
the CL range shown. 


NOMINAL 
= OpF value 
given in the A.C. Timings 
table. 
"Typical 
part under worst-case 
conditions. 


Figure 9.3. Typical Output Delay vs Load Capacitance 
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Graphs 
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the CL range shown. 
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in the A.C. Timings 
table. 
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Figure 9.4. Typical 
Output 
Delay vs Load Capacitance 
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NOTES: 
Graphs 
are not linear outside 
the CL range shown. 


-Typical 
part under worst-case 
conditions. 


Figure 9.5a. Typical Slew Time vs Load Capacitance under Worst-Case Conditions (Rising Voltage) 
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Graphs 
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the CL range shown. 
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Figure 9.5b. Typical Slew Time vs Load Capacitance under Worst-Case Conditions (Failing Voltage) 
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Figure 9.6. Typical Ice vs. Frequency 
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9.4 
Component 
Buffer Model 


9.4.1 FIRST 
ORDER 
ELECTRICAL 
BUFFER 
MODEL 


The first order electrical buffer model provides an 
accurate and simple representation of the buffers 
used in the inputs and outputs of the CHMOS i860 
XP CPU. The model output consists of four compo- 
nents: 
1. Linear voltage waveform (dV/dt) 


2. Intrinsic buffer delay due to CL (to) 
3. Buffer output impedance (Ra) 
4. Buffer output capacitance (Co) 


as shown in Figure 9.7a 


A fitting algorithm has been used to arrive at values 
for dVIdt, to. Co. and Ra such that Ra matches the 
actual buffer impedance and Co. the intrinsic buffer 
output capacitance whether the output is on or off. 
remains constant across the operating range while 
minimizing the difference between the full buffer cir- 
cuit and its simplified electrical model for a set of 
different loads (lumped capacitance. and short and 
long transmission lines). dV/dT is the slope of the 
voltage ramp. while to is the intrinsic buffer delay 
associated with a given CL.to accounts for the intrin- 
sic delay by offsetting the excitation of the model by 
the amount of the delay. 


NOTE: 
to is zero for CL = 0 and when the load is repre- 
sented by a transmission line. 


dV Idt u(t-to) 
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Figure 9.7a. Output Model 


The input model consists of one component. buffer 
capacitance (CIN).as shown in Figure 9.7b. 
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Figure 9.7b. Input Model 


9.4.2 FIRST 
ORDER 
ELECTRICAL 
MODEL 
PARAMETER 
VALUES 


The parameters that make up the first order electri- 
cal model vary with the buffer design. In addition. 
these parameters also vary with the operating condi- 
tion (i.e.• temperature and VcC> of the buffer pro- 
cess. The typical process corner is being modeled. 
Two sizes of buffer are used on these components. 
labelled here as small and large. The parameter val- 
ues found in Table 9.3 and 9.4 list dV/dt. to. Ra. and 
Co. These parameters are provided for both low-to- 
high and high-to-Iow transitions at the typical pro- 
cess corner for three operating conditions (Vcc = 
5.5V and TJ = -10°C. Vcc = 5.0V and TJ = 80°C. 
and vcc = 4.5V and TJ = 125°C. 


9.4.3 PACKAGE 
PARAMETERS 


In addition to the buffe( characteristics. package 
characteristics are also included to complete the 
model. Package inductance. capacitance and resist- 
ance values vary with design geometry and material 
properties of the package. Figure 9.8 shows a model 
of the package including these parameters and 
should be placed between the first order electrical 
buffer model as shown in Figure 9.9 and the board 
interconnects. Notice' the package model only in- 
cludes the package inductance (Lp) and capaci- 
tance (Cp). This is sufficient since the package re- 
sistance ls so small it is negligible. 


Table 9.5 lists the buffer model parameters for each 
pin of the i860 XP microprocessor. The table gives 
the package model parameters for each pin. fol- 
lowed by the input capacitance (input and 1/0 pins) 
andlor output buffer size (outputs and 1/0). In those 
cases where the buffer used by a pin is an option 
selected at reset by the PEN# input. the output buff- 
er column lists the sizes available. Large buffers cor- 
respond to high-current mode. while small buffers 
correspond to normal current mode. 


• 
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9.4.4 BOARD 
INTERCONNECTS 


The board interconnect can be considered as a 
lumped parameter (capacitive load) or as a transmis- 
sion line. As a rule of thumb, an unterminated board 
interconnect may be considered as a capacitive load 
if the round trip time (time for signal to travel from 
one end of the interconnect to the other and back) is 
short compared to the transition time of the signal. 
At frequencies of 50 MHz and above most intercon- 
nects behave as transmission lines (Figure 9.10). 
For accurate results at high frequencies, these 
transmission line effects must be taken into account 
and modeled. 


Figure 9.8. Package 
Model 
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Figure 9.9a. Output Buffer and Package 
Model 
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Figure 9.9b. Input Buffer and Package 
Model 
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Figure 9.10. Transmission 
Line Model 


i860TM XP MICROPROCESSOR 


Table 9.3. Small Output 
Buffer First Order Electrical 
Model Parameter 
Values 


TJ 
Ro 
Co 
to (ns) at various CL 


Transition 
Vcc 
dV/dT 
0 
5 
25 
50 
100 
150 
(C) 
(ohms) 
(pF) 
(pF) 
(pF) 
(pF) 
(pF) 
(pF) 
(pF) 


Low-to-High 
5.5 
-10 
28.0 
4.3 
5.5/1.2 
0 
0.0 
0.1 
0.3 
0.7 
1.1 


Low-to-High 
5.5 
80 
36.4 
4.3 
5.5/1.4 
0 
0.0 
0.1 
0.8 
0.8 
1.2 


Low-to-High 
5.5 
125 
40.4 
4.3' 
5.5/1.5 
0 
0.0 
0.1 
0.4 
0.8 
1.2 


Low-to-High 
5.0 
-10 
30.2 
4.3 
5.0/1.2 
0 
0.0 
0.1 
0.4 
0.8 
1.2 


Low-to-High 
5.0 
80 
39.2 
4.3 
5.0/1.4 
0 
0.0 
0.2 
0.4 
0.9 
1.3 


Low-to-High 
5.0 
125 
43.5 
4.3 
5.0/1.6 
0, 
0.0 
0.2 
0.4 
0.9 
1.3 


Low-to-High 
4.5 
-10 
33.0 
4.3 
4.5/1.2 
0 
0.0 
0.2 
0.5 
1.0 
1.4 


Low-to-High 
4.5 
80 
42.8 
4.3 
4.5/1.6 
0 
0.0 
0.2 
0.5 
1.0 
1.5 


Low-to-High 
4.5 
125 
47.4 
4.3 
4.5/1.6 
0 
0.0 
0.3 
0.6 
1.1 
1.6 


High-to-Low 
5.5 
-10 
23.2 
4.3 
5.5/1.0 
0 
0.0 
0.4 
0.7 
1.2 
1.6 


High-to-Low 
5.5 
80 
31.4 
4.3 
5.5/1.4 
0 
0.0 
0.4 
0.9 
1.3 
1.8 


High-to-Low 
5.5 
125 
36.1 
4.3 
5.5/1.6 
0 
0.0 
0.5 
0.8 
1.3 
1.8 


High-to-Low 
5.0 
-10 
24.0 
4.3 
5.0/1.1 
0 
0.0 
0.5 
0.9 
1.2 
1.7 


High-to-Low 
5.0 
' 80 
32.8 
4.3 
5.0/1.4 
0 
0.0 
0.5 
0.9 
1.5 
1.9 


, 


High-to-Low 
5.0 
125 
37.8 
4.3 
5.0/1.7 
0 
0.0 
0.5 
0.9 
1.4 
1.8 


High-to-Low 
4.5 
-10 
25.1 
4.3 
4.5/1.2 
0 
0.0 
0.4 
0.7 
1.2 
1.7 


High-to-Low 
4.5 
80 
34.5 
4.3 
4.5/1.6 
0 
0.0 
0.4 
0.8 
1.3 
1.8 


High-to-Low 
4.5 
125 
39.9 
4.3 
4.5/1.8 
0 
0.0 
0.5 
0.9 
1.4 
1.9 


2-113 


• 


1860TM XP MICROPROCESSOR 
infel~ 


Table 9.4. Large Output Buffer First Order Electrical 
Model Parameter 
Values 


TJ 
Ro 
Co 
to (ns) at various CL 


Transition 
Vcc 
(C) 
(ohms) 
(pF) 
dV/dT 
0 
5 
25 
50 
100 
150 
200 
250 
300 
(pF) 
(pF) 
(pF) 
(pF) 
(pF) 
(pF) 
(pF) 
(pF) 
(pF) 


Low-to-Hiqh 
5.5 
-10 
12.1 
4.3 
5.5/0.7 
0 
0.0 
0.1 
0.3 
0.6 
0.8 
1.0 
1.3 
1.5 


Low-to-High 
5.5 
80 
15.5 
4.3 
5.5/0.9 
0 
0.0 
0.2 
0.3 
0.6 
0.9 
1.1 
1.4 
1.7 


Low-to-High 
5.5 
125 
17.2 
4.3 
5.5/1.1 
0 
0.0 
0.2 
0.4 
0.7 
1.0 
1.2 
1.4 
1.7 


Low-to-High 
5.0 
-10 
13.0 
4.3 
5.0/0.9 
0 
0.0 
0.1 
0.3 
0.6 
0.9 
1.1 
1.4 
1.7 


Low-to-High 
5.0 
80 
16.7 
4.3 
5.0/1.0 
0 
0.0 
0.2 
0.4 
0.8 
1.1 
1.4 
1.7 
2.0 


Low-to-High 
5.0 
125 
18.5 
4.3 
5.0/1.2 
0 
0.0 
0.2 
0.4 
0.8 
1.1 
1.4 
1.7 
2.0 


Low-to-High 
4.5 
-10 
14.1 
4.3 
4.5/0.9 
0 
0.0 
0.2 
0.4 
0.7 
1.1 
1.4 
1.7 
2.0 


Low-to-High 
4.5 
80 
18.0 
4.3 
4.5/1.2 
0 
0.0 
0.2 
0.4 
0.9 
1.2 
1.5 
1.9 
2.2 


Low-to-High 
4.5 
125 
19.9 
4.3 
4.5/1.3 
0 
0.0 
0.2 
0.5 
0.8 
1.2 
1.5 
1.9 
?2 


High-to-Low 
5.5 
-10 
10.6 
4.3 
5.5/0.7 
0 
0.0 
0.3 
0.6 
0.9 
1.2 
1.5 
1.8 
2.0 


High-to-Low 
5.5 
80 
13.9 
4.3 
5.5/1.0 
0 
0.0 
0.4 
0.7 
1.2 
1.5 
1.9 
2.2 
2.5 


High-to-Low 
5.5 
125 
15.8 
4.3 
5.5/1.1 
0 
0.0 
0.4 
0.8 
1.3 
1.7 
2.0 
2.4 
2.8 


High-to-Low 
5.0 
-10 
11.0 
4.3 
5.0/0.8 
0 
0.0 
0.4 
0.7 
1.0 
1.3 
1.6 
1.9 
2.1 


High-to-Low 
5.0 
80 
14.5 
4.3 
5.0/1.0 
0 
0.0 
0.4 
0.8 
1.2 
1.6 
2.0 
2.3 
2.6 


High-to-Low 
5.0 
125 
16.5 
4.3 
5.0/1.2 
0 
0.0 
0.4 
0.8 
1.3 
1.7 
2.1 
2.5 
2.8 


High-to-Low 
4.5 
-10 
11.3 
4.3 
4.5/0.9 
0 
0.0 
0.4 
0.7 
1.1 
1.4 
1.7 
2.0 
2.4 


High-to-Low 
4.5 
80 
15.2 
4.3 
4.5/1.2 
0 
0.0 
0.4 
0.8 
1.3 
1.6 
2.0 
2.3 
2.7 


High-to-Low 
4.5 
125 
17.4 
4.3 
4.5/1.3 
0 
0.0 
0.4 
0.8 
1.3 
1.7 
2.1 
2.5 
2.8 
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Table 9.5 Buffer Models 


Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
, 
Typical 
Typical 
CIN (pF) 
Size 
Typical 
(Large or Small) 


A3 
S01 
7.6 
13.8 
6.7 
LIS 


A4 
R01 
6.2 
14.5 
6.7 
LIS 


A5 
M05 
6.5 
7.8 
6.7 
LIS 


A6 
L05 
5.3 
8.0 
6.7 
LIS 


A7 
F01 
7.7 
16.2 
6.7 
LIS 


A8 
K05 
5.1 
7.7 
6.7 
LIS 


A9 
E01 
8.0 
16.4 
6.7 
LIS 


A10 
K04 
5.1 
8.8 
6.7 
LIS 


An 
D01 
8.3 
16.8 
6.7 
LIS 


A12 
J04 
5.2 
9.0 
6.7 
LIS 


A13 
C01 
8.7 
17.2 
6.7 
LIS 


A14 
J05 
5.2 
7.8 
6.7 
LIS 


A15 
B01 
9.0 
17.8 
6.7 
LIS 


A16 
H04 
5.2 
9.0 
6.7 
LIS 


A17 
A01 
9.4 
18.2 
6.7 
LIS 


A18 
C03 
7.8 
14.5 
6.7 
LIS 


A19 
CO2 
9.0 
15.3 
6.7 
LIS 


A20 
H05 
7.5 
7.7 
6.7 
LIS 


A21 
802 
8.5 
15.7 
6.7 
LIS 


A22 
G04 
7.5 
9.1 
4.4 
S 


A23 
A02 
8.1 
15.7 
4.4 
S 


A24 
803 
7.0 
14.5 
4.4 
S 


A25 
A03 
7.7 
14.6 
4.4 
S 


A26 
G05 
6.7 
7.9 
4.4 
S 


A27 
E04 
7.6 
9.6 
4.4 
S 


A28 
F04 
6.5 
9.2 
4.4 
S 


A29 
D04 
7.4 
10.0 
4.4 
S 


A30 
F05 
5.9 
8.2 
4.4 
S 


A31 
C04 
6.6 
10.4 
4.4 
S 


ADS# 
N04 
6.2 
9.1 
LIS 


AHOLD 
a05 
6.0 
8.8 
2.0 


BEO# 
E05 
5.7 
8.8 
LIS 


BE1# 
D05 
6.7 
8.8 
LIS 


BE2# 
006 
5.7 
9.0 
LIS 
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Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typical 
Typical 
CIN (pF) 
Size 
Typical 
(Large or Small) 


BE3# 
B04 
6.5 
11.2 
liS 


BE4# 
COS 
5.9 
10.6 
liS 


BE5# 
A04 
6.5 
12.0 
LIS 


BE6# 
007 
4.9 
8.6 
liS 


BE7# 
A05 
6.1 
11.5 
liS 


BERR 
S07 
5.8 
8.7 
2.0 


BOFF# 
R04 
6.3 
10.4 
2.0 


RSRVO 
P04 
6.4 
9.4 
2.0 


BROY# 
U01 
8.0 
14.7 
2.0 


BREQ 
R09 
4.4 
7.5 
S 


BYPASS # 
A06 
Strapping Option 


CACHE# 
004 
6.6 
9.8 
S 


ClK 
M04 
6.2 
8.9 
2.0 


CTYP 
P05 
6.5 
8.6 
S 


Do 
A07 
5.5 
10.6 
4.4 
S 


01 
009 
7.6 
7.6 
4.4 
S 


02 
A13 
7.4 
15.0 
4.4 
S 


03 
A16 
7.7 
17.7 
4.4 
S 


04 
A17 
9.2 
17.9 
4.4 
S 


05 
010 
7.5 
7.6 
4.4 
S 


06 
A18 
9.4 
18.3 
4.4 
S 


07 
C17 
8.6 
15.9 
4.4 
S 


08 
C16 
8.6 
14.5 
4.4 
S 


09 
B16 
9.3 
14.7 
4.4 
S 


010 
011 
8.3 
7.5 
4.4 
S 


011 
B17 
8.9 
14.7 
4.4 
S 


012 
C15 
8.1 
7.8 
4.4 
S 


013 
B18 
8.6 
15.4 
4.4 
S 


014 
012 
7.2 
7.8 
4.4 
S 


015 
B19 
8.2 
15.6 
4.4 
S 


016 
C18 
7.9 
10.7 
4.4 
S 


017 
014 
6.7 
9.2 
4.4 
S 


018 
C19 
7.6 
14.2 
4.4 
S 


019 
015 
6.4 
10.0 
4.4 
S 


Table 9.5. Buffer Models (Continued) 
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Table 9.5 Buffer Models 
(Continued) 


Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typlc;al 
Typical 
CIN (pF) 
Size 
Typical 
(Large or Small) 


020 
016 
7.4 
10.7 
4.4 
8 


021 
E15 
5.6 
8.8 
4.4 
8 


022 
019 
6.7 
12.7 
4.4 
8 


023 
E16 
5.5 
9.7 
4.4 
8 


024 
F15 
5.3 
8.3 
4.4 
8 


025 
E17 
6.6 
9.9 
4.4 
8 


026 
F16 
5.3 
9.7 
4.4 
8 


027 
F19 
6.2 
11.7 
4.4 
8 


028 
G15 
5.1 
7.9 
4.4 
8 


029 
G19 
6.2 
11.8 
4.4 
8 


030 
G16 
5.1 
8.9 
4.4 
8 


031 
M19 
8.6 
16.2 
4.4 
8 


032 
H15 
5.2 
7.7 
4.4 
8 


033 
R18 
11.0 
19.6 
4.4 
8 


034 
P19 
8.0 
18.4 
4.4 
8 


035 
R19 
9.1 
18.8 
4.4 
8 


036 
N19 
8.1 
16.9 
4.4 
8 


037 
819 
9.2 
20.7 
4.4 
8 


038 
J16 
8.4 
8.9 
4.4 
8 


039 
T19 
10.5 
19.6 
4.4 
8 


040 
U19 
10.8 
19.1 
4.4 
8 


041 
L16 
8.3 
10.9 
4.4 
8 


042 
T18 
10.5 
17.8 
4.4 
8 


043 
K16 
8.4 
8.8 
4.4 
8 


044 
U18 
10.1 
17.7 
4.4 
8 


045 
K15 
9.3 
7.5 
4.4 
8 


046 
817 
9.5 
14.5 
4.4 
8 


047 
M16 
8.0 
9.8 
4.4 
8 


048 
L15 
8.0 
7.7 
4.4 
8 


049 
T17 
8.7 
14.6 
4.4 
8 


050 
N16 
7.8 
9.9 
4.4 
8 


051 
U17 
8.6 
15.2 
4.4 
8 


052 
818 
7.6 
14.3 
4.4 
8 
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Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typical 
Typical, 
CIN(pF) 
Size 
Typical 
(Large or Small) 


053 
M15 
7.7 
7.1 
4.4 
5 


054 
016 
7.0 
11.1 
4.4 
5 


055 
U16 
8.0 
14.3 
4.4 
5 


056 
T16 
7.8 
12.8 
4.4 
5 


057 
R16 
6.5 
11.8 
4.4 
5 


058 
516 
7.5 
11.3 
4.4 
5 


059 
P15 
6.2 
8.7 
4.4 
5 


060 
R15 
7.1 
9.6 
4.4 
5 


061 
015 
5.9 
9.3 
4.4 
5 


062 
515 
6.9 
10.7 
4.4 
5 


063 
R14 
5.6 
9.7 
4.4 
5 


O/C# 
R05 
5.8 
9.7 
5 


OPO 
A15 
7.7 
18.3 
4.4 
5 


OP1 
A19 
9.7 
18.9 
4.4 
5 


OP2 
013 
7.1 
8.5 
4.4 
5 


OP3 
018 
6.7 
11.3 
4.4 
5 


OP4 
019 
10.4 
19.0 
4.4 
5 


OP5 
J15 
9.9 
7.7 
4.4 
5 


OP6 
P16 
9.3 
10.7 
4.4 
5 


OP7 
N15 
6.8 
8.9 
4.4 
5 


EA05# 
505 
5.5 
10.5 
2.0 


EWBE# 
008 
7.5 
7.6 
2.0 


FLlNE# 
508 
5.4 
8.1 
2.0 


HIT# 
512 
5.9 
11.1 
5 


HITM# 
N05 
6.2 
8.2 
L 


HLOA 
I 
509 
5.3 
7.9 
5 


HOLD 
R12 
6.1 
11.1 
2.0 


INT/C58 
506 
5.2 
10.0 
2.0 


INV 
R07 
5.3 
8.2 
2.0 


KBO 
R11 
6.1 
9.2 
5 


KB1 
510 
6.4 
7.9 
5 


KEN# 
U02 
7.4 
13.4 
2.0 


LEN 
T02 
7.9 
12.8 
5 


Table 9.5 Buffer Models (Continued) 
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Table 9.5 Buffer Models (Continued) 


Input 
Output 


Pin Name 
Location 
Cp(pF) 
Lp (nH) 
Buffer 
Buffer 
Typical 
Typical 
CIN(pF) 
Size 
Typical 
(lairge or Small) 


LOCK# 
503 
7.7 
11.2 
5 


MIIO# 
504 
7.3 
10.3 
5 


NA# 
U03 
7.1 
13.0 
2.0 


NENE# 
511 
6.3 
9.6 
5 


PCO 
R06 
5.6 
8.9 
5 


PCHK# 
H16 
5.1 
8.8 
5 


PCYC 
T04 
7.2 
11.4 
5 


PEN# 
R08 
4.8 
7.8 
2.0 


I 
PWT 
T03 
7.4 
12.1 
5 


RE5ET 
502 
7.9 
12.5 
2.0 


5PARE 
L04 
NC 


TCK 
\ 
001 
5.8 
14.1 
2.0 


TOI 
514 
6.5 
9.8 
2.0 


TOO 
R10 
6.3 
7.6 
5 


TM5 
R13 
5.6 
9.6 
2.0 


TR5T# 
513 
6.3 
9.6 
2.0 


W/R# 
T01 
7.8 
14.3 
L/5 


WB/WT# 
U04 
6.7 
12.3 
2.0 
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for logical operations and is sign-extended 
for add and subtract operations (including 
addu and subu) and for all addressing cal- 
culations. 
src 1nl 
Same as srct except that no immediate 
constant or address offset value is permit- 
ted. 
src1s 
Same as srct except that the immediate 
constant. is a 5-bit value that is zero-ex- 
tended to 32 bits. 
src2 
The second of the two source-register des- 
ignators. 
dest 
The destination register designator. 


Thus, the operand specifier isre2, 
for example, 
means that an integer register is used and that the 
encoding of that register must be placed in the sre2 
field of the machine instruction. 


Other (nonregister) operands are specified by a one- 
part abbreviation that represents both the type of 
operand required and the instruction field into which 
the value of the operand is placed: 
11const A 16-bit immediate constant or address off- 
set that the i860 XP microprocessor sign- 
extends to 32 bits when computing the ef- 
fective address. 


Ibroff 
A signed, 26-bit, immediate, relative branch 
offset. 


sbroff 
A signed, 16-bit, immediate, relative branch 
offset. 
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monic 
of the operation. 


Other abbreviations include: 
.p 
Precision specification .ss, 
.sd, or .dd (.ds not permit- 
ted). Refer to Table 10.1. 
.r 
Precision specification .ss, 
.sd, .ds, or .dd. Refer to 
Table 10.1. 
.v 
.sd or .dld Refer to Table 
10.1. 
.w 
.ss or .dd. Refer to Table 
10.1. 


.X 
.b (8 bits), .s (16 bits), or .I 
(32 bits) 
.y 
.I (32 bits), .d (64 bits), or 
.q (128 bits) 
mem.x(address) 
The memory location indi- 
cated by address 
with a 
size of x. 
pdrt.x(address) 
The 1/0 port indicated by 
address 
with a size of x. 
Int_vector.x(address) 
The interrupt vector with a 
size of x returned from 1/0 
port address. 


PM 
The pixel mask, which is 
considerod as an array of 
eight 
bits 
PM(7)..PM(0), 
where PM(O) is the least- 
significant bit. 
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10.1 Instruction 
Definitions in Alphabetical Order 


adds isrc 1,isrc2, idest 
Add Signed 
idest +- isrc 1 + isrc2 
OF +- (bit 31 carry *- bit 30 carry) 
CC set if isrc2 + isrc1 < 0 (signed) 
CC clear if isrc2 + isrc1 2 0 (signed) 


addu isrct, isrc2, idest 
Add Unsigned 
idest +- isrc 1 + isrc2 
OF +- bit 31 carry 
CC +- bit 31 carry 


and isrc 1,isrc2, idest 
Logical AND 
idest +- isrc1 and isrc2 
CC set if result is zero, cleared otherwise ;' 


andh «const, isrc2, idest 
Logical AND High 
idest +- ts coost shifted left 16 bits) and isrc2 
CC set if result is zero, cleared otherwise 


andnot isrct, isrc2, idest 
Logical AND NOT 
idest +- (not isrc1) and isrc2 
CC set if result is zero, cleared otherwise 


andnoth 
# const, isrc2, idest 
Logic:al AND NOT High 
idest +- (not (#const shifted left 16 bits)) and isrc2 
CC set if result is zero, cleared otherwise 


bc Ibroff 
Branch on CC 
IF 
CC = 1 
THEN 
continue execution at brx(lbroff) 
FI 


bc.t Ibroff 
Branch on CC, Taken 
IF 
CC = 1 
THEN 
execute one more sequential instruction 
continue execution at brx(lbroff) 
ELSE 
skip next sequential instruction 
FI 


bla isrc 1ni, isrc2, sbroff 
Branch on LCC and Add 
LCC-temp clear if isrc2 + isrc1ni < 0 (signed) 
LCC-temp set if isrc2 + isrc1ni 2 0 (signed) 
isrc2 +- isrc 1ni + isrc2 
Execute one more sequential instruction 
IF 
LCC 
THEN 
LCC +- LCC-temp 
continue execution at brx(sbroff) 
ELSE 
LCC +- LCC-temp 
FI 


bnc Ibroff 
, 
Elranch on Not CC 
IF 
CC = 0 
THEN 
continue execution at brx(lbroff) 
FI 
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bnc.t Ibroff 
Branch on Not CC, Taken 


IF 
CC = 0 
THEN 
execute one more sequential instruction 
continue execution at brx(lbroff) 
ELSE 
skip next sequential instruction 
FI 


br Ibroff 
Branch Direct Unconditionally 
Execute one more sequential instruction. 
Continue execution at brx(lbroff). 


brl [isrc1ml 
Branch Indirect Unconditionally 


Execute one more sequential instruction 
IF 
any trap bit in psr is set 
THEN 
copy PU to U, PIM to IM in psr 
clear trap bits 
IF 
OS is set and DIM is reset 
THEN 
enter dual-instruction mode after executing one 
instruction in single-instruction mode 
ELSE 
IF 
OS is set and DIM is set 
THEN 
enter single-instruction mode after executing one 
instruction in dual-instruction mode 
ELSE 
IF 
DIM is set 
THEN 
enter dual-instruction mode 
for next instruction pair 
ELSE 
enter single-instruction mode 
for next instruction pair 
FI 
FI 
FI 
FI 
Continue execution at address in isrc1ni 
(The original contents of isrc 1ni is used even if the next instruction 
modifies isrc1ni. Does not trap if isrc1ni is misaligned.) 


bte isrcts, 
isrc2, sbroff 
Branch If Equal 
IF 
isrc1s = isrc2 
THEN 
continue execution at brx(sbroff) 
FI 


btne isrc 1s, isrc2, sbroff 
Branch If Not Equal 
IF 
isrc1s *- isrc2 


THEN 
continue execution at brx(sbroff) 
FI 


call1broff 
Subroutine Call 
r1 ~ 
address of next sequential instruction + 4 (or + 8 in dual mode) 


Execute one more sequential instruction 
Continue execution at brx(lbroff) 


calli [isrc 1ml 
Indirect Subroutine Call 
r1 ~ 
address of next sequential instruction + 4 (or + 8 in dual mode) 


Execute one more sequential instruction 
Continue execution at address in isrc1ni 
(The original contents of isrc1ni is used even if the next instruction 
modifies isrc1ni. Does not trap if isrc1ni is misaligned. The 
register isrc1ni must not be r1.) 


fadd.p fsrc " fsrc2, fdest 
Floating-Point Add 
fdest 
~ 
fsrc 1 + fsrc2 
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faddp tsrct, fsrc2, {des! 
Add with Plxel Merge 
{des! +- tsrct + {src2 (using integer 
arithmetic; 
8-byte operands 
and destination) 
Shift and load MERGE 
register 
from tsrct + {src2 as defined 
in Table 
10.2 


faddz tsrct, {src2, {des! 
Add with Z Merge 
tdest +- tsrct + {src2 (using integer 
arithmetic; 
8-byte operands 
and destination) 
Shift MERGE 
right 16 and load fields 31..16 and 63 ..48 from tsrc 1 + {src2 


famov.r tsrc1, tdest 
Floatln!II-Polnt 
Adder Move 
{des! +- tsrc 1 


fladd.w tsrct, {src2, {des! 
Long-Integer 
Add 
{des! +- tsrct + {src2 (2's complement 
integer arithmetic) 


flsub.w tsrct, {src2, {des! 
Long-Integer 
Subtract 
trdest +- tsrct - tsrce (2's complement 
integer arithmetic) 
. 


flx.v tsrc1, tdest 
Floating-Point 
to Integer 
Conversion 
{des! +- 64-bit value with low-order 
32 bits equal to integer 
part of tsrct rounded 


Floating-Point 
Load 
fld.y isrc1(isrc2), (des! 
(Normal) 
fld.y isrc1(isrc2)+ +', {des! 
(Autolncrement) 
{des! +- mem.y (isrc1 + isrc2) 
IF autoincrement 
THEN isrc2 +- isrc1 + isrc2 
FI 
Cache Flush 
flush # const(isrc2) 
(Normal) 
flush # const(isrc2) + + 
(Autolncrement) 
Write back (if modified) 
the line in data cache that has address 
(# cons! +isrc2) 
80860XR: 
and set tag value to (#cons! + isrc2). 
80860XP: 
and invalidate 
its virtual and physical 
tags. 
Contents 
of line undefined. 
IF autoincrement 
THEN isrc2 +- 
«const + isrc2 
FI 


fmlow.dd 
tsrct, terce. {des! 
Floating-Point 
Multiply 
Low 
{des! +- 
low-order 
53 bits of ({src1 mantissa 
x (src2 mantissa) 
{des! bit 53 +- most significant 
bit of ({src1mantissa x (src2 mantissa) 


fmov.r tsrct, {des! 
: 
Floating-Point Reg-Reg 
Move 
Assembler 
pseudo-operation 
fmov.ss 
tsrct, {des! 
fmov.dd tsrct, {des! 
fmov.sd tsrct, tdest 
fmov.ds tsrct, {des! 


= fladd.ss 
tsrc1. fO, tdest 
= fladd.dd 
tsrc 1, fO, {des! 
= famov.sd 
tsrct, {des! 
= famov.ds 
tsrct, {des! 


fmul.p tsrct, tsrce; {des! 
Floating-Point Multiply 
{des! +- tsrc 1 x tsrce 


fnop 
Floating-Point 
No Operation 
Assembler 
pseudo-operation 
fnop = shrd rO, rO, rO 


• 
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form fsrc1, fdest 
OR with MERGE Register 


fdest +- fsrc1 OR MERGE 
MERGE 
+- 0 


frcp.p fsrc2, fdest 
Floating-POint Reciprocal 
fdest +- 1 / fsrc2 with maximum mantissa error < 2- 7 


frsqr.p fsrc2, fdest 
Floating-Point Reciprocal Square Root 


fdest +- 1 / Hsrc2 
with maximum mantissa error < 2- 7 


Floating-Point Store 
fst.y tdest, isrc1(isrc2) 
(Normal) 
fst.y fdest, 
isrc1(isrc2) + + 
'.(Autolncrement) 


mem.y (isrc2 + isrc1) +- fdest 
IF autoincrement 
THEN isrc2 +- isrct 
+ isrc2 
FI 


fsub.p fsrc1, fsrc2, fdest 
Floating-Point Subtract 


fdest +- tsrc 1 - fsrc2 


ftrunc.v tsrct, 
fdest 
Floating-Point to Integer Conversion 
fdest +- 64-bit value with low-order 32 bits equal to integer part of fsrc1 


fxfr fsrc1, idest 
Transfer F-P to Integer Register 


idest +- fsrc 1 


fzchkl fsrc1, fsrc2, fdest 
, 
32-Blt Z-Buffer Check 


- 
Consider the 64-bit operands as arrays of two 32-bit 


fields fsrc1(1) ..fsrc1(O), 
fsrc2(1) ..fsrc2(O), and fdest(1)..fdest(O) 
where zero denotes the least-significant field. 
PM +- 
PM shifted right by 2 bits 


FOR i = 0 to 1 
DO 
PM [i + 6] +- fsrc2(i) s fsrc1(i) (unsigned) 
fdest(i) 
+- smaller of fsrc2(i) and fsrc1(i) 
OD 
MERGE 
+- 0 


fzchks tsrct, fsrc2, fdest 
16-Blt Z-Buffer Check 


Consider the 64-bit operands as arrays of four 16-bit 
fields fsrc1(3) ..fsrc1(O), 
fsrc2(3)..fsrc2(O), 
and fdest(3) ..fdest(O) 
where zero denotes the least-significant field. 


PM +- PM shifted right by 4 bits 
FOR i = 0 to 3 
DO 
PM [i + 4] +- fsrc2(i) 
s fsrc1(i) (unsigned) 
fdest(i) 
+- smaller of fsrc2(i) and fsrc1(i) 
OD 
MERGE 
+- 0 


Intovr 
Software Trap on Integer Overflow 
IF OF = 1 
THEN generate trap with IT set in psr 
FI 


Ixfr isrct ni, fdest 
Transfer Integer to F-P Register 
fdest 
+- isrc 1ni 
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Id.c csrc2, idest 
Load from Control Register 
idest 
- 
csrc2 


Id.x isrc1(isrc2j, 
idest 
Load Integer 
idest 
-. 
mem.x 
(isrc1 + isrc2j 


Idlnt.x isrc2, idest 
Load Interrupt Vector 
idest 
- 
int_vector.x 
(isrc2) 
. 


NOTE: Not available with the i860 XR CPU 


Idlo.x isrc2, idest 
'.' 
Load 1/0 
idest 
- 
port.x 
(isrc2) 
NOTE: Not available with the i860 XR CPU 
•• 
c 


lock 
Begin Interlocked Sequence 
Set BL in dlrbase. 
The next load or store that appears on the bus locks that location. 
2 


Disable interrupts until the bus is unlocked. 


mov isrc2, idest 
Register-Register 
Move' 


Assembler pseudo-operation 
mov isrc2, idest = shl rO, isrc2, idest 


mov const32, 
idest 
Constant-to-Reglster 
Move 
Assembler pseudo-operation 
when OxFFFF8000 ~ const32 
< Ox8000 ... 
adds l%const32, 
rO, idest 
otherwise ... 


orh h%const32, 
rO, idest 
or l%const32, 
idest, idest 


nop 
, 
Core-Unit No Operation 
Assembler pseudo-operation 
nop = shl rO,rO, rO 


or isrc1, isrc2, idest 
Logical OR 
idest 
- 
isrc1 OR isrc2 
CC set if result is zero, cleared otherwise 


orh 11const, isrc2, idest 
Logical OR high 
idest 
- 
(11const shifted left 16 bits) OR isrc2 
, 


CC set if result is zero, cleared otherwise 


pfadd.p tsrct, 
fsrc2, fdest 
Plpellned Floating-Point Add 
fdest 
- 
last stage adder result 
Advance A pipeline one stage 
A pipeline first stage - 
fsrc1 + fsrc2 


pfaddp fsrc1, fsrc2, fdest 
Plpellned Add with Plxel Merge 
fdest 
- 
last-stage graphics-unit result 
last-stage graphics-unit result - 
fsrc1 + fsrc2 
(using integer arithmetic; 8-byte operands and destination) 
Shift, then load MERGE register from fsrc1 + fsrc2 
as defined in Table 10.2 


pfaddz tsrct, 
fsrc2, fdest 
Plpellned Add with Z Merge 
frdest 
- 
last-stage graphics-unit result 
last-stage graphics-unit result - 
fsrc1 + fsrc2 
(using integer arithmetic; 8-byte operands and destination) 
Shift MERGE right 16, then load fields 31..16 and 63..48 fromfsrc1 
+ 
fsrc2 
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pfam.p tsrct, fsrc2, tdest 
Plpellned 
Floating-Point 
Add and Multiply 
tdest +- last stage adder result 
Advance 
A and M pipeline 
one stage (operands 
accessed 
before 
advancing 
pipeline) 
A pipeline 
first stage +- A-op1 + A-op2 
M pipeline 
first stage +- 
M-op1 x M-op2 


pfamov.r 
tsrct, tdest 
Plpellned 
Floating-Point 
Adder 
Move 
tdest +- last stage adder result 
Advance 
A pipeline 
one stage 
A pipeline 
first stage +- tsrct 


pfeq.p tsrct, {src2, tdest 
Plpellned 
Floating-Point 
Equal Compare 
tdest +- last stage adder result 
CC set if tsrct = {src2, else cleared 
Advance 
A pipeline 
one stage 
A pipeline 
first stage is undefined, 
but no result exception 
occurs 


pfgt.p 
tsrct, {src2, tdest 
Plpellned 
Floating-Point 
Greater-Than 
Compare 
(Assembler 
clears R-bit of instruction) 
tdest +- last stage adder result 
CC set if tsrct > tsrce. else cleared 
Advance 
A pipeline 
one stage 
A pipeline 
first stage is undefined, 
but no result exception 
occurs 


pfladd.w 
tsrct, {src2, tdest 
Plpellned 
Long-Integer 
Add 
taest +- last-stage 
graphics-unit 
result 
last-stage 
qraphics-unit 
result +- tsrct + {src2 (2's complement 
integer 
arithmetic) 


pflsub.w 
fsrcl, {src2, tdest 
Plpellned 
Long-Integer 
Subtract 
{dest +- last-stage 
graphics-unit 
result 
. 


last-stage 
graphics-unit 
result +- tsrct - 
{src2 (2's complement 
integer arithmetic) 


pflx. v tsrc 1, tdest 
Plpellned 
Floatlng-Polntto 
Integer 
Conversion 
taest +- 
last stage adder result 
Advance 
A pipeline 
one stage 
A pipeline 
first stage +- 64-bit value with low-order 
32 bits 
equal to integer 
part of tsrct 
rounded 


Plpellned 
Floating-Point 
Load 


pfld.y isrcl(isrc2), (des! 
(Normal) 
pfld_y isrcl(isrc2)+ +, tdest 
(Autolncrement) 
tdest +- mem.y (third previous 
pfld's 
(isrcl + isrc2) 
(where 
.y is precision 
of third previous 
pfld.y) 
IF autoincrement 
THEN isrc2 +- isrcl + isrc2 
FI 
NOTE: pfld.q 
is not available 
with the i860 XR CPU 


pfle.p tsrct, {src2, tdest 
Plpellned 
F-P Less-Than 
or Equal Compare 
Assembler 
sets R-bit of instruction 
tdest +- last stage adder result 
CC clear if tsrct 
s; tsrce. else set 
Advance 
A pipeline 
one stage 
A pipeline 
first stage is undefined, 
but no result exception 
occurs 
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pfmam.p tsrct, {src2, {des! 
Plpellned Floating-Point Add and Multiply 
{des! +- last stage multiplier 
result 
Advance 
A and M pipeline 
one stage (operands 
accessed 
before advancing 
pipeline) 
A pipeline 
first stage +- A-op1 + A-op2 
M pipeline 
first stage +- M-op1 
x 
M-op2 


pfmov.r tsrct, {des! 
Plpellned Floating-Point Reg-Reg Move 


Assembler 
pseudo-operation 
pfmov .•• tsrct, {des! = pfladd .•• tsrct, fO, {des! 
pfmov.dd tsrct, {des! = pfladd.dd tsrct, to, {des! 
pfmov.sd tsrct, {des! = pfamov.sd 
tsrc 1, {des! 
pfmov.ds tsrct, {des! = pfamov.ds 
tsrc 1, {des! 


pfmsm.p tsrct, {src2, {des! 
Plpellned Floating-Point Subtract and Multiply 
{des! +- 
last stage multiplier 
result 
Advance 
A and M pipeline 
one stage (operands 
accessed 
before advancing 
pipeline) 
A pipeline 
first stage +- A-op1 
- 
A-op2 
M pipeline 
first stage +- M-op1 
x 
M-op2 


pfmul.p tsrct, {src2, {des! 
Plpellned Floating-Point Multiply 
{des! +- last stage multiplier 
result 
Advance 
M pipeline 
one stage 
M pipeline 
first stage +- tsrct 
x {src2 


pfmul3.dd fsrc1, {src2, {des! 
Three-Stage Plpellned Multiply 
{des! +- last stage multiplier 
result 
Advance 
3-Stage 
M pipeline 
one stage 
M pipeline 
first stage +- fsrc1 x fsrc2 


pform tsrc 1,{des! 
Plpellned OR to MERGE Register 
{des! +- last-stage 
graphics-unit 
result 
last-stage 
graphics-unit 
result +- tsrct OR MERGE 
MERGE 
+- 0 


pfsm.p tsrct, fsrc2, {des! 
Plpellned Floating-Point Subtract and Multiply 
{des! +- last stage adder result 
Advance 
A and M pipeline 
one stage (operands 
accessed 
before advancing 
pipeline) 
A pipeline 
first stage +- A-op1 
- 
A-op2 
M pipeline 
first stage +- 
M-op1 
x 
M-op2 


pfsub.p tsrct, fsrc2, {des! 
Plpellned Floating-Point Subtract 
{des! +- last stage adder result 
Advance 
A pipeline 
one stage 
A pipeline 
first stage +- tsrct - 
fsrc2 


pftrunc.v tsrct, {des! 
Plpellned Floating-Point to Integer Conversion 
{des! +- last stage adder result 
Advance 
A pipeline 
one stage 
A pipeline 
first stage +- 64-bit value with low-order 
32 bits 
equal to integer 
part of tsrct 
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pfzchkl fsrc 1,fsrc2, fdest 
Pipelined 32-Blt Z-Buffer Check 
Consider 
the 64-bit operands 
as arrays of two 32-bit 
fields fsrc1(1)..fsrc1(0), 
fsrc2(1) ..fsrc2(0), 
and fdest(1)..fdest(0) 
where 
zero denotes 
the least-significant 
field. 
PM +- PM shifted 
right by 2 bits 
FOR i = 
0 to 1 
DO 
PM [i + 6] +- fsrc2(i) s fsrc1(i) 
(unsigned) 
fdest(i) 
+- last-stage 
graphics-unit 
result 
last-stage 
graphics-unit 
result +- smaller 
of fsrc2(i) and 
fsrc1 
OD 
MERGE 
+- 0 


pfzchks fsrc1, fsrc2, tdest 
Plpellned 16-Blt Z-Buffer Check 
Consider 
the 64-bit operands 
as arrays of four 16-bit 
fields fsrc1(3) ..fsrc1(0), 
fsrc2(3) ..fsrc2(0), 
and. fdest(3) ..fdest(0) 
where zero denotes 
the least-significant 
field. 


PM +- PM shifted 
right by 4 bits 
FOR i = 
0 to 3 
DO 
PM [i + 4] +- fsrc2(i) 
~ fsrc1(i) 
(unsigned) 
fdest +- last-stage 
graphics-unit 
result 
last-stage 
graphics-unit 
result(i) 
+- smaller of fsrc2(i) 
and fsrc1(i) 
OD 
MERGE 
+- 0 


pst.d fdest, 11const(isrc2) 
Plxel Store 
pst.d fdest, 
11const(isrc2) + + 
Plxel Store Autolncrement 
Pixels enabled 
by PM in mem.d 
(isrc2 + 
IIconsf) 
+- fdest 
Shift PM right by 8/pixel 
size (in bytes) bits 


IF autoincrement 
THEN isrc2 +- 
11const + isrc2 
FI 


scyc.x isrc2 
Special Cycles 
Generate 
a special 
bus cycle (D/CII 
=0, 
W/RII 
= 1, MIIOII 
=0) 
and 
set BE711-BEOII 
according 
to the value contained 
in the register isrc2 
NOTE: 
Not available 
with the i860 XR CPU 


shl isrc1, isrc2, idest 
Shift Left 
idest +- isrc2 shifted 
left by isrc1 bits 


shr isrct, 
isrc2, idest'. 
: 
Shift Right 
SC (in psr) +- isrc1 
idest +- isrc2 shifted 
right by tsrct 
bits 


shra isrct, 
isrc2, idest 
Shift Right Arithmetic 
idest +- isrc2 arithmetically 
shifted 
right by 
isrc1 bits 
\ 


shrd isrc1ni, isrc2, idest 
Shift Right Double 


idest +- low-order 
32 bits of isrc1ni:isrc2 
shifted 
right by SC bits 


st.c isrc1ni, csrc2 
Store to Control Register 
csrc2 
+- src 1ni 


slx 
isrct ni, 11const(isrc2) 
Store Integer 
mem.x 
(isrc2 + 11consf) 
+- isrc1ni 
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10.2 
Instruction 
Format and Encoding 


All instructions are 32 bits long and begin on a four- 
byte boundary. When operands are registers, the 
encodings shown in Table 10.3 are used. 


There 
are two 
general core-instruction 
formats 
(REG-format and CTRL-format) and a separate for- 
mat for floating-point instructions. 


Table 10.3. Register Encoding 


Register 
Encoding 


rO 
0 


r31 
31 


fO 
0 


f31 
31 


Fault Instruction 
0 
Processor Status 
1 
Directory Base 
2 
Data Breakpoint 
3 
Floating-Point Status 
4 
Extended Processor Status 
5 


Bus Error Address' 
6 
Concurrency Control' 
7 
pO' 
8 
p1' 
9 
p2' 
10 
p3' 
11 


NOTE: 
'Available only with i860 XP CPU. Using these encodings 
with the i860 XR CPU produces undefined results. 
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10.2.1 REG-FORMAT INSTRUCTIONS 


Within the REG-format are several variations as 
shown in Figure 10.1. Table 10.4 gives the encod- 
ings for these instructions. One encoding is an es- 
cape code that defines yet another variation: the 
core escape instructions. Figure 10.2 shows the for- 
mat of this group, and Table 10.5 shows the encod- 
ings. 


In these instructions, the src2 field selects one of 
the 32 integer registers (most instructions) or one of 
the control registers (st.c and Id.c). Dest 
selects 
one of the 32.integer registers (most instructions) or 
floating-point registers (fld, fst, pfld, pst, Ixfr). For 
instructions where src1 is optionally an immediate 
value, bit 26 of the opcode (I-bit) indicates whether 
src1 is an immediate. If bit 26 is clear, an integer 
register is used; if bit 26 is set, src1 is contained in 
the low-order 16 bits, except for bte and btne in- 
structions. For bte and btne, the .five-bit immediate 
value is contained in the src1 field. For st, bte, btne, 
and bla, the upper five bits of the offset or broffset 
are contained in the dest field instead of src1, and 
the lower 11 bits of offset 
are the lower 11 bits of 
the instruction. 


For Id and st, bits 28 and zero determine operand 
size as follows: 


81t28 
81tO 
Operand Size 


0 
0 
8-bits 
0 
1 
8-bits 
1 
0 
16-bits 
1 
1 
32-bits 


When src1 is immediate and bit 28 is set, bit zero of 
the immediate value is forced to zero. 


For fld. fst. pfld. pst. and flush. bit 0 selects autoin- 
crement 
addressing 
if set. For fld, fst, pfld, and pst. 
bits one and two select the operand 
size as follows: 


Bit 1 
BIt2 
Operand 
Size 


0 
0 
64-bits 
0 
1 
128-bits 
1 
0 
32-bits 
1 
1 
32-bits 


For flush. bits one and two must be zero. 
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When 
src1 
is immediate. 
bits zero and one 
of the 
immediate 
value are forced to zero to maintain 
align- 
ment. When bit one of the immediate 
value is clear. 
bit two is also forced 
to zero. 


For the instructions 
Idlo, stlo, Idlnt, and scyc. 
the 
operand 
size is encoded 
by bits 9 and 10 as follows. 


For other 
instructions. 
these 
bits are reserved 
and 
should be set to zero. 


Operand 
Size 
Bit 10 
Bit 9 


8 Bits (.b) 
0 
0 
16 Bits (.s) 
0 
1 
32 Bits (.I) 
1 
0 
reserved 
1 
1 


1.M2921127 
'52~2J2221 
'2019 III 17 18 514 
IJ 
12 If 
0911765~J2 
I 
0 
I 
OPCODE/I 
SRC2 
DEST 
SRC' 
I•••• EDIATE. OffSET. 
OR NULL 


240874-74 


'!!1.M2921127m'252~2J2221'201911117111 
51~IJ12ff10 
9 11 7 6 5 ~ J 2 
10 
I 
OPCODE 
, 
SRC2 
DEST 
1•••• EDlATE 


240874-75 


':fIJ029211272 
125 2~ 2J 2221.'2019 III 17 III 1514 
IS 12 If 
10 9 11 7 6 5 ~ S 2 
I 
0 
I 
OPCODE/I 
SRC2 
OffSET 
SRC' 
OffSET 
LOW 
HIGH 
SRC'S 


240874-76 


fJ0211282712fJ'252-12J2221.f20fIl181716. 
5,.,'.$1211.10 
11 8 765., 
oS2 
'0 
I 
OPCODE 
, 
SRC2 
OffSET 
I•••• EDIATE 
OffSET 
LOW 
HIGH 


240874-77 


Figure 10.1. REG-Format 
Variations 
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Table 
10.4. REG-Format 
Opcodes 
31 
30 
27 
26 
29 
28 


Id.x 
Load Integer 
0 
0 
0 
L 
0 
I 
st.x 
Store Integer 
0 
0 
0 
L 
1 
1 
Ixfr 
Integer to F-P Reg Transfer 
. 
0 
0 
0 
0 
1 
0 
- 
(reserved) 
0 
0 
0 
1 
1 
0 


fld.x, 
fst.x 
Load/Store 
F-P 
0 
0 
1 
0 
LS 
I 
flush 
Flush 
0 
0 
1 
1 
0 
1 
pst.d 
Pixel Store 
0 
0 
1 
1 
1 
1 
Id.c, st.c 
Load/Store 
Control Register 
0 
0 
1 
1 
LS 
0 


brl 
Branch Indirect 
0 
1 
0 
0 
0 
0 
trap 
Trap 
0 
1 
0 
0 
0 
1 
- 
(Escape for F-P Unit) 
0 
1 
0 
0 
1 
0 
- 
(Escape for Core Unit) 
0 
1 
0 
0 
1 
1 
bte, btne 
Branch Equal or Not Equal 
0 
1 
0 
1 
E 
I 
pfld.y 
Pipelined 
F-P Load 
0 
1 
1 
0 
0 
I 
- 
(CTRL-Format 
Instructions) 
0 
1 
1 
x 
x 
x 


addu, 
os, subu, 
-s 
Add/Subtract 
1 
0 
0 
SO 
AS 
I 
shl, shr 
Logical Shift 
1 
0 
1 
0 
LR 
I 
shrd 
Double Shift 
1 
0 
1 
1 
0 
0 
bla 
Branch LCC Set and Add 
1 
0 
1 
1 
0 
1 
shra 
Arithmetic 
Shift 
1 
0 
1 
1 
1 
I 


and(h) 
AND 
1 
1 
0 
0 
H 
I 
andnot(h) 
ANDNOT 
1 
1 
0 
1 
H 
I 
or(h) 
OR 
1 
1 
1 
0 
H 
I 
xor(h) 
XOR 
1 
1 
1 
1 
H 
I 
- 
(reserved) 
1 
1 
x 
x 
1 
0 


L 
Integer Length 
AS 
Add/Subtract 
0 
--a bits 
0 
-Add 
1 
-16 
or 32 bits (selectedby bit 0) 
1 
-Subtract 
LS 
Load/Store 
LR 
Left/Right 
0 
-Load 
0 
-Left 
Shift 
1 
-Store 
1 
-Right 
Shift 
SO 
Signed/Ordinal 
E 
Equal 
0 
-Ordinal 
0 
-Branch on Unequal 
1 
-Signed 
1 
-Branch on Equal 
H 
High 
Immediate 
0 
_nd, 
or, andnot, xor 
0 
-src1 is register 
1 
_ndh, 
orh, andnoth, xorh 
1 
-src1 is immediate 


f~nMD2m_Dlln~gM"~UUUUUW9876.~S2 
fO 
10 
1 0 
0 
1 
1 
SRC2 
DEST 
SRC 1 
SIZE t:::~::::~::::g: 
OPCODE 


[ill RESERVED 
BY 
INTEL 
CORPORATION 
(SET 
TO ZERO) 
240814-78 


Figure 
10.2. Core Escape 
Instructions 


2-132 


intel~ 
i860TM XP MICROPROCESSOR 


Table 
10.5. Core Escape 
Opcodes 
4 
3 
1 
o 
2 
- 
(reserved) 
0 
0 
0 
0 
0 
lock 
Begin Interloacked 
Sequence 
0 
0 
0 
0 
1 
calli 
Indirect Subroutine 
Call 
0 
0 
0 
1 
0 
- 
(reserved) 
0 
0 
0 
1 
1 
Introvr 
Trap on Integer Over1low 
0 
0 
1 
0 
0 
- 
(reserved) 
0 
0 
1 
0 
1 
- 
(reserved) 
0 
0 
1 
1 
0 
unlock 
End Interlocked 
Sequence 
0 
0 
1 
l' 
1 
Idlo' 
Load I/O 
0 
1 
0 
0 
0 
stlo' 
Store I/O 
0 
1 
0' 
0 
1 
Idlnt' 
Load Interrupt Vector 
0 
1 
0 
1 
0 
scyc' 
Special Cycles 
0 
1 
0 
1 
1 
- 
(reserved) 
0 
1 
1 
x 
x 
- 
(reserved) 
1 
0 
x 
x 
x 
- 
(reserved) 
1 
1 
x 
x 
x 


NOTE: 
•Available only with i860 XP CPU, not with i860 X~ CPU 


10.2.2 
CTRL·FORMAT 
INSTRUCTIONS 


The CTRL-Format 
instructions 
do not refer to registers; 
so, instead 
of the register 
fields, 
they have a 26-bit 
relative 
branch 
offset. 
Figure 10.3 shows the format 
of these 
instructions 
and Table 
10.6 defines 
the encod- 
ings. 


ISfsoa 
'8272 
~_._n.g~".uUUU"~9876S4S2 
I a 
10 1 1 
ope 
BRorFSET 


240874-79 


NOTE: 
BROFFSET 
is a signed 26·bit relative branch offset 


Figure 
10.3. CTRL·Format 
Instructions 


Table 
10.6. CTRL·Format 
Opcodes 
28 
27 
26 
- 
(reserved) 
0 
0 
0 
- 
(reserved) 
0 
0 
1 
br 
Branch Direct 
0 
1 
0 
call 
Call 
0 
1 
1 
bc(.t) 
Branch on CC Set 
1 
, 
0 
T 
bnc(.t) 
Branch on CC Clear 
1 
1 
T 


T 
Taken 
o 
--be 
or bnc 
1 
--be.1 
or bnc.t 


10.2.3 
FLOATING·POINT 
INSTRUCTION 
ENCODING 


the floating-point 
instructions, 
and Table 
10.7 gives 
the encodings. 
Within the dual-operation 
instructions 
is a subcode 
DPC whose 
values 
are given in Table 
10.9 along 
with the mnemonic 
that corresponds 
to 
each. 
The floating-point 
instructions 
also constitute 
an es- 
cape series. All these 
instructions 
begin with the bit 
sequence 
010010. 
Figure 
10.4 shows 
the format 
of 
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S 
Source Precision 
1 
Double-precision source operands 
o 
Single-precision source operands 
R 
Result Precision 
1 
Double-preclsionresult 
o 
Single-precisjon result 


'J1.HJ2928272, '12524252221. '2019181718 75141512 n 70 9 87 
8 5 4 52 
1 0 


10 1 0 0 
1 01 
SRC2 
DEST 
SRC 1 
P 0 S R 
OPCODE 
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SRC1, SRC2 
Source; one of 32 floating-point registers 
DEST 
Destination; one of 32 floating-point registers (except fxfr; one of 32 integer registers) 


P 
Pipelining 
1 
Pipelined instruction mode 
o 
Scalar instruction mode 
o 
Dual-Instruction Mode 
1 
Dual-instruction mode 
o 
Single-instruction mode 


Figure 10_4_Floating-Point Instruction 
Encoding 


Table 10_7_Floating-Point Opcodes 
654 
3 
2 
1 


pfam 
Add and Multiply' 
0 
0 
0 
OPC 
pfmam 
Multiply with Add' 
pfsm 
Subtract and Multiply' 
0 
0 
1 
OPC 


pfmsm 
Multiply with Subtract' 


(p)fmul 
Multiply 
0 
1 
0 
0 
0 
0 
0 
fmlow 
Multiply Low 
0 
1 
0 
0 
0 
0 
1 
frcp 
Reciprocal 
0 
1 
0 
0 
0 
1 
0 
frsqr 
Reciprocal Square Root 
0 
1 
0 
0 
0 
1 
1 
pfmul3_dd 
3-Stage Pipelined Multiply 
0 
1 
0 
0 
1 
0 
0 


(p)fadd 
Add 
0 
1 
1 
0 
0 
0 
0 
(p)fsub 
Subtract 
0 
1 
1 
0 
0 
0 
1 
(p)flx 
Fix 
0 
1 
1 
0 
0 
1 
0 
(p)famoY 
Adder Move 
0 
1 
1 
0 
0 
1 
1 
pfgtlpfle" 
Greater Than 
0 
1 
1 
0 
1 
0 
0 
pfeq 
Equal 
0 
1 
1 
0 
1 
0 
1 
(p)ftrunc 
Truncate 
0 
1 
1 
1 
0 
1 
0 


fxfr 
Transfer to Integer Register 
1 
0 
0 
0 
0 
0 
0 
(p)fladd 
Long-Integer Add 
1 
0 
0 
1 
0 
0 
1 
(p)flsub 
Long-Integer Subtract 
1 
0 
0 
1 
1 
0 
1 


(p)fzchkl 
Z-Check Long 
1 
0 
1 
0 
1 
1 
1 
(p)fzchks 
Z-Check Short 
1 
0 
1 
1 
1 
1 
1 
(p)faddp 
Add with Pixel Merge 
1 
0 
1 
0 
0 
0 
0 
(p)faddz 
Add with Z Merge 
1 
0 
1 
0 
0 
0 
1 
(p)form 
OR with MERGE Register 
1 
0 
1 
1 
0 
1 
0 


NOTE: 
All opcodes not shown are reserved . 
• pfam and pfsm have P-bit set; pfmam and pfmsm 
have P-bit clear. 
•• pfgt has R bit cleared; pfle has R bit set. 
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Table 
10.8. OPC Encoding 


OPC 
PFAM 
PFSM 
M-Unit 
M-Unit 
A-Unit 
A-Unit 
T 
K 
Mnemonic 
Mnemonic 
01 
op2 
op1 
op2 
Load 
Load· 


0000 
r2p1 
r2s1 
KR 
src2 
src1 
M result 
No 
No 
0001 
r2pt 
r2st 
KR 
src2 
T 
M result 
No 
Yes 
0010 
r2ap1 
r2as1 
KR 
src2 
src1 
A result 
Yes 
No 
0011 
r2apt 
r2ast 
KR 
src2 
T 
A result 
Yes 
Yes 


0100 
12p1 
i2s1 
KI 
src2 
src1 
M result 
No 
No 
0101 
12pt 
i2st 
KI 
src2 
T 
M result 
No 
Yes 
0110 
12ap1 
i2as1 
KI 
src2 
src1 
A result 
Yes 
No 
0111 
12apt 
i2ast 
KI 
src2 
T 
A result 
Yes 
Yes 


1000 
rat1p2 
rat1s2 
KR 
A result 
src1 
src2 
Yes 
No 
1001 
m12apm 
m12asm 
src1 
src2 
A result 
M result 
No 
No 
1010 
ra1p2 
ra2s2 
KR 
A result 
src1 
src2 
No 
No 
1011 
m12ttpa 
m12ttsa 
src1 
src2 
T 
A result 
Yes 
No 


1100 
lat1p2 
iat1s2 
KI 
A result 
src1 
src2 
Yes 
No 
1101 
m12tpm 
m12tsm 
src1 
src2 
T 
M result 
No 
No 
1110 
la1p2 
ia1s2 
KI 
A result 
src1 
src2 
No 
No 
1111 
m12tpa 
m12tsa 
src1 
src2 
T 
A result 
No 
No 


DPC 
PFMAM 
PFMSM 
M-Unit 
M-Unit 
A-Unit 
A-Unit 
T 
K 
Mnemonic 
Mnemonic 
op1 
op2 
op1 
op2 
Load 
Load· 


0000 
mr2p1 
mr2s1 
KR 
src2 
src1 
M result 
No 
No 
0001 
mr2pt 
mr2st 
KR 
src2 
T 
M result 
No 
Yes 
0010 
mr2mp1 
mr2ms1 
KR 
src2 
src1 
M result 
Yes 
No 
0011 
mr2mpt 
mr2mst 
KR 
src2 
T 
M result 
Yes 
Yes 


0100 
ml2p1 
mi2s1 
KI 
src2 
src1 
M result 
No 
No 
0101 
ml2pt 
mi2st 
KI 
src2 
T 
M result 
No 
Yes 
0110 
ml2mp1 
mi2ms1 
KI 
src2 
src1 
M result 
Yes 
No 
0111 
ml2mpt 
mi2mst 
KI 
src2 
T 
M result 
Yes 
Yes 


1000 
mrmt1p2 
mrmt1s2 
KR 
M result 
src1 
src2 
Yes 
No 
1001 
mm12mpm 
mm12msm 
src1 
src2 
M result 
M result 
No 
No 
1010 
mrm1p2 
mrm1s2 
KR 
M result 
src1 
src2 
No 
No 
1011 
mm 12ttpm 
mm12ttsm 
src1 
src2 
T 
M result 
Yes 
No 


1100 
mlmt1p2 
mimt1s2 
KI 
M result 
src1 
src2 
Yes 
No 
1101 
mm12tpm 
mm12tsm 
src1 
src2 
T 
M result 
No 
No 
1110 
mlm1p2 
mim1s2 
KI 
M result 
srct 
src2 
No 
No 


1111 
Intel Reserved 


NOTE: 
• If K·load 
is set, KR is loaded 
when 
operand-I 
of the multiplier 
is KR; KI is loaded 
when 
operand-1 
of the multiplier 
is KI. 
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10.3 Instruction Timings 


Generally, 
i860 XP microprocessor 
instructions 
take 
one 
clock 
to execute. 
unless 
a freeze 
condition 
is 
invoked. 
De~d 
times, along with freeze conditions 
and their associated 
delays, 
are shown 
in the table 
on the following 
pages. 
The following 
symbols 
are 
used for brevity in the timing table: 
+n 
n clocks 
must 
be added 
to the execution 
time if the stated 
conditions 
apply. 
~ 
n The processor 
requires at least n clocks be- 
tween the indicated 
instructions. 
The actual 
delay will be n minus the number 
of clocks 
for 
e~ecuting 
intervening 
instructions 
(or 
dual-mode 
pairs). If the time for intervening 
instructions 
is ::?; n, there is no delay. 
n..m 
Indicates 
a range 
of clocks. 
These 
cases 
are accompanied 
by a reference 
to a note 
where further 
explanation 
is available. 


XR: 
Applies 
to i860 XR microprocessors 
only. 


XP: 
Applies 
to i860 XP microprocessors 
only. 


OA 
The number of clocks to finish all outstand- 
ing accesses. 


R1 
The number 
of clocks 
from ADS# 
through 
the 
first 
READY # 
(80860XR) 
or BRDY # 
(80860XP) 
of the indicated 
bus activity. 


R2 
The number 
of clocks 
from ADS# 
through 
the second 
READY # or BRDY #. 


RL 
The number 
of clocks 
from ADS# 
through 
the last READY # or BRDY #. 


RL 1 
XP: 
The 
number 
of 
clocks 
through 
last 
BRDY # of first access. 


RN 
XR: The number of clocks 
until next nonre- 
peated 
address 
can be issued 
(i.e., an ad- 
dress 
that 
is not the 
2nd-4th 
cycle 
of a 
cache fill, the 2nd-8th 
cycle of a CS8 mode 


. instruction 
fetch, nor the 2nd cycle of a 128- 
bit write). 


RX 
The number 
of clocks 
throug!1 READY # or 
BRDY # for the next 64-bit-or-less 
write cy- 
cle or second 
READY # or BRDY # for the 
next 128-bit write cycle. 


a. 


NOTES: 


"Address 
path full" 
means 
one address 
inter- 
nally waiting for bus while external 
bus pipeline 
full. 
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b. 
"Store 
path full" means two stores or one 256- 
bit write-back 
internally 
waiting for bus plus ex- 
ternal bus pipeline 
full. 


c. 
If a floating-point 
instruction, 
graphics-unit 
in- 
struction, 
fst, or pst is executed 
when a scalar 
floating-point 
operation 
(other 
than 
frcp 
or 
frsqr) 
is in progress, 
the scalar operation 
must 
complete 
first: two additional 
clocks 
for fadd, 
fix, 
fmlow, 
fmul.ss, 
fmul.sd, 
ftrunc, 
and 
fsub; 
three additional 
clocks 
for fmul.dd. 
Add 
one if either or both of these situations 
occur: 


1. There is an overlap 
between 
the result reg- 
ister of the 
previous 
scalar 
operation 
and 
the source 
of the floating-point 
operation, 


and the destination 
precision 
of the scalar 
operation 
differs 
from the source 
precision 
of the floating-point 
operation. 


2. The 
floating-point 
operation 
is 
pipelined 
and its destination 
is not fO. 


TLB 
TLB 
miss. 
Five 
clocks 
plus 
the 
number 
of 
clocks 
to finish two reads plus the number 
of 
clocks 
to set A-bits (if necessary). 


In addition, 
any instruction 
may be delayed 
due to an 
instruction 
cache 
miss or TLB 
miss during 
the 
in- 
struction 
fetch. 
The time for a TLB 
miss is shown 
above 
in note TLB. 
An instruction 
cache 
miss adds 
the following 
delays: 


• 
The number 
of clocks 
to get the next instruction 
' 


from 
the bus (ADS# 
clock 
to first 
READY # 
or 
BRDY # clock, 
inclusive). 


• 
XR: When 
any of the instructions 
in the new in- 
struction-cache 
line is a branch 
or call or causes 
a freeze, 
the time through 
the last READY # for 
the new line. 


• 
If the data cache 
is being accessed 
when the in- 
struction-cache 
miss occurs, 
two clocks 
for data 
cache miss; one clock for hit. 


Not included 
in the table 
is the delay 
caused 
by a 
trap. This depends 
on the trap handler. 


In dual instruction 
mode, 
each 
pair of instructions 
requires the maximum 
of the times required 
by each 
individual 
instruction. 


Instruction 
Execution 
Clocks 
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'Condltlon 


adds 


addu 


and 


andh 


and not 


andnoth 


bc 


bc.t 


bla 


bnc 


bnc.t 


br 


brl 


bte 


btne 


call 
1 
+1 
+1 +R1 
+1+R2 


1 
2 
+1 


1 
2 
+ 


If branch not taken. 
If branch taken. 
If the prior instruction 
is addu, 
adds, 
subu, subs, 
pfeq, 
or pfgt. 


calli 
2 
+1 
+1 +R1 
+1+R2 


fadd.p 
1 
~2 
..4 


If branch taken. 
If branch not taken. 
If the prior instruction 
is addu, 
adds, 
subu, subs, 
pfeq, 
or pfgt. 


1 
2 
If branch taken. 
If branch not taken. 


(same as bc) 


(same as bc.t) 


2 


1 
3 
If branch not taken. 
If branch taken. 


(same as bte) 


If r1 referenced 
in next instruction. 
If data cache load miss in progress 
for a read of less than 128 bits. 


If data cache load miss in progress 
for 128-bit read. 


If r1 referenced 
in next instruction. 


If data cache load miss in progress 
for a read of less than 128 bits. 


If data cache load miss in progress 
for 128-bit read. 


( ... 
and all other A-unit instructions 
except dual operations) 
If executed 
when a scalar floating-point 
operation 
(other than frcp 


or frsqr) 
is in progress.(e) 
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Execution 
Instruction 
Clocks 
Condition 


faddp 


faddz 


famoY.r 


fladd.w 


flsub.w 


flx.Y 


fld.y 


flush 


fmlow.dd 


fmev.r 


fmul.p 


1 
( ... 
and all other G·unit instructions 
except fladd.w, 
fxfr) 
+ 1 
If fdest is used by next instruction 
and next instruction 
is G·, M· or A·unit instruction 
- 
2..4 
If executed 
when a scalar floannq-point 
operation 
(other than frcp 
or frsqr) 
is in 
progress.(c) 


(same as faddp) 


(same as fadd.p) 


1 
+ 1 
If fdest is used by next instruction 
and next instruction 
is M· or A·unit instruction 
(except when fiadd 
is used for fmoy.dd 
or fmoy.ss). 


+ 1 
If fdest is used by next instruction 
and next instruction 
is G·unit instruction. 


- 
2..4 
If executed 
when a scalar floating·point 
operation 
(other than frcp 
or frsqr) 
is in 
progress.(c) 


(same as faddp) 


(same as fadd.p) 


1 


+ 1 
If this is the instruction 
after a st, fst or pst 
that hits the data cache. 


- 
2 
If fdest is referenced 
in the next two instructions. 
+ 1+ R1 
If 32·bit fld.l or 54·bit fld.d 
misses the data cache. 
+ 1+ R2 
If 128·bit fld.q 
misses the data cache. 


+ 1+ RL 
If data cache load miss in progress 
(except in the following 
case). 


- 
2 
XP: If this instruction 
follows a data cache access that misses in the virtual tags but 
hits in the physical tags. 
+ 2 
XP: If the prior instruction 
is a pfld.y 
that hits a modified 
line in the data cache. 


+ R2 
XP: If data-cache 
line write-back 
due to snoop is in progress. 


+ RN 
XR: If address 
path fulL<a) 
+ RL1 
XP: If address 
path fulL<a) 
+ TLB 
If TLB miss. 


1 
- 
3 
XR: If preceded 
by another flush. 


- 
2 
XP: If preceded 
by another flush. 


+ R2 
XP: If data-cache 
line write-back 
due to snoop is in progress. 


+ 1+ RX 
If flush to modified 
line when store path fulL<b) 
+ TLB 
If TLB miss. 


1 
( ... 
and all other M·unit instruction 
except dual operations) 


+ 1 
If fsrct refers to result of the prior operation 
(either scalar or pipelined). 
+ 1 
If the prior operation 
is a double-precision 
multiply. 


- 
2..4 
If executed 
when a scalar tloatinq-polnt 
operation 
(other than frcp 
or frsqr) 
is in 
progress.!c) 


rmcv.ss 
and fmoy.dd 
same as fiadd.w 
fmcv.sd 
and fmoy.ds 
same as fadd.p 


(same as fmlow.dd) 
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Instruction 
Execution 
Clocks 


Id.x 
1 
+1 
+1 
+1 +RL 
-1+R1 


-2 


+2 
+R2 
+RN 
+RL1 
+1+RX 
+TLB 


Idint.x 


Idio.x 


lock 


mov 


nop 


or 


orh 


pfadd.p 


pfaddp 


pfaddz 


pfam.p 
1 
+1 
+1 
-2 
..4 


Condition 


If ides! referenced 
in next instruction. 
If this is the instruction 
after a st, fst or pst 
that hits the data cache. 


If data cache load miss in progress. 
If Id.x misses the data cache and a subsequent 
instruction 
references 
the 
ides! of the Id.x (except for following 
case). 


XP: If this instruction 
follows 
a data cache access that misses in the virtual 
tags but hits in the physical tags. 
XP: If the prior instruction 
is a pfld.y 
that hits a modified 
line in the data cache. 


XP: If data-cache 
line write-back 
due to snoop is in progress. 


XR: If address 
path full.!a) 
XP: If address path full.(a) 
If cache miss when store path fulL<b) 
IfTLB 
miss. 


+ OA 


+ OA 


(same as fadd.p) 


(same as faddp) 


(same as faddp) 


( ... 
and all other dual operations) 
If tsrc 1 refers to result of the prior operation 
(either scalar or pipelined). 


If the prior operation 
is a double-precision 
multiply. 
If executed 
when a scalar floating-point 
operation 
(other than frcp 
or frsqr) 
is 
in progress.!c) 


pfamov.r 
(same as fadd.p) 


pfeq.p 
(same as fadd.p) 


pfgt.p 
(same as fadd.p) 


pfiadd.w 
(same as faddp) 


pfisub.w 
(same as faddp) 


pflx.v 
(same as fadd.p) 
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Instruction 
Execution 
Clocks 
Condition 


pfld.y 


pfle.p 


pfmam.p 


pfmov.r 


pfmsm.p 


pfmul.p 


pfmul3.dd 


pform 


pfsm.p 


pfsub.p 


pftrunc.v 


pfzchkl 


pfzchks 


pst.d 


scyc.x 


shl 


shr 


shra 


shrd 


st.c 


1 
+1+RL 
+-+2 
+1 +RL1 
+2+0A 
+2 


+-+2 


1 + OA 


3 
+1 +R1 
+1+R2 


+R2 
+RN 
+RL1 
+TLB 


If data cache load miss in progress. 
If tdest is referenced 
in the next two instructions. 


If three pfld's are outstanding. 
XR: If pfld hits data cache. 
XP: If the prior instruction 
is a pfld.y that hits a modified 
line in the 
data cache. 
XP: If this instruction 
follows 
a data cache access that misses in 
the virtual tags but hits in the physical tags. 
XP: If data-cache 
line write-back 
due to snoop is in progress. 


XR: If address path fulUa) 
XP: If address path fulUa) 
IfTLB 
miss. 
• 
(same as pfam.p) 


pfmov.ss 
and pfmov.dd 
same as faddp 
pfmov.sd 
and pfmov.ds 
same as fadd.p 


(same as pfam.dd) 


(same as fmlow.dd) 


(same as fmlow.dd) 


(same as faddp) 


(same as pfam.dd) 


(same as fadd.p) 


(same as fadd.p) 


(same as faddp) 


(same as faddp) 


(same as fst.d) 


If data cache load miss in progress 
for a read of less than 128 bits. 
If data cache load miss in progress 
for 128-bit read. 
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If data cache load miss in progress. 
XP: If the prior instruction is a pfld.y that hits a modified line in the data cache. 
XP: If this instruction follows a data cache access that misses in the virtual 
tags but hits in the physical tags. 
XP: If data-cache line write-back due to snoop is in progress. 
XR: If address path fulUa) 
XP: If address path fulUa) 
If cache miss when store path fulUb) 
IfTLB miss. 


Instruction 
Execution 
Clocks 


st.x 
1 
+1+RL 
+2 
-2 


+R2 
+RN 
+RL1 
+1+RX 


+TLB 


stlo.x 
1 + OA 


subs 


subu 


trap 


unlock 


xor 


xorh 


Condition 


10.4 Instruction Characteristics 


The following table lists some of the characterisics 
of each instruction. The characteristics are: 


• What processing unit executes the instruction. 


The codes for processing units are: 
A 
Floating-point adder unit 
E 
Core execution unit 
G Graphics unit 
M Floating-point multiplier unit 
• Whether the instruction is pipelined or not. A P 


indicates that the instruction is pipelined. 
• Whether the instruction is a delayed branch in- 
struction. A D marks the delayed branches. 
• Whether execution is suppressed in user mode. 


An SU marks supervisor-only instructions. 
• Whether the instruction is available on both the 


i860 XR and i860 XP microprocessors. An XL 
marks instructions that are available only on the 
i860 XP microprocessor. 


• Whether the instruction changes the condition 
code CC. A CC marks those instructions that 
change CC. 


• Which faults can be caused by the instruction. 
The codes used for exceptions are: 
IT 
Instruction Fault 
SE 
Floating-Point Source Exception 
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RE 
Floating-Point Result Exception, including 
overflow, underflow, inexact result 
OAT 
Data Access Fault 


Note that this is not the same as specifying at which 
instructions faults may be reported. A result excep- 
tion is reported on the subsequent floating-point in- 
struction, pst, fst, or sometimes fld, pfld, and Ixfr. 


The instruction access fault IAT and the interrupt 
trap IN are not shown in the table because they can 
occur for any instruction. 


• Performance notes. These comments regarding 
optimum performance are recommendations only. 
If these recommendations are not followed, the. 
i860 XP microprocessor automatically waits the 
necessary number of clocks to satisfy internal 
hardware requirements. The following notes de- 
fine the numeric codes that appear in the instruc- 
tion table: 


1. The following instruction should not be a condi- 
tional branch (bc, bnc, bc.t, or bnc.t). 


2. The destination should not be a source oper- 
and of the next two instructions. 


3. A load should not directly follow a store that is 
expected to hit in the data cache. 
4. When the 
prior instruction 
is scalar, tsrc 1 


should not be the same as the {des! of the prior 
operation. 
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5. The fdest should 
not reference 
the destination 
of the 
next 
instruction 
if that 
instruction 
is a 
pipelined 
floating-point 
operation. 
6. The destination 
should 
not be a source 
oper- 
and of the next instruction. 
(For call and calli, 
the destination 
is r1.) 
7. When the prior operation 
is scalar and multipli- 
er op1 is fsrc1, fsrc2should not be the same as 
the fdest of the prior operation. 
8. When 
the 
prior 
operation 
is scalar, 
src1 and 
src2 of the current 
operation 
should 
not be the 
same as dest of the prior operation. 


9. A pfld 
should 
not immediately 
follow 
a pfld. 


• 
Programming 
restrictions. 
These 
indicate 
combi- 
nations of conditions 
that must be avoided 
by pro- 
grammers, 
assemblers, 
and 
compilers. 
The 
fol- 
lowing 
notes 
define 
the 
alphabetic 
codes 
that 
appear 
in the instruction 
table: 


a, The sequential 
instruction 
following 
a delayed 
control-transfer 
instruction 
may not be another 
control-transfer 
instruction, 
nor a trap 
instruc- 
tion, nor the target of a control-transfer 
instruc- 
tion. 


b. When using a brl to return from a trap handler, 


, programmers 
should take care to prevent traps 
from occurring 
on that or on the next sequen- 
tial instruction. 
IM should 
be zero 
(interrupts 
disabled) 
when the brl is executed. 


c. 
If fdest is not zero, fsrc1 must not be the same 
as fdest. 


d. When fsrc1 goes to multiplier 
op1 or to KR or 
KI, fsrc1 miJst not be the same as fdest. 


e, 
If dest is not zero, src1 and src2 must not be 
the same as dest. 
2 


f. 
Isrc1 must not be the same register as ;src2for 
the autoincrementing 
form of this instruction. 


g. Isrc1 must not be the same register 
as ;src2. 


h. Do not use the flush instruction 
in Ouallnstruc- 
tion Mode or locked 
sequences. 


Plpellned? 


InJtructlon 
Execution 
' Delayed? 
Sets 
Faults 
Performance 
Programming 
Unit 
Supervisor? 
CC? 
Notes 
Restrictions 
1860TM XP Only? 


adds 
E 
CC 
1 
addu 
E 
CC 
1 
and 
E 
CC 
andh 
E 
CC 
andnot 
E 
CC 


andnoth 
E 
CC 
bc 
E 


bc.t 
E 
0 
a 
bla 
E 
0 
a,9 
bnc 
E 


bnc.t 
E 
0 
a 
br 
E 
0 
a 
brl 
E 
0 
a,b 
bte 
E 
btne 
E 


call 
E 
0 
6 
a 
calli 
E 
0 
6 
a 
fadd.p 
A 
SE,RE 
faddp 
G 
8 
faddz 
G 
8 


famov.r 
A 
SE,RE 
fladd.w 
G 
8 
flsub.w 
G 
8 
flx.p 
A 
SE,RE 
fld.y 
E 
OAT 
2,3 
f 


NOTES: 
• On the i860 XP microprocessor, 
the pipelined 
instructions 
can generate 
ITR with PI. 


•• 
On the i860 XR micropocessor, 
the 128-bit 
pfld.q 
is not available. 
If used it causes 
an instruction 
trap. 
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Plpelined? 
Execution 
Delayed? 
Sets 
Performance 
Programming 
Instruction 
Unit 
Supervisor? 
CC? 
Faults 
Notes 
Restrictions 
1860TM XP Only? 


flush 
E 
h 
fmlow.dd 
M 
4 
fmul.p 
M 
SE,RE 
4 
form 
G 
8 
.,.... 


frcp.p 
M 
SE,RE 


frsqr.p 
M 
SE,RE 
fsly 
E 
OAT 
5 
f 
fsub.p 
A 
SE,RE 
ftrunc.p 
A 
SE,RE 
fxfr 
G 
6,8 


fzchkl 
G 
8 
fzchks 
G 
8 
, 


Intovr 
E 
IT 
Ixfr 
E 
2 
Id.c 
E 
~ 
. 


Id.x 
E 
OAT 
6 


Idlnt.x 
E 
SU,XP 
OAT 
Idlo.x 
11 
E 
SU,XP 
OAT 
lock 
E 
or 
E 
CC 
orh 
E 
CC 


pfadd.p 
A 
P 
SE,RE' 
pfaddp 
G 
P 
• 
8 
e 
pfaddz 
G 
P 
• 
8 
e 
pfam.p 
A&M 
P 
SE,RE' 
7 
d 
pfamov.r 
A 
P 
SE,RE' 


pfeq.p 
A 
P 
CC 
SE' 
1 
pfgt.p 
A 
> 
P 
CC 
SE' 
1 
pfladd.w 
G 
P 
• 
8 
e 
pflsub.w 
G 
P 
• 
8 
e 
pflx.p 
A 
P 
SE,RE' 


pfld.y 
E 
P,(XP)" 
OAT" 
2,9 
f 
pfmam.p 
A&M 
P 
SE,RE' 
7 
d 
pfmsm.p 
A&M 
P 
SE,RE' 
7 
d 
pfmul.p 
M 
P 
SE,RE' 
4 
c 
pfmul3.dd 
M 
P 
SE,RE' 
4 
c 


pform 
G 
P 
• 
8 
e 
pfsm.p 
A&M 
P 
SE,RE' 
7 
d 
pfsub.p 
A 
P 
SE,RE' 
pftrunc.p 
A 
P 
SE,RE' 
pfzchkl 
G 
P 
· 
8 


NOTES: 


• On the i860 XP microprocessor, 
the pipelined 
instructions 
can generate 
ITR with PI. 


•• On the i860 XR micropocessor, 
the 128-bit pfld.q 
is not available. 
It used it causes 
an instruction 
trap. 
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Plpellned? 
I 
Programming 
Execution 
Delayed? 
Sets 
Performance 
Instruction 
Unit 
Supervisor? 
CC? 
Faults 
Notes 
Restrictions 


1860TM XP Only? 


pfzchks 
G 
P 
• 
8 
pst.d 
E 
OAT 
5 
f 
scyc.x 
E 
SU,XP 
OAT 
shl 
E 
shr 
E 


shra 
E 
shrd 
E 
st.c 
E 


I 
E 
OAT 
st.x 
stlo.x 
E 
SU,XP 
OAT 


subs 
E 
CC 
1 
subu 
E 
CC 
1 
trap 
E 
IT 
unlock 
E 
xor 
E 
CC 
xorh 
E 
CC 


NOTES: 
. 
PI 
'On the i860 XP microprocessor, the pipelined instructions can generate ITA with 
'. 
. 
"On 
the i860 XA microprocessor, the 128-bit pfld.q is not available. If used it causes an Instruction trap. 


10.5 Software Compatibility 


10.5.1 REQUIRED CHANGES 


To port existing systems software from the i860 XR 
microprocessor to the i860 XP microprocessor, the 
following changes may be required. Applications 
software does not require changes. 
1. Data cache flush. All four ways of the data cache 
must be flushed on the i860 XP microprocessor. 
The cache flush routine can be modified to check 
processor type in epsr or the OCS field of dlr- 
base and flush the appropriate number of ways. 


2. Parity and bus error traps. If the i860 XP system 
signals these errors, the trap handler must be ex- 
tended to handle them. Software must avoid test- 
ing the BEF and PEF bits unless executing on the 
i860 XP microprocessor. 


3. LOCK# deactivation. On the i860 XP microproc- 
essor, traps do not automatically deactivate the 
LOCK# signal, so the trap handler must do a 
data access to deactivate LOCK#. Trap handlers 
that already access data soon after invocation do 
not require this modification. 
4. Load pipe precision. The precision of the last 
stage of the load pipeline is specified by the LRP 
bit on the i860 XR microprocessor but by the 
LRPOand LRP1 bits on the i860 XP microproces- 


sor. The procedure that restores the load pipe 
must check the processor type, use the appropri- 
ate bits, and restore the correct precision. Pipe 
restoration code for the i860 XR microprocessor 
will work correctly on the i860 XP microprocessor 
if pfld.q is not used. 
5. Pre-accessed trap handler pages. Page-directory 
and page-table entries for the instruction pages 
of the trap handler and for the first data page 
accessed by the trap handler must always have 
A = 1. Software modified to allocate page tables 
this way works on both i860 XR and i860 XP mi- 
croprocessors. 
6. Page directory entry bit 7 must be zero. This is 
the bit that selects four Mbyte or four Kbyte page 
size. On the i860 XR microprocessor, it is 
re- 
served and should be set to zero. It must be set 
to zero for four Kbyte pages to work on the 
i860 XP microprocessor. 


10.5.2 PERFORMANCE OPTIMIZATIONS 


Software developers may wish to make the following 
performance enhancements in systems software for 
the i860 XP microprocessor. Systems software that 
must execute on both i860 XP and i860 XR systems 
can contain code both with and without the optimiza- 
tions. By testing the processor type, the appropriate 
instruction path can be determined. 
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1. Data cache flush. On the i860 XP microproces- 
sor, a complete flushing of the data cache is not 
needed when changing context or marking a 
page not present. 


2. The epsr bits AI, 01, PI, and PT can be used on 


the i860 XP microprocessor to make trap han- 
dlers more efficient. 


3. Four-Mbyte pages can be allocated to frame buff- 
ers and the operating-system kernel, thereby re- 
ducing the cost of TlB misses. 


10.5.3 NEW FEATURES 


Software that uses the new features available only 
on the i860 XP microprocessor will not be compati- 
ble with the i860 XR microprocessor unless alter- 
nate instruction paths are provided. 


Systems software features: 
1. New instructions Idlo, stlo, Idlnt, and scyc. 
2. Four-Mbyte pages. 
3. Privileged Registers pO,p1, p2, and p3. 
4. Concurrency control unit. 
5. 128-bit load instruction pfld.q. 
6. Support for virtual address aliases. 


Applications 
software features: 
1. Concurrency control unit. 
2. 128-bit load instruction pfld.q. The i860 XR mi- 
croprocessor traps on pfld.q; therefore, software 
has the opportunity to emulate a pfld.q with two 
pfld.d instructions. However, this strategy does 
not yield optimal performance on the i860 XR mi- 
croprocessor. 


10.5.4 NOTES 


On the i860 XP microprocessor, pages with WT = 1 
are cached with the write-through policy; whereas, 
on the i860 XR microprocessor, they are not cached 
at all. Because this change in the function of WT 
was anticipated in the i860 XR microprocessor docu- 
mentation, no incompatibility should arise. 


11.0 DATA 
BOOK 
REVISION 
SUMMARY 


The following list represents the differences be- 
tween version 001 and version 002 of the 80860XP 
Microprocessor Data Book. 
1. Section 2.2.3-Description 
of the IN bit expand- 
ed to clarify under what conditions the bit is set. 


2. Section 2.2.4-Description 
of the BS bit on page 
15 expanded to better explain the relationship 
between the Bus error Address Register (BEAR), 
BERR, and BRDY# and to clarify how to set the 
bit correctly. 
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3. Section 2.2.5-Description 
of the PI bit on page 
16 expanded to clarify the relationship between 
the PI bit and the KNF (Kill Next Floating-point) 
instruction. 
4. Table 2.4-Modified 
the meaning of numerical 
value 11 to show that TlB 
replacement is dis- 


'abled, 


5. Table 3.4-Added 
note 4. 


6. Section 4.2.5-Elaborated 
on the relationship of 
the BERR and BRDY# pins and the Bus Error 
Address register (BEAR). 
7. Section 
4.2.17-Modified 
the 
parameters 
of 
EWBE# assertion during RESET which causes 
the processor to select either strong or weak or- 
dering mode. 
8. Section 
4.2.18-Modified 
the 
parameters 
of 
FLlNE# assertion during RESET which causes 
the processor to select either normal or late 
back-off mode. 
9. Section 4.2.24-Modified 
the parameters of INT 
assertion during RESET which causes the proc- 
essor to select either normal or code size eight 
(CS8) mode. 


10. Section 4.2.33-Elaborated 
on the relationship 
of PCHK# to PEN# and BRDY#. 
11. Section 
4.2.35-Modified 
the 
parameters 
of 
PEN# assertion during RESET which causes 
the processor to select either small or large out- 
put buffer mode. 
12. Section 4.2.37-Deleted 
sentence stating RE- 
SET was synchronous with ClK. 
13. Figure 5.3-Modified 
the transition point of the 
W/R# 
pin. 
14. Figure 5.5-Modified 
the BRDY#, KEN#, and 
DATA signals so that each one becomes active 
one clock after ADS#. 


15. Table 5.2-Added 
the three asterisks in the 
'Data Cache Read Miss KEN# = l' column so 
that note # 1 now pertains to them. 
16. Section 5.3.4-Clarified 
what it means when 
EWBE# is asserted and deasserted. 


17. Figure 5.27-Modified 
the ADS# 
and W/R# 
signals. 
18. Figure 5.32-Modified 
the assertion point of 
EWBE# and FLlNE# signals. Added a separate 
signal for INTICS8 as its assertion point during 
RESET is different. Added note 2. 


19. Table 7.2-Modified 
table so that data lines DO 
through D63 are sequential. 


20. Section 10.4-Added 
note H to the 'Program- 
ming Restrictions' section. 
21. Section 11.O-Added 
the Data Book Revision 
Summary section showing the differences be- 
tween revisions 001 and 002. 
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12.0 
DESIGN NOTES 


1. VccCLK 


VCCCLK (P01) is the power 
supply 
to the internal 
clock-generation 
Phase-locked-Loop 
(PLL). As a good 
design practice, 
a 0.1 /J-Fde-coupling 
capacitor 
should be placed 
near VccCLK 
to connect 
from vcc plane to 
ground 
plane. 
' 


2. A.C.lD.C. 
Characteristics 
for BO-stepping 


The A.C. and D.C. characteristics 
for the BO-stepping 
at 50 MHz and 40 MHz are guaranteed 
only at the 
specified 
operating 
conditions. 
See the attached 
Tables 
1 and 3. Note the VIHC (clock 
input high voltage), 
temperature, 
and vcc limits. 


3.BRDYH,BRDYU 


BRDY2# 
is equivalent 
to BRDY# 
(U01) as shown 
in the data sheet. 


BRDY1 # is equivalent 
to RSRVD (P04) as shown in the data sheet. 
BRDY1 # (a CMOS level input) is driven 
by the secondary 
cache 
(if present), 
while BRDY2# 
(a TTL input) comes from the bus controller. 
These pins 
are ORed together 
and their ORed value is used by internal logic. If the 82495XP 
secondary 
cache is not used, 
BRDY1 # must be inactive 
for at least three 
clocks 
preceding 
the falling 
edge of the RESET 
signal, 
and all 
inputs and outputs 
are TTL compatible. 


CM OS mode is triggered 
by having BRDY1 # active at RESET. When the CPU is configured 
in CMOS 
mode, 
some inputs are CMOS level compatible. 
The CMOS inputs are those that have an entry in the "CMOS 
Level" 


in the A.C. Characteristics 
tables. 


The input threshold 
of CMOS inputs is 2.5V. Setup and hold times for CMOS inputs are specified 
relative to the 
1.5V level of the rising edge of CLK and refer to the point that the CMOS input reaches 
2.5V. Output specifica- 
tions in the "CMOS 
Level" 
column 
are relative 
to the point the signal reaches 
2.5V. 


4. PFLD Bandwidth 


The memory bandwidth 
of the PFLD instruction 
in the i860™ XP processor 
is partly documented 
in the i860™ 
XP Microprocessor 
Data Book, pg. 133 (timing description 
of PFLD instruction). 


To summarize, 
external 
memory 
wait states and Late-Backoff-Mode 
decrease 
the bandwidth, 
resulting 
in the 
following 
peak speeds: 


• 200 MB/s 
for PFLD.D (50 MHz, 3 wait states or faster) 
• 178 MB/s 
for PFLD.D with 4 wait states 
• 160 MB/s 
for PFLD.D due to LB-mode + 4 WS 
• 400 MB/s 
for PFLD.Q (50 MHz, 2 wait states or faster) 
• 356 MB/s 
for PFLD.Q with 3 wait states 
• 320 MB/s 
for PFLD.Q with 4 wait states 
• 291 MB/s 
for PFLD.Q due to LB-mode + 4 WS 


(11 % degradation) 
(20% degradation) 


(11 % degradation) 
(20% degradation) 
(27% degradation) 


This bandwidth 
analysis 
results from simulations 
of the CPU RTL (Register 
Transfer 
Language) 
model. 
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13.0 
INDEX 


A 


AA 


fsr U-bit (update 
bit), 2.2.8 


16-bit pixel 


data type, 2.1.4 


access 
rights 


address 
translation 
caches, 
3.1 


A.C. characteristics 


electrical 
data, 9.3 


8-bit pixel 


data type, 2.1.4 


16-bit values 


alignment 
requirements, 
2.3 


32-bit binary floating-point 


single-precision 
real, 2.1.3 


addressing 


i860 XP microprocessor, 
2.3 


modes, 
2.7 


32-bit integer 


data type, 2.1.1 


address 
space 


consistency, 
3.3.1 


address 
translation 


algorithm, 
2.4.5 


caches, 
3.1 


faults, 2.4.6 


P (present) 
bit, 2.4.4.2 


virtual addressing, 
2.4 


adds (Add Signed) 


epsr OF (overflow 
flag), 2.2.4 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


32-bit ordinal 


data type, 2.1.2 


32-bit pixel 


data type, 2.1.4 


32-bit values 


alignment 
requirements, 
2.3 


64-bit binary floating-point 


double-precision 
real, 2.1.3 


floating-point 
register file, 2.2.2 


64-bit integer 


data type, 2.1.1 


floating-point 
register 
file, 2.2.2 


64-bit values 


alignment 
requirements, 
2.3 


128-bit load and store instructions 


floating-point 
register 
file, 2.2.2 


128-bit values 


alignment 
requirements, 
2.3 


82495XP/82490XP 
cache 


BRDY # (burst ready), 4.2.7 


external 
secondary 
cache, 
1.0 


write-once 
policy, 3.2.4.2 


A31-A3 
(address 
pins) 


signal description, 4.2.1 


addu (Add Unsigned) 


epsr OF (overflow 
flag), 2.2.4 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


ADS# 
(address 
status) 


AHOLD 
(address 
hold), 4.2.3 


signal description, 
4.2.2 


A (accessed) 


page-table 
entries 
(PTEs), 2.4.4.6 


AE 


fsr U-bit (update 
bit), 2.2.8 


AHOLD 
(address 
hold) 


bus arbitration, 
5.2 


signal description, 
4.2.3 


algorithm 


address 
translation, 
2.4.5 


cache replacement, 
3.2.3 


aliasing 


instruction 
cache, 
3.2.2 


internal 
instruction 
and data caches, 
3.2 
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alignment 


requirements, 
2.3 


andh 
(Logical 
AND High) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


and 
(Logical 'AND) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


andnoth 
(Logical 
AND NOT High) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


and not 
(Logical 
AND NOT) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


ANSIIIEEE 
Standard, 
754 to 1985, 1.0 


AO 


fsr 
U-bit (update 
bit)! 2.2.8 


arbitration 


bus operation, 
5.2 


HOLD and HLDA, 5.2.1 


ATE (address 
translation 
enable) 


address 
translation, 
2.4 


dirbase 
format 
description, 
2.2.6 


AU 


fsr 
U-bit (update 
bit), 2.2.8 


B 


back-off 


bus cycle, 5.2.2 


late modes, 
5.2.2.3 


one-clock 
late mode, 5.2.2.4 


two-clock 
late mode, 5.2.2.5 


bc (Branch 
on CC) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


bc.t 
(Branch 
on CC, Taken) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 
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BE7#-BEO# 
(byte enables) 


signal description, 
4.2.4 


bear 
(bus error address 
register) 


format 
description, 
2.2.10 


BE (big endian) 


data cache, 
3.2.1 


epsr 
format 
description, 
2.2.4 


BEF (bus error flag) 


epsr 
format 
description, 
2.2.4 


BEn# 


BEU-BEO# 
(byte enables), 
4.2.4 


BERR (bus error) 


bear 
(bus error address 
register), 
2.2.10 


bus error trap, 2.8.7 


epsr 
BEF (bus error flag), 2.2.4 


psr 
IM (interupt 
mode), 2.2.3 


signal description, 
4.2.5 


big endian 
mode 


addressing, 
2.3 


bla (Branch 
on LCC and Add) 


epsr 
AI (trap on autoincrement 
instruction), 
2.2.4 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


BL (bus lock) 


dlrbase 
format 
description, 
2.2.6 


bnc (Branch 
on Not CC) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


bnc.t 
(Branch 
on Not CC, Taken) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


BOFF # (back-off) 


ADS# 
(address 
status), 
4.2.2 


BERR (bus error), 4.2.5 


bus arbitration, 
5.2 


dlrbase 
LB (late back-off 
mode), 2.2.6 


FLlNE# 
choice, 
5.3.5.1 


signal description, 
4.2.6 
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boundary 
scan 


register 
cell ordering, 
6.5 


BPR (bypass 
register) 


test, 6.2 


br (Branch 
Direct Unconditionally) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


BR (break read) 


debugging 
i860 XP microprocessor, 
2.9 


par format 
description, 
2.2.3 


BRDY # (burst ready) 


bear (bus error address 
register), 
2.2.10 


BERR (bus error), 4.2.5 


epar IL (interlock), 
2.2.4 


locked 
access, 
3.2.4.3 


signal description, 
4.2.7 


write-once 
policy, 3.2.4.2 


BREQ (bus request) 


signal description, 
4.2.8 


brl (Branch 
Indirect 
Unconditionally) 


instruction 
definition, 
10.1 


brl (Branch 
Indirect 
Unconditionally) 


instruction 
timing, 
10.3 


BS (bus or parity error trap in supervisory 
mode) 


epar format 
description, 
2.2.4 


BSR (boundary 
scan register) 


test, 6.2 


bte (Branch 
If Equal) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


btne (Branch 
If Not Equal) 


instruction 
timing, 
10.3 


buffer 


models, 
9.4 


size, selection 
with PEN #, 4.2.35, 5.5, 9.4.3 


burst cycles 
bus cycle, 5.f .2 


bus arbitration 


bus operation, 
5.2 
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bus and cache control 
unit 


function 
of, 1.0 


bus cycles 


back-off 
and restart, 
5.2.2 


bus operation, 
5.1 


type output 
pins, 4.1 


bus errors 


bear (bus error address 
register), 
2.2.10 


trap, 2.8.7 


bus operation 


i860 XP microprocessor, 
5.0 


BW (break write) 


debugging 
i860 XP microprocessor, 
2.9 


, 
par format 
description, 
2.2.3 


BYPASS# 
(bypass) 


signal description, 
4.2.9 


TAP encoding, 
6.3 


C 


CACHE# 
(cacheability) 


BE7# -BEO# 
(byte enables), 
4.2.4 


signal description, 
4.2.10 


cache 


address 
translation, 
3.1 


consistency 
protocol, 
3.2.4 


external 
secondary, 
1.0 


inquiry cycles 
(snooping), 
5.3 


internal 
instruction 
and data, 3.2 


invalidating 
entries, 
3.3 


on-Chip, 3.0 


replacement 
algorithm, 
3.2.3 


cacheability 


address 
translation 
caches, 
3.1 


consistency, 
3.3.4 


Calli (Indirect 
Subroutine 
Call) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


call (Subroutine 
Call) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


capture-OR 


test state, 6.4.5 
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capture-IR 


test state, 6.4.11 
cc (condition 
code) 


par format 
description, 
2.2.3 


ccr (concurrency 
control 
register) 


OCCU initialization, 
2.5.1 


format 
description, 
2.2.12 


CCUBASE 


ccr (concurrency 
control 
register), 
2.2.12 


OCCU addressing, 
2.5.2 


OCCU initialization, 
2.5.1 


CO (cache 
disable) 


bypassing 
instruction 
and data cache, 
3.3 


page-table 
entries 
(PTEs), 2.4.4.5 


CLK (clock) 


signal description, 
4.2.11 
eo (CCU on) 


ccr (concurrency 
control 
register), 
2.2.12 


color intensity 
shading 


pixel formats, 
2.1.4 


compatibility 


pipe lined cycles, 
5.1.3 


software 
changes, 
10.5.1 


concurrency 
control 
unit (CCU) 


ccr (concurrency 
control 
register), 
2.2.12 


detached 
CCU, 2.5 


NEWCURR 
register, 
2.2.13 


, 


consistency 


address 
space, 3.3.1 


cacheability, 
3.3.4 


instruction 
cache, 
3.3.2 


internal 
cache, 
3.3 


load pipe, 3.3.5 


page table, 3.3.3 


protocol, 
3.2.4 


write-once 
policy, 3.2.4.2 


control 
registers 


register 
set, 2.2 
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copy-back 
policy 


data cache 
update, 
3.2.1.1 


core execution 
unit 


function 
of, 1.0 


CS8 (code size 8-bit) 


BE7#-BEO# 
(byte enables), 
4.2.4 


dirbase 
format 
description, 
2.2.6 


CTRL-format 


instructions, 
10.2.2 


CTYP (cycle type) 


signal description, 
4.2.12 


current 
mode 


high vs. normal, 4.2.35, 
5.5, 9.3, 9.4.3 


cycles 


back-off, 
5.2.2.1 


burst cycles, 
5.1.2 


interrupt 
~cknowledge, 
5.1.4 


pipelined, 
5.1.3 


restart, 5.2.2.2 


special 
bus, 5.1.5 


D 


063-00 
(data pins) 


signal description, 
4.2.14 


data access 


fault, 2.8.5 


data cache 


bypassing, 
3.3 


flushing, 
3.3 


function 
of, 1.0 


operation, 
3.2 


organization, 
3.2.1 


states, 3.2.4.1 


update 
policies, 
3.2.1.1 


data types 
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2.1 


OAT (data access 
trap) 


debugging 
i860 XP microprocessor, 
2.9 


par format 
description, 
2.2.3 
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db (data breakpoint 
register) 


debugging 
i860 XP microprocessor, 
2.9 


format 
description, 
2.2.5 


psr BR (break read) and BW (break write), 2.2.3 


D bit 


dual-instruction 
mode, 2.6.2 


D/C# 
(data/code) 


signal description, 
4.2.13 


D.C. characteristics 


electrical 
data, 9.2 


DCCU (detached 
concurrency 
control 
unit) 


addressing, 
2.5.2. 


ccr (concurrency 
control 
register), 
2.2.12 


function 
of, 1.0 


initialization, 
2.5.1 


internals, 
2.5.3 


DCS (data cache 
size) 


epsr format 
description, 
2.2.4 


D (dirty) 


page-table 
entries 
(PTEs), 2.4.4.6 


debugging 


i860 XP microprocessor, 
2.9 


deferred-write 
policy 


data cache 
update, 
3.2.1.1 


denormal 


special floating-point 
values, 
2.1.3 


Detached 


STA'r register 
description, 
2.2.14 


detached 
CCU 


i860 XP microprocessor, 
2.5 


d_fnop 


dual-instruction 
mode, 2.6.2 


DID (device 
identification 
register) 


test, 6.2 


DIR 


virtual address, 
2.4.2 
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dlrbase 
(directory 
base register) 


address 
space consistency, 
3.3.1 


cache replacement 
algorithm, 
3.2.3 


DCCU initialization, 
2.5.1 


format 
description, 
2.2.6 


instruction 
cache consistency, 
3.3.2 


page directory, 
2.4.3 


page table consistency, 
3.3.3 


P (present) 
bit, 2.4.4.2 


disassemblers 


big endian 
mode, 2.3 


DI (trap on delayed 
instruction) 


epsr format 
description, 
2.2.4 


DM (dual instruction 
mode) 


psr format 
description, 
2.2.3 


DO (detached 
only) 


ccr (concurrency 
control 
register), 
2.2.12 


double-precision 
real 


data type, 2.1.3 


double 
real value 


floating-point 
registers, 
2.1.3 


double-shift 
instruction 


par SC (shift count), 
2.2.3 


DP7-DPO 
(data parity) 


signal description, 
4.2.15 


DPC (data-path 
control) 


dual-operation 
instructions, 
2.6.3 


DPS (DRAM page size) 


dirbase 
format 
description, 
2.2.6 


DS (delayed 
switch) 


psr format 
description, 
2.2.3 


DTB (directory 
table base) 


dlrbase 
format 
description, 
2.2.6 


dual-instruction 
mode 


parallellism, 
2.6.2 


dual-operation 
instructions 


floating-point, 
2.6.3 


E 


EADS# 


AHOLD 
(address 
hold), 4.2.3 


EADS# 
(external 
address 
status) 


signal description, 
4.2.16 


epsr (extended 
processor 
status register) 


data cache, 
3.2.1 


DCCU internals, 
2.5.3 


format 
description, 
2.2.4 


page-table 
entries 
(PTEs), 2.4.4.3 


EWBE # (external 
write buffer empty) 


epsr SO (strong 
ordering), 
2.2.4 


signal description, 
4.2.17 


exit1-DR 


test state, 6.4.7 


exit1-IR 


test state, 6.4.13 


exit2-DR 


test state, 6.4.9 


exit2-IR 


test state, 6.4.15 


EXTEST 


TAP encoding, 
6.3 


F 


faddp 
(Add with Pixel Merge) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fadd.p 
(Floating-Point 
Add) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


faddz 
(Add with Z Merge) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


famov.r 
(Floating-Point 
Adder Move) 


instruction 
definition, 
10.1 


instruction 
timing, 10.3 
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fault 


address 
translation, 
2.4.6 


data access, 
2.8.5 


floating-point, 
2.8.3 


instruction 
access, 
2.8.4 


result exception 
fault, 2.8.3.1 


source 
exception 
fault, 2.8.3.1 


fladd.w 
(Long-Integer 
Add) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fir (fault instruction 
register) 


epsr 01 (trap on delayed 
instruction), 
2.2.4 


format 
description, 
2.2.7 


flsub.w 
(Long-Integer 
Subtract) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


flx.v (Floating-Point 
to Integer Conversion) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fld.y (Floating-Point 
Load) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


FLlNE # (flush line) 


BOFF# 
choice, 
5.3.5.1 


signal description, 
4.2.18 


floating-point 


adder, 
1.0 


control 
unit, 1.0 


fault, 2.8.3 


instruction 
encoding, 
10.2.3 


multiplier, 
1.0 


register 
file, 2.2.2 


flush (Cache Flush) 


cache replacement 
algorithm, 
3.2.3 


dirbase 
RB (replacement 
block), 
2.2.6 


flushing 
data cache, 
3.3 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


requirements 
summary, 
3.3.6 
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fmlow.dd 
(Floating·Point 
Multiply 
Low) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fmov.r 
(Floating· Point Reg-Reg 
Move) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fmul.p 
(Floating-Point 
Multiply) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fnop (Floating-Point 
No Operation) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


form (OR with MERGE 
Register) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


frcp.p 
(Floating-Point 
Reciprocal) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


frsqr.p 
(Floating-Point 
Reciprocal 
Square 
Root) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fsr (floating-point 
status register) 


format 
description, 
2.2.8 


pipelining 
status information, 
2.6.1.2 


fst.y (Floating-Point 
Store) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fsub.p 
(Floating-Point 
Subtract) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


FTE (floating-point 
trap enable) 


fsr format 
description, 
2.2.8 
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FT (floating-point 
trap) 


psr format 
description, 
2.2.3 


ftrunc.v 
(Floating-Point 
to Integer Conversion) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fxfr (Transfer 
F-P to Integer 
Register) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fzchkl 
(32-Bit Z-Buffer 
Check) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


fzchks 
(16-Bit Z-Buffer 
Check) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


FZ (flush zero) 


fsr format 
description, 
2.2.8 


G 


graphics 
unit 


function 
of, 1.0 


H 


hardware 
interface 


i860 XP microprocessor, 
4.0 


HIT 11 (cache 
inquiry hit) 


signal description, 
4.2.19 


HITMII 
(hit modified 
line) 


internal 
cache 
consistency, 
3.3 


signal description, 
4.2.20 


HLDA (bus hold acknowledge) 


signal description, 
4.2.21 


HOLD (bus hold) 


bus arbitration, 
5.2 


signal description, 
4.2.22 


i860 XP microprocessor 


bus operation, 
5.0 


functional 
description, 
1.0 


hardware 
interface, 
4.0 


instruction 
set, 8.0 


mechanical 
data, 7.0 


on-chip 
caches, 
3.0 


programming 
interface, 
2.0 


testability, 
6.0 


IAT (instruction 
access 
trap) 


psr format 
description, 
2.2.3 


10COOE 


TAP encoding, 
6.3 


IEEE Standard 


for Binary Floating-Point 
Arithmetic, 
1.0 


P1149.1 106 testability, 
6.0 


IL (interlock) 


epsr format 
description, 
2.2.4 


IM (interrupt 
mode) 


psr format 
description, 
2.2.3 


indefinite 


special floating-point 
values, 
2.1.3 


inexact 
result 


result exception 
fault, 2.8.3.2 


initialization 


at RESET, 
5.5 


infinity 


special 
floating-point 
values, 
2.1.3 


IN (interrupt) 


psr format 
description, 
2.2.3 


InLoop 


STAT register 
description, 
2.2.14 


inquiry cycles 


data cache states, 
3.2.4.1 


for line being cached, 
5.3.2.1 


tor line being replaced, 5.3.2.2 


snooping, 
5.3 


write-back, 
5.3.1 
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instruction 


access 
fault, 2.8.4 


characteristics, 
10.4 


CTRL-format, 
10.2.2 


definitions, 
10.1 


dual-operation, 
2.6.3 


encoding 
floating-point, 
10.2.3 


fault, 2.8.2 


format 
and encoding, 
10.2 


REG-format, 
10.2.1 


timing, 
10.3 


instruction 
cache 


bypassing, 
3.3 


consistency, 
3.3.2 


function 
of, 1.0 


operation, 
3.2 


organization, 
3.2.2 


instruction 
set 


abbreviations, 
10.0 


extensions 
of i860 XR, 2.6 


i860 XP microprocessor, 
8.0 


INT/CS8 
(interrupt/code-size 
8-bits) 


signal description, 
4.2.24 


integer 


data type, 2.1.1 


register file, 2.2.1 


internal 
cache 


consistency, 
3.3 


interrupt 


acknowledge 
cyctes, 5.1.4 


i860 XP microprocessor, 
2.8 


trap, 2.8.8 


INT (interrupt) 


epsr format 
description, 
2.2.4 


Intovr (Software 
Trap on Integer Overflow) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


INTpin 


epsr INT (interrupt), 
2.2.4 


psr IM (interrupt 
mode), 2.2.3 
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invalidation 
requirements 


summary, 
3.3.6 


INV (invalidate) 


signal description, 
4.2.23 


IR (instruction 
register) 


test, 6.3 


IRP (integer 
graphics) 


fsr format 
description, 
2.2.8 


ITI (cache 
and TLB invalidate) 


dirbaae 
format 
description, 
2.2.6 


IT (instruction 
trap) 


par format 
description, 
2.2.3 


Ixfr (Transfer 
Integer to F-P Register) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


K 


KBO, KB1 (cache 
block) 


signal description, 
4.2.25 


KEN# 
(cache 
enable) 


BE7#-BEO# 
(byte enables), 
4.2.4 


bypassing 
instruction 
and data cache, 
3.3 


DCCU addressing, 
2.5.2 


internal 
instruction 
and data caches, 
3.2 


locked 
access, 
3.2.4.3 


signal description, 
4.2,26 


KI 


special 
purpose 
register 
description, 
2.2.9 


KNF (kill next floating-point 
instruction) 


psr format 
description, 
2.2.3 


KR 


special 
purpose 
register 
description, 
2.2.9 


L 


LB (late back-off 
mode) 


dlrbase 
format 
description, 
2.2.6 


LCC (loop condition 
code) 


psr CC (condition 
code), 2.2.3 
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Id.e (Load from Control 
Register) 


fir (fault instruction 
register), 
2.2.7 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


Idlnt.x (Load Interrupt 
Vector) 


big endian 
mode, 2.3 


epsr BE (big endian), 
2.2.4 


extensions 
of i860 XR, 2.6 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


Idlo.x (Load 1/0) 


big endian 
mode, 2.3 


extensions 
of i860 XR, 2.6 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


Id.l 
. 


flushing 
data cache, 
3.3 


Id.x (Load Integer) 


DCCU internals, 
2.5.3 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


LEN (data length) 


signal description, 
4.2.27 


LFBSR (linear feedback 
shift register) 


cache 
replacement 
algorithm, 
3.2.3 


little endian 
mode 


addressing, 
2.3 


load pipe 


consistency, 
3.3.5 


LOCK# 
(address 
lock) 


A (accessed) 
bit, 2.4.4.6 


cycle attribute, 
5.4 


dlrbase 
BL (bus lock), 2.2.6 


signal description, 
4.2.28 


lock (Begin Interlocked 
Sequence) 


dlrbase 
BL (bus lock), 2.2.6 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


locked 
access, 
3.2.4.3 


locked 
access 


cache consistency, 
3.2.4.3 


lock instruction 


epsr IL (interlock), 
2.2.4 


lock protocol 


instruction 
fault, 2.8.2.1 


LRPO (load pipe result precision) 


fsr format 
description, 
2.2.8 


LRP1 (load pipe result precision) 


fsr format 
description, 
2.2.8 


M 


MA 


fsr U-bit (update 
bit), 2.2.8 


mechanical 
data 


i860 XP microprocessor, 
7.0 


MERGE 


special 
purpose 
register 
description, 
2.2.9 


MESI 


cache 
consistency 
protocol, 
3.2.4 


write cycle reordering, 
5.3.3 


MI 


fsr U-bit (update 
bit), 2.2.8 


M/IO,*, 
(memory-I/O) 


signal description, 
4.2.29 


MO 


far U-bit (update 
bit), 2.2.8 


mev 
(Constant-to-Register 
Move) 


instruction 
definition, 
10.1 


moy (Register-Register 
Move) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


MU 


fsr U-bit (update 
bit), 2.2.8 
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N 


NA,*, (next address 
request) 


locked 
access, 
3.2.4.3 


signal description, 
4.2.30 


write-once 
policy, 3.2.4.2 


NaN (Not a Number) 


special floating-point 
values, 
2. .3 


NENE,*, 
(next near) 


dlrbase 
DPS (DRAM 
page size), 2.2.6 


signal description, 
4.2.31 


Nested 


STAT register 
description, 
2.2.14 


NEWCURR 
register 


DCCU internals, 
2.5.3 


format 
description, 
2.2.13 


non pipe lined cycle 


bus cycle, 5.1.3 


nop (Core-Unit 
No Operation) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


o 


offset 


addressing 
modes, 
2.7 


virtual address, 
2.4.2 


OF (overflow 
flag) 


epar format 
description, 
2.2.4 


on-chip 
caches 


i860 XP microprocessor, 
3.0 


ordinal 


data type, 2.1.2 


orh (Logical 
OR High) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


or (Logical 
OR) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 
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output 
pins 


pins overview, 
4.1 


overflow 


result exception 
fault, 2.8.3.2 


p 


package 


thermal 
specifications, 
8.0 


PAGE 


virtual address, 
2.4.2 


page directory 


little endian 
mode, 2.3 


page tables, 
2.4.3 


paged virtual-address 
space 


addressing, 
2.3 


page frame 


address, 
2.4.4.1 


physical 
main memory, 
2.4.1 


page table 


combining 
protection, 
2.4.4.8 


consistency, 
3.3.3 


entry format 
description, 
2.4.4 


format 
description, 
2.4.3 


little endian 
mode, 2.3 


for trap handlers, 
2.4.4.7 


paging 
unit 


address 
translation 
caches, 
3.1 


function 
of, 1.0 


parallelism 


dual-instruction 
mode, 2.6.2 


use of, 2.6 


parity error 


bear (bus error address 
register), 
2.2.10 


par IM (interrupt 
mode), 2.2.3 


trap, 2.8.6 


pause-OR 


test state, 6.4.8 


pause-IR 


test state, 6.4.14 
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PBM (page-table 
bit mode) 


epsr format 
description, 
2.2.4 


PCO (page cache disable) 


bypassing 
instruction 
and data cache, 
3.3 


CD (cache 
disable), 
2.4.4.5 


signal description, 
4.2.32 


PCHK# 
(parity check) 


signal description, 
4.2.33 


PCYC (page cycle) 


signal description, 
4.2.34 


PEF (parity error flag) 


epsr format 
description, 
2.2.4 


PEN # (parity enable) 


bear (bus error address 
register), 
2.2.10 


parity error trap, 2.8.6 


signal description, 
4.2.35 


performance 
optimizations 


software 
compatibility, 
10.5.2 


pfaddp 
(Pipelined 
Add with Pixel Merge) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfadd.p 
(Pipe lined Floating-Point 
Add) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfaddz 
(Pipelined 
Add with Z Merge) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfamov.r 
(Pipe lined Floating-Point 
Adder 
Move) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfam.p 
(Pipelined 
Floating-Point 
Add and Multiply) 


dual-operation, 
2.6.3 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


special 
purpose 
registers, 
2.2.9 


pfeq.p 
(Pipelined 
Floating-Point 
Equal Compare) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 
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pfgt.p 
(Pipelined 
Floating-Point 
Greater-Than 
Compare) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfladd.w 
(Pipelined 
Long-Integer 
Add) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pflsub.w 
(Pipelined 
Lonq-Inteqar 
Subtract) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pflx.v (Pipelined 
Floating-Point 
to Integer 
Conversion) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfld (Pipelined 
Flcatlnq-Point 
Load) 


epsr PT (trap on pipeline 
use), 2.2.4 


load pipe consistency, 
3.3.5 


pipeline 
loads, 2.6.1.5 


pfld.q 


extensions 
of i860 XR, 2.6 


pfld.y (Pipelined 
Floating-Point 
Load) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfle.p (Pipelined 
F-P Less-Than 
or Equal Compare) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfmam.p 
(Pipelined 
Floating-Point 
Add and Multiply) 


dual operation, 
2.6.3 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


special 
purpose 
registers, 
2.2.9 


pfmov.r 
(Pipelined 
Floating·Point 
Reg·Reg 
Move) 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


pfmsm.p 
(Pipelined 
Floating-Polnt 
Subtract 
and Multiply) 


dual operation, 
2.6.3 


instruction 
definition, 
10.1 


instruction 
timing, 
10.3 


special 
purpose 
registers, 
2.2.9 
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pfmul3.dd 
(Three-Stage 
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• 
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Up 
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Results per 
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Arithmetic 
-lnteI386™/486 
Microprocessor 
Data 
Formats 
and Page Table Entries 
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Pin Grid 
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Outlines and Dimensions, 
order 
#231369) 
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Easy to Use 
- 
On-Chip Debug Register 
-Assembler, 
L1nker, Simulator, 
Debugger, 
C and FORTRAN 
Compilers, 
FORTRAN 
Vectorlzer, 
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Scalar and Vector 
Math Libraries 
for 
both OS/2* 
and UNIX* 
Environments 


The Intel iB60™ 
XR Microprocessor 
(order codes ABOB60XR-25, 
ABOB60XR-33 
and ABOB60XR-40) 
delivers 
supercomputing 
performance 
in a single VLSI component. 
The 64-bit design of the iB60 XR microprocessor 
balances 
integer, 
floating 
point, and graphics 
performance 
for applications 
such as engineering 
workstations, 


scientific 
computing, 
3-0 graphics 
workstations, 
and multi user systems. 
Its parallel architecture 
achieves 
high 
throughput 
with RISC design techniques, 
pipelined 
processing 
units, wide data paths, 
large on-chip 
caches, 


million-transistor 
design, and fast one-micron 
CHMOS 
IV silicon technology . 
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1.0 FUNCTIONAL 
DESCRIPTION 


As shown by the block diagram on the front page, 
the i860 XR microprocessor consists of 9 units: 


1. Core Execution Unit 
2. Floating-Point Control Unit 
3. Floating-Point Adder Unit 
4. Floating-Point Multiplier Unit 
5. Graphics Unit 
6. Paging Unit 
7. Instruction Cache 
8. Data Cache 
9. Bus and Cache Control Unit 


The core execution unit controls overall operation of 
the i860 XR microprocessor. The core unit executes 
load, store, integer, bit, and control-transfer opera- 
tions, and fetches instructions for the floating-point 
unit as well. A set of 32 x 32-bit general-purpose 
registers are provided for the manipulation of integer 
data. Load and store instructions move 8-, 16-, and 
32-bit data to and from these registers. Its full set of 
integer, logical, and control-transfer instructions give 
the core unit the ability to execute complete systems 
software and applications programs. A trap mecha- 
nism provides rapid response to exceptions and ex- 
ternal interrupts. Debugging is supported by the abili- 
ty to trap on data or instruction reference. 


The floating-point hardware is connected to a sepa- 
rate set of floating-point registers, which can be 
accessed as 16 x 64-bit registers, or 32 x 32-bit reg- 
isters. Special load and store instructions can also 
access these same registers as 8 x 128-bit registers. 
All floating-point instructions use these registers as 
their source and destination operands. 


The floating-point control unit controls both the float- 
ing-point adder and the floating-point multiplier, issu- 
ing instructions, 
handling all source and result 
exceptions, and updating status bits in the floating- 
point status register. The adder and multiplier can 
operate in parallel, producing up to two results per 
clock. The floating-point data types, floating-point in- 
structions, and exception handling all support the 
IEEE Standard for Binary Floating-Point Arithmetic 
(ANSI/IEEE Std 754-1985). 


The floating-point adder performs addition, subtrac- 
tion, comparison, and conversions on 64- and 32-bit 
floating-point values. An adder instruction executes 
in three clocks; however, in pipelined mode, a new 
result is generated every clock. 


The floating-point multiplier performs floating-point 
and integer multiply and floating-point reciprocal op- 
erations on 64- and 32-bit floating-point values. A 
multiplier instruction executes in three to four clocks; 
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however, in pipelined mode, a new result can be 
generated every clock for single-precision and every 
other clock for double precision. 


The graphics unit has special integer logic that sup- 
ports three-dimensional drawing in a graphics frame 
buffer, with color intensity shading and hidden sur- 
face elimination via the Z-buffer algorithm. The 
graphics unit recognizes the pixel as an 8-, 16-, or 
32-bit data type. It can compute individual red, blue, 
and green color intensity values within a pixel; but it 
does so with parallel operations that take advantage 
of the 64-bit internal word size and 64-bit external 
bus. The graphics features of the i860 XR micro- 
processor assume that the surface of a solid object 
is drawn with polygon patches whose shapes ap- 
proximate the original object. The color intensities of • 
the vertices of the pOlygonand their distances from 
the viewer are known, but the distances and intensi- 
ties of the other points must be calculated by inter- 
polation. The graphics instructions of the i860 XR 
microprocessor directly aid such interpolation. 


The paging unit implements protected, paged, virtual 
memory via a 64-entry, four-way set-associative 
memory called the TLB (Translation Lookaside Buff- 
er). The paging unit uses the TLB to perform the 
translation of logical address to physical address, 
and to check for access violations. The access pro- 
tection scheme employs two levels of privilege: user 
and supervisor. 


The instruction cache is a two-way set-associative 
memory of four Kbytes, with 32-byte blocks. It trans- 
fers up to 64 bits per clock (320 Mbyte/sec 
at 
40 MHz). 


The data cache is a two-way set-associative memo- 
ry of eight Kbytes, with 32-byteblocks. 
It transfers 
up to 128 bits per clock (640 Mbyte/sec at 40 MHz). 
The i860 XR microprocessor normally uses write- 
back caching, i.e. memory writes update the cache 
(if applicable) without necessarily updating memory 
immediately; however, caching can be inhibited by 
software where necessary. 


The bus and cache control unit performs data and 
instruction accesses for the core unit. It receives cy- 
cle requests and specifications from the core unit, 
performs the data-cache or instuction-cache miss 
processing, controls TLB translation, and provides 
the interface to the external bus. Its pipelined struc- 
ture supports up to three outstanding bus cycles. 


2.0 PROGRAMMING 
INTERFACE 


The programmer-visible aspects of the architecture 
of the i860 XR microprocessor include data types, 
registers, instructions, and traps. 
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2.1 Data Types 


The i860 XR microprocessor provides operations for 
integer and floating-point data. Integer operations 
are performed on 32-bit operands with some support 
also for 64-bit operands. Load and store instructions 
can reference 8-bit, 16-bit, 32-bit, 64-bit, and 128-bit 
operands. Floating-point operations are performed 
on IEEE-standard 32- and 64-bit formats. Graphics 
oriented instructions operate on arrays of 8-, 16-, or 
32-bit pixels. 


2.1.1 INTEGER 


An integer is a 32-bit signed value in standard two's 
complement form. A 32-bit integer can represent a 
value 
in the 
range 
-2,147,483,648 
(-231) 
to 
2,147,483,647 (+ 231 - 
1). Arithmetic operations on 
8- and 16-bit integers can be performed by sign-ex- 
tending the 8- or 16-bit values to 32 bits, then using 
the 32-bit operations. 


There are also add and subtract instructions that op- 
erate on 64-bit long integers. 


Load and store instructions may also reference (in 
addition to the 32- and 64-bit formats previously 
mentioned) 8- and 16-bit items in memory. When an 
8- or 16-bit item is loaded into a register, it is con- 
verted to an integer by sign-extending the value to 
32 bits. When an 8- or 16-bit item is stored from a 
register, the corresponding number of low-order bits 
of the register are used. 
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2.1.2 ORDINAL 


Arithmetic operations are available for 32-bit ordi- 
nals. An ordinal is an unsigned integer. An ordinal 
can 
represent 
values 
in 
the 
range 
0 
to 
4,294,967,295 (+232 
- 
1). 


Also, there are add and subtract instructions that op- 
erate on 64-bit ordinals. 


2.1.3 SINGLE- AND DOUBLE-PRECISION REAL 


Figure 2.1 shows the real number formats. A single- 
precision real (also called "single real") data type is 
a 32-bit binary floating-point number. Bit 31 is the 
sign bit; bits 30..23 are the exponent; and bits 22..0 
are the fraction. In accordance with ANSI/IEEE 
standard 754, the value of a single-precision real is 
defined as follows: 
1. If e = 0 and f -:/= 0 or e = 255 then generate a 
floating-point 
source-exception trap when en- 
countered in a floating-point operation. 


2. If 0 < e < 255, then the value is (- 1)S x 1.1x 


29-127. 


3. If e = 0 and f = 0, then the value is signed zero. 


A double-precision real (also called "double real") 
data type is a 64-bit binary floating-point number. Bit 
63 is the sign bit; bits 62..52 are the exponent; and 
bits 51..0 are the fraction. In accordance with ANSII 
IEEE standard 754, the value of a double-precision 
real is defined as follows: 
1. If e = 0 and f -:/= 0 or e = 2047, then generate a 
floating-point 
source-exception trap when en- 
countered in a floating-point operation. 
2. If 0 < e < 2047, then the value is (-1)5 
x 1.1x 


29-1023. 


Double-Precision Real 
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Single-Precision Real 


31 
23 
0 
EJr------:;-1 ---------.::..1 


t 
L fRACTION 
l 
-'================= 
EXPONENT 
SIGN 


63~ 
~5~2 
0 
EJ 
Ir------------..,.;;.I 


I 
L FRACTION 
'.__ -================ 
EXPONENT 
- 
SIGN 
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Figure 2.1. Real Number Formats 
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3. If e = 0 and f = 0, then the value is signed zero. 


The special values infinity, NaN ("Not a Number"), 
indefinite, and denormal generate a trap when en- 
countered. The trap handler implements IEEE-stan- 
dard results. 


A double real value occupies an even/odd pair of 
floating-point registers. Bits 31..0 are stored in the 
even-numbered floating-point register; bits 63..32 
are stored in the next higher odd-numbered floating- 
point register. 


2.1.4 PIXEL 


A pixel may be 8, 16, or 32 bits long depending on 
color and intensity resolution requirements. Regard- 
less of the pixel size, the i860 XR microprocessor 
always operates on 64 bits worth of pixels at a time. 
The pixel data type is used by two kinds of instruc- 
tions: 
• The selective pixel-store instruction that helps im- 


plement hidden surface elimination. 
• The pixel add instruction that helps implement 
3-~ color intensity shading. 


To perform color intensity shading efficiently in a va- 
riety of applications, the i860 XR microprocessor de- 
fines three pixel formats according to Table 2.1. 


Figure 2.2 illustrates one way of assigning meaning 
to the fields of pixels. These assignments are for 
illustration purposes only. The i860 XR microproces- 
sor defines only the field sizes, not the specific use 
of each field. Other ways of using the fields of pixels 
are possible. 
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Table 2.1. Plxel Formats 


Plxel 
Bits of 
Bits of 
Bits of 
Bits of 
Other 
Size 
Color 1 
Color2 
Color3 
Attribute 
(In bits) 
Intensity 
Intensity 
Intensity 
(Texture) 


8 
N (~ 8) bits of intensity' 
8-N 
16 
6 
I 


6 
I 


4 
32 
8 
8 
8 
8 


The intensity attribute fields may be assigned to colors In 
any order convenient to the application. 


·With 8-bit pixels, up to 8 bits can be used for intensity; the 
remaining bits can be used for any other attribute, such as 
color. The intensity bits must be the low-order bits of the 
pixel. 


2.2 Register Set 
• 


As Figure 2.3 shows, the i860 XR microprocessor 
has the following registers: 


• An integer register file 
• A floating-point register file 
• Six control registers (psr, epsr, db, dlrbase, fir, 
and fsr) 
• Four special-purpose registers (KR, KI, T, and 
MERGE) 


The control registers are accessible only by load 
and store control-register instructions; the integer 
and floating-point registers are accessed by arithme- 
tic operations and load and store instructions. The 
special-purpose registers KR, KI, T, and MERGE are 
used by a few specific instructions. 


7 
5 
0 


8-BIT 
PIXEL8 
I 


15 
9 
3 
0 


16-BIT 
PIXEL I 
R 
I 
G 
I 
B 
I 


32-BIT 
PIXEL 


31 
23 
15 
7 
0 
I 
R 
I 
G 
I 
B 
I 
T 
I 
240296-4 
I-Intensity, 
R-Red 
intensity, G-Green 
intensity, B-Blue 
intensity, C-Color, 
T-Texture 


These assignments of specific meanings to the fields of pixels are for illustration purposes only. Only the field sizes are 
defined, not the specific use of each field. 


Figure 2.2. Plxel Format Example 
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2.2.1 INTEGER 
REGISTER 
FILE 


There are 32 integer registers, each 32 bits wide, 
referred to as rO through r31, which are used for 
address computation and scalar integer computa- 
tions. Register rO always returns zero when read, 
independently of what is stored in it. 


2.2.2 FLOATING·POINT 
REGISTER 
FILE 


There are 32 floating-point registers, each 32-bits 
wide, referred to as fO through f31, which are used 
for floating-point computations. Registers fO and f1 
always return zero when read, independently of 
what is stored in them. The floating-point registers 
are also used by a set of graphics operations, pri- 
marily for 3D graphics computations. 


When accessing 64-bit floating-point or integer val- 
ues, the i860 XR microprocessor uses an even/odd 
pair of registers. When accessing 128-bit values, it 
uses an aligned set of four registers (fO,f4, f8, ... , 
f28). The instruction must designate the lowest reg- 
ister number of the set of registers containing 64- or 
128-bit values. Misaligned register numbers produce 
undefined results. The register with the lowest num- 
ber contains the least significant part of the value. 
For 128-bit values, the register pair with the lower 
numbers contain the least significant 64 bits while 
the register pair with the higher numbers contain the 
most significant 64 bits. 
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The 128-bit load and store instructions, along with 
the 128-bit data path between the floating-point reg- 
isters and the data cache help to sustain the extraor- 
dinarily high rate of computation. 


2.2.3 PROCESSOR 
STATUS 
REGISTER 


The processor status register (psr) contains miscel- 
laneous state information for the current process. 
Figure 2.4 shows the format of the psr. 


• BR (Break Read) and BW (Break Write) enable a 
data access trap when the operand address 
matches the address in the db register and a 
read or write (respectively) occurs. 


• Various instructions set CC (Condition Code) ac- 
cording to tests they perform. The branch-on- 
condition-code instructions test its value. The bla 
instruction sets and tests LCC (Loop Condition 
Code). 


• IM (Interrupt Mode) enables external interrupts if 
set; disables interrupts if clear. 


• U (User Mode) is set when the i860 XR micro- 
processor is executing in user mode; it is clear 
when the i860 XR microprocessor is executing in 
supervisor mode. In user mode, writes to some 
control registers are inhibited. This bit also con- 
trols the memory protection mechanism. See 
section 2.4.4.3 for a description of memory pro- 
tection in user and supervisor modes. 


31 


32 


CONTROL 
REGISTERS 
32 


64 


32 
32 


32 


ADDRESS 
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Figure 2.3. Registers 
and Data Paths 
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BREAK 
READ 


BREAK 
WRITE 
-------------------------------------------------, 
CONDmON 
CODE 
---------------------------------------------, 
LOOP 
CONDmON 
CODE 
-----------:------------------------------, 
INTERRUPT 
MODE 
-------------'---------------------------, 
PREVIOUS 
INTERRUPT 
MODE 
---------------------------------, 
USER 
MODE 


PREVIOUS 
USER 
MODE 
I 


INSTRUCTION 
TRAP -------------------------------1 
INTERRUPT 
-------------------------------, 


INSTRUCTION 
ACCESS 
TRAP -----------------------1 
DATA ACCESS 
TRAP 
I 
fLOATING-POINT 
TRAP 
'1 


g~~Y;?S:U~~~ON 
MODE 
1 


31 


PM 


KILL 
NEXT 
fLOATING-POINT 
INSTRUCTION 


(RESERVED) 
SHIfT 
COUNT 
PIXEL 
SIZE 
PIXEL 
MASK 


• Can be changed 
only from supervisor 
level. 
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Figure 2.4 Processor 
Status Register 


, 
INTERLOCK ---------------------------1 
WRITE-PROTECT 
MODE 
1 
DATA CACHE SIZE 
--------- 


31 


(RESERVED) 


t t t 
L- 
(RESERVED) 
-------------------PAGE-TABlE 
BIT WODE 
---------------------- 
BIG EHDIAH 
WODE 
----------------------- 
OVERflOW 
fLAG 


'Can 
be changed 
only from supervisor 
level. 
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Figure 2.5 Extended 
Processor 
Status Register 


• PIM (Previous Interrupt Mode) and PU (Previous 
User Mode) save the corresponding status bits 
(IM and U) on a trap, because those status bits 
are changed when a trap occurs. They are re- 
stored into their corresponding status bits when 
returning from a trap handler with a branch indi- 
rect instruction when a trap flag is set in the psr. 
• FT (Floating-Point Trap), DAT (Data Access 
Trap), IAT (Instruction Access Trap), IN (Inter- 
rupt), and IT (Instruction Trap) are trap flags. 
They are set when the corresponding trap condi- 
tion occurs. The trap handler examines these bits 
to determine which condition or conditions have 
caused the trap. 


o 


PROCESSOR 
TYPE 


• DS (Delayed Switch) is set if a trap occurs during 
the instruction before dual-instruction mode is en- 
tered or exited. If DS is set and DIM (Duallnstruc- 
tion Mode) is clear, the i860 XR microprocessor 
switches to dual-instruction mode one instruction 
after returning from the trap handler. If DS and 
DIM are both set, the i860 XR microprocessor 
switches to single-instruction mode one instruc- 
tion after returning from the trap handler. 
• When a trap occurs, the i860 XR microprocessor 
sets DIM if it is executing in dual-instruction 
mode; it clears DIM if it is executing in single-in- 
struction mode. If DIM is set after returning from a 
trap handler, the i860 XR microprocessor re- 
sumes execution in dual-instruction mode. 
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• When KNF (Kill Next Floating-Point Instruction) is 


set, the next floating-point instruction is sup- 
pressed (except that its dual-instruction mode bit 
is interpreted). A trap handler sets KNF if the 
trapped floating-point instruction should not be 
reexecuted. 
• SC (Shift Count) stores the shift count used by 


the last right-shift instruction. It controls the num- 
ber of shifts executed by the double-shift instruc- 
tion. 
• PS (Pixel Size) and PM (Pixel Mask) are used by 


the pixel-store instruction and by the graphics in- 
structions. The values of PS control pixel size as 
defined by Table 2.2. The bits in PM correspond 
to pixels to be updated by the pixel-store instruc- 
tion pst.d. The low-order bit of PM corresponds 
to the low-order pixel of the 64-bit source oper- 
and of pst.d. The number of low-order bits of PM 
that are actually used is the number of pixels that 
fit into 64-bits, which depends upon PS. If a bit of 
PM is set, then pst.d stores the corresponding 
pixel. Refer also to the pst.d instruction in section 
8. 


Table 2.2. Values of PS 


Value 
PlxelSlze 
Plxel Size 
In bits 
In bytes 


00 
8 
1 
01 
16 
2 


10 
32 
4 
11 
(undefined) 
(undefined) 
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2.2.4 EXTENDED PROCESSOR STATUS 
REGISTER 


The extended processor status register (epsr) con- 
tains additional state information for the current pro- 
cess beyond that stored in the psr. Figure 2.5 shows 
the format of the epsr. 
• The processor type is one for the i860 XR micro- 


processor. 
• The stepping number has a unique value that dis- 


tinguishes among different revisions of the proc- 
essor. 


• IL (Interlock) is set if a trap occurs after a lock 


instruction but before the load or store following 
the subsequent unlock instruction. IL indicates to 
the trap handler that a locked sequence has 
been interrupted. When the trap handler finds IL 
set, it should scan backwards for the lock in- 
struction and restart at that point. The absence of 
a lock instruction within 30-33 instructions of the 
trap indicates a programming error. 
• WP (write protect) controls the semantics of the 


W bit of page table entries. A clear W bit in either 
the directory or the page table entry causes 
writes to be trapped. When WP is clear, writes 
are trapped in user mode, but not in supervisor 
mode. When WP is set, writes are trapped in both 
user and supervisor modes. After the value of the 
WP bit is changed, the TLB must be invalidated 
by setting the ITI bit of the dlrbase register, be- 
fore any stores are performed. 
• INT (Interrupt) is the value of the INT input pin. 
• DCS (Data Cache Size) is a read-only field that 


tells the size of the on-Chipdata cache. The num- 
ber of bytes actually available is 212+DCS; 
there- 


fore, a value of zero indicates 4 Kbytes, one indi- 
cates 8 Kbytes, etc. 


ADDRESS TRANSLATION ENABLE -----------------------, 
DRAM PAGE SIZE 
-------------------------, 
BUS LOCK ------------------------, 


I-CACHE, 
TLB INVALIDATE------------------1 
(RESERVED)---------------------1 
CODE SIZE 8-BIT 
-------------------[ 
REPLACEMENT BLOCK 
1 • 


REPLACEMENT CONTROL 
------------- 


31 


DIRECTORY TABLE BASE (DTS) 


12 


• 
• 
240296-7 
• 
• 
••• 
• 
• 


'Can 
be changed 
only from supervisor 
level 


Figure 2.6. Directory Base Register 
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• 
PBM (Page-Table 
Bit Mode) determines 
which bit 
of page-table 
entries 
is output 
on the 
PTB pin. 


When PBM is clear, the PTB signal reflects 
bit CD 
of the page-table 
entry used for the current cycle. 
When PBM is set, the PTB signal reflects 
bit WT 
of the page-table 
entry used for the current cycle. 


• 
BE (Big Endian) 
controls 
the 
ordering 
of bytes 
within a data item in memory. 
Normally 
(i.e. when 
BE is clear) the i860 XR microprocessor 
operates 


in little endian mode, in which the addressed 
byte 


is the low-order 
byte. When BE is set (big endian 
mode), 
the 
low-order 
three 
bits of all load 
and 
store 
addresses 
are 
complemented, 
then 
masked 
to the 
appropriate 
boundary 
for 
align- 
ment. This causes 
the addressed 
byte to be the 
most significant 
byte. Section 
2.3 discusses 
little 
and big endian 
addressing. 


• 
OF (Overflow 
Flag) is set by adds, 
addu, 
subs, 


and 
subu 
when 
integer 
overflow 
occurs. 
For 
adds 
and subs, 
OF is set if the carry from bit 31 
is different 
than the carry from bit 30. For addu, 
OF is set if there is a carry from bit 31. For subu, 
OF is set if there is no carry from bit 31. Under all 
other 
conditions, 
it is cleared 
by these 
instruc- 
tions. 
OF controls 
the function 
of the Intovr 
in- 
struction. 
OF cannot 
be written 
in user mode us- 


ing ST.C. 


2.2.5 DATA 
BREAKPOINT 
REGISTER 


The data breakpoint 
register 
(db) is used to gener- 
ate a trap when the i860 XR microprocessor 
makes 
a data-operand 
access 
to the address 
stored 
in this 


register. 
The trap is enabled 
by BR and BW in psr. 
The db register 
can only be changed 
from supervi- 


sor level. When 
comparing, 
a number 
of low order 
bits of the address 
are ignored, 
depending 
on the 


size of the operand. 
For example, 
a 16-bit access 


ignores 
the low-order 
bit of the address 
when com- 


paring to db; a 32-bit access 
ignores 
the low-order 


two bits. This ensures 
that any access 
that overlaps 


the address 
contained 
in the register will generate 
a 
trap. The 
OAT occurs 
before 
the data is accessed 
and prevents 
the load or store from completing. 


2.2.6 DIRECTORY 
BASE 
REt;ISTER 


The directory 
base register dlrbase 
(shown in Figure 
2.6) controls 
address 
translation, 
caching, 
and bus 
options. 
The dlrbase 
register 
can only be changed 
from 
supervisor 
level. The 
BL bit is changed 
from 
user level with the lock 
and unlock 
instructions. 


• 
ATE (Address 
Translation 
Enable), when set, en- 
ables 
the 
virtual-address 
translation 
algorithm. 


The data cache must be flushed 
before changing 
the ATE bit. 


• 
DPS (DRAM 
Page Size) controls 
how many bits 
to ignore 
when 
comparing 
the current 
bus-cycle 
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address 
with the previous 
bus-cycle 
address 
to 
generate 
the NENE'" 
signal. This feature 
allows 
for higher 
speeds 
when 
using 
static 
column 
or 
page-mode 
DRAMs 
and consecutive 
reads 
and 
writes 
access 
the row. The comparison 
ignores 
the low-order 
12 + DPS bits. A value of zero is 
appropriate 
for one bank of 256K x n RAMs, 
1 
for 1M X n RAMS, etc. For interleaved 
memory, 


increase 
DPS by one for each power of interleav- 
ing-add 
one for 2-way, and two for 4-way, etc. 


• 
When BL (Bus Lock) is set, external 
bus access- 
es are locked. The LOCK'" 
signal is asserted 
the 
next bus cycle whose 
internal bus request 
is gen- 
erated 
after 
BL is set. 
It remains 
set on every 
subsequent 
bus cycle as long as BL remains 
set. 


The 
LOCK'" 
signal 
is deasserted 
on 
the 
next 
load or store instruction 
after BL is cleared. Traps 
immediately 
clear 
BL. The 
lock 
and 
unlock 
in- 
structions 
control 
the BL bit. The result of modify- 


ing BL with the st.c 
instruction 
is not defined. 


• 
ITI (I-Cache, TLB Invalidate), 
when set in the val- 
ue that is loaded 
into dlrbase, 
causes 
all entries 


in the instruction 
cache 
and address-translation 
cache 
(TLB) to be invalidated. 
The 
ITI bit does 
not remain set in dlrbase. 
ITI always 
appears 
as 
zero when reading 
dlrbase. 
Section 
2.5 discuss- 


es flushing 
the data cache before invalidating 
the 
TLB. 


• 
When 
CS8 (Code 
Size 8-Bit) 
is set, 
instruction 
cache misses are processed 
as 8-bit bus cycles. 


When 
this bit is clear, 
instruction 
cache 
misses 
are processed 
as 64-bit bus cycles. 
This bit can 
not be set by software; 
hardware 
sets this bit at 


initialization 
time. 
It can be cleared 
by software 
(one time only) to allow the system to execute 
out 
of 64-bit 
memory 
after 
bootstrapping 
from 
8-bit 
EPROM. 
A nondelayed 
branch 
to code 
in 64-bit 
memory should directly follow the st.C (store con- 
trol register) 
instruction 
that clears 
CS8, in order 
to make the transition 
from 8-bit to 64-bit memory 
occur at the correct 
time. The branch 
instruction 


must be aligned on a 64-bit boundary. 


• 
RB 
(Replacement 
Block) 
identifies 
the 
cache 
block to be replaced 
by cache replacement 
algo- 
rithms. The high-order 
bit of RB is ignored 
by the 
instruction 
and data 
caches. 
RB conditions 
the 
cache flush instruction 
flush, 
which 
is discussed 
in Section 
8. Table 2.3 explains 
the values of RB. 


• 
RC 
(Replacement 
Control) 
controls 
cache 
re- 


placement 
algorithms. 
Table 2.4 explains 
the sig- 


nificance 
of the values 
of RC. 


• 
DTB (Directory 
Table Base) contains 
the high-or- 
der 20 bits of the physical 
address 
of the page 
directory 
when address translation 
is enabled 
(i.e. 


ATE = 1). The low-order 
12 bits of the address 
are zeros. 
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Table 
2.3. Values 
of RB 


Figure 
2.7. Floating-Point 
Status 
Register 


2.2.7 FAULT 
INSTRUCTION 
REGISTER 


Value 
Replace 
Replace 
Instruction 
TLBBlock 
and Data Cache 
Block 


0 
0 
0 
0 


0 
1 
1 
1 


1 
0 
2 
0 


1 
1 
3 
1 


Table 
2.4. Values 
of RC 


Value 
Meaning 


00 
Selects the normal replacement 
algorithm 
where any block in the set 
may be replaced 
on cache misses in all 
caches. 


01 
Instruction, 
data, and TLB cache 
misses replace the block selected 
by 
RB. The instruction 
and data caches 


ignore the high·order 
bit of RB. This 
mode is used for instruction 
cache and 
TLB testing. 


10 
Data cache misses replace the block 
selected 
by the low-order 
bit of RB. 


Instruction 
and TLB caches 
use 


random replacement. 


11 
Disables data cache replacement. 
Instruction 
and TLB caches use 
random replacement. 


When 
a trap occurs, 
this register 
contains 
the ad- 


dress of the trapping 
instruction 
(not necessarily 
the 


instruction 
that created 
the conditions 
that required 


the trap). The fir is a read-only 
register. 
In single·in· 


struction 
mode, 
using a Id.c instruction 
to read the 


fir anytime 
except the first time after a trap saves in 
tdest the address 
of the Id.c instruction; 
in dual-in- 
struction 
mode, the address of its floatinq-point 
corn- 


panion 
(address 
of the Id.c 
- 
4) is saved. 


2.~.8 FLOATING-POINT 
STATUS 
REGISTER 


The floating-point 
status 
register 
(fsr) 
contains 
the 
floating-point 
trap and rounding-mode 
status for the 
current process. 
Figure 2.7 shows its format. The fsr 


is writable 
in user level. 


• 
If FZ (Flush Zero) is clear and underflow 
occurs, 


a result-exception 
trap is generated. 
When 
FZ is 


set and underflow 
occurs, the result is set to zero, 


and no trap due to underflow 
occurs. 


• 
If TI (Trap Inexact) 
is clear, inexact 
results do not 


cause a trap. If TI is set, inexact 
results 
cause a 


trap. The sticky inexact 
flag (SI) is set whenever 


an inexact 
result 
is produced, 
regardless 
of the 


setting of TI. 


• 
RM (Rounding 
Mode) 
specifies 
one 
of the four 
rounding 
modes 
defined 
by the 
IEEE standard. 


Given a true result b that cannot 
be represented 
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Table 
2 5 Values 
of RM 
.. 


Value 
Rounding 
Mode 
Rounding 
Action 


00 
Round to nearest or even 
Closer to b of 8 or c; if equally 
close, select even number 
(the one whose least 
significant 
bit is zero). 
01 
Round down (toward 
- 
00 ) 
8 
10 
Round up (toward + 00 
c 
11 
Chop (toward zero) 
Smaller in magnitude 
of 8 or c. 


by the target data type, the i860 XR microproces- 
sor determines 
the two representable 
numbers 8 
and c that most closely bracket b in value (8 < b 
< c). The i860 XR microprocessor 
then rounds 
(changes) b to 8 or c according 
to the mode se- 
lected 
by RM as defined 
in Table 2.5. Rounding 
introduces 
an error in the result that is less than 
one least-significant 
bit. 


• 
The U-bit (Update 
Bit), if set in the value that is 
loaded 
into fsr 
by a st.c 
instruction, 
enables 
up- 
dating 
of the result-status 
bits (AE, AA, AI, AO, 
AU, MA, MI, MO, and MU) in the first-stage 
of the 
floating-point 
adder and multiplier 
pipelines. 
If this 
bit is clear, 
the result-status 
bits are unaffected 
by a st.c instruction; 
st.e ignores the correspond- 
ing bits in the value that is being 
loaded. 
A st.e 
always 
updates 
fsr 
bits 21 ..17 and 8..0 directly. 
The U-bit does not remain set; it always appears 
as zero when read. 


• 
The FTE (Floating-Point 
Trap Enable) bit, if clear, 


disables 
all floating-point 
traps (invalid input oper- 
and, overflow, 
underflow, 
and inexact 
result). 


• 
SI (Sticky 
Inexact) 
is set when the last stage re- 
sult of either the multiplier 
or adder is inexact 
(i.e. 
when either AI or MI is set). SI is "sticky" 
in the 
sense that it remains 
set until reset by software. 


AI and MI, on the other hand, can by changed 
by 
the subsequent 
floating-point 
instruction. 


• 
SE (Source 
Exception) 
is set when 
one 
of the 
source 
operands 
of a floating-point 
operation 
is 
invalid; 
it is cleared 
when all the input operands 
are valid. 
Invalid 
input 
operands 
include 
donor- 
mals, infinities, 
and all NaNs (both quiet and sig- 
naling). 


• 
When read from the fsr, the result-status 
bits MA, 
MI, 
MO, 
and 
MU 
(Multiplier 
Add-One, 
Inexact, 


Overflow, 
and 
Underflow, 
respectively) 
describe 
the last stage result of the multiplier. 


When read from the fsr, the result-status 
bits AA, 
AI, AO, 
AU, 
and 
AE (Adder 
Add-One, 
Inexact, 


Overflow, 
Underflow, 
and Exponent, 
respectively) 
describe 
the last stage 
result 
of the adder. 
The 
high-order 
three bits of the 11-bit exponent 
of the 
adder result are stored in the AE field. 


The Adder Add One and Multiplier 
Add One bits 
indicate that the absolute 
value of the result frac- 
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tion 
grew 
by 
one 
least-significant 
bit 
due 
to 


rounding. 
AA and MA are not influenced 
by the 
sign of the result. 


After a floating-point 
operation 
in a given unit (ad- 
der or multiplier), 
the result-status 
bits of that unit 
are undefined 
until the point 
at which 
result 
ex- 
ceptions 
are reported. 


When 
written 
to the fsr 
with 
the 
U-bit 
set, the 
result-status 
bits are placed 
into the first stage of 
the 
adder 
and 
multiplier 
pipelines. 
When 
the 
processor 
executes 
pipelined 
operations, 
it prop- 
agates 
the 
result-status 
bits of a particular 
unit 
(multiplier 
or adder) one stage for each pipelined 
floating-point 
operation 
for that 
unit. When 
they 
reach the last stage, they replace 
the normal 
re- 
sult-status 
bits in the fsr. 
When 
the U-bit is not 
set, result-status 
bits in the word being written 
to 
the fsr are ignored. 


In a floating-point 
dual-operation 
instruction 
(e.g. 


add-and-multiply 
or 
subtract-and-multiply), 
both 


the 
multiplier 
and the adder 
may set exception 
bits. The result-status 
bits for a particular 
unit re- 


main set until the next operation 
that 
uses that 


unit. 


• 
RR 
(Result 
Register) 
specifies 
which 
floating- 
point register 
(fO-f31) 
was the destination 
regis- 
ter when a result-exception 
trap occurs 
due to a 
scalar operation. 


• 
LRP (Load 
Pipe Result 
Precision), 
IRP (Integer 
(Graphics) 
Pipe Result Precision), 
MRP (Multiplier 
Pipe Result Precision), 
and ARP (Adder 
Pipe Re- 
sult Precision) 
aid in restoring 
pipeline 
state after 
a trap or process 
switch. 
Each defines 
the preci- 
sion of the last stage result in the corresponding 
pipeline. 
One of these 
bits is set when the result 
in the last stage of the corresponding 
pipeline 
is 
double precision; 
it is 'cleared if the result is single 
precision. 
These bits cannot 
be changed 
by soft- 


ware. 


2.2.9 KR, KI, T, AND 
MERGE 
REGISTERS 


The KR, KI, and T registers 
are special-purpose 
reg- 
isters 
used 
by 
the 
dual-operation 
floating-point 
instructions 
pfam, 
pfmam, 
pfsm, 
and 
pfmsm, 


which initiate both an adder (A-unit) operation 
and a 
multiplier 
(M-unit) operation. 
The KR, KI, and T regis- 
ters 
can 
store 
values 
from 
one 
dual-operation 
in- 
struction 
and supply 
them 
as inputs 
to subsequent 
dual-operation 
instructions. 
(Refer to Figure 2.14.) 


The 
MERGE 
register 
is used 
only by the graphics 
instructions. 
The purpose 
of the MERGE 
register 
is 
to accumulate 
(or merge) the results of multiple-ad- 
dition operations 
that use as operands 
the color-in- 
tensity 
values from pixels or distance 
values 
from a 
Z-buffer. 
The 
accumulated 
results 
can 
then 
be 
stored 
in one 64-bit operation. 


Two 
multiple-addition 
instructions 
and 
an 
OR 
in- 


struction 
use the MERGE 
register. 
The addition 
in- 
structions 
are designed 
to add interpolation 
values 
to each color-intensity 
field in an array of pixels or to 
each distance 
value in a Z-buffer. 


Refer to the instruction 
descriptions 
in section 
8 for 
more information 
about these registers. 


2.3 Addressing 


Memory 
is addressed 
in byte units with a paged vir- 
tual-address 
space 
of 232 bytes. 
Data and instruc- 
tions 
can 
be 
located 
anywhere 
in 
this 
address 
space. 
Address 
arithmetic 
is performed 
using 32-bit 
input values and produces 
32-bit results. The low-or- 
der 32 bits of the result are used in case of overflow. 


Normally, 
multibyte 
data values are stored in memo- 
ry in little endian format, i.e., with the least significant 
byte at the lowest 
memory 
address. 
As an option, 


the 
ordering 
can 
be dynamically 
selected 
by soft- 
ware in supervisor 
mode. The i860 XR microproces- 
sor also offers 
big endian 
mode, in which 
the most 
significant 
byte of a data item is at the lowest 
ad- 
dress. 
Figure 2.8 shows the difference 
between 
the 
two storage 
modes. Big endian and little endian data 
areas should not be mixed within a 64-bit data word. 
Illustrations 
of 
data 
structures 
in this 
data 
sheet 
show data stored 
in little endian 
mode, i.e., the low- 
order byte is at the lowest 
memory 
address. 
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Code 
accesses 
are always 
done 
with 
little 
endian 
addressing. 
This implies that code will appear differ- 


ently than documented 
here when accessed 
as big 
endian 
data. 
Intel recommends 
that 
disassemblers 
running 
in a big endian 
system, 
convert 
instructions 
which 
have been read as data back to little endian 
form 
and 
present 
them 
in the format 
documented 
here. 


Page directories 
and page tables are also accessed 
in little endian 
mode, 
regardless 
of the value of the 
BE bit. 


Alignment 
requirements 
are as follows 
(any violation 
results in a data-access 
trap): 


• 
128-bit values are aligned 
on 16-byte 
boundaries 
when 
referenced 
in memory 
(i.e. the 
four 
least 
significant 
address 
bits must be zero). 


• 
64-bit 
values 
are aligned 
on 8-byte 
boundaries 
when 
referenced 
in memory 
(i.e. the three 
least 
significant 
address 
bits must be zero). 


• 
32-bit 
values 
are aligned 
on 4-byte 
boundaries 
when 
referenced 
in memory 
(i.e. the 
two 
least 
significant 
address 
bits must be zero). 


• 
16-bit 
values 
are aligned 
on 2-byte 
boundaries 
when referenced 
in memory 
(i.e. the least signifi- 


cant address 
bit must be zero). 


2.4 Virtual Addressing 


When 
address 
translation 
is enabled, 
the 
i860 XR 
microprocessor 
maps instruction 
and data virtual ad- 
dresses 
into physical 
addresses 
before 
referenCing 
memory. 
This address 
transformation 
is compatible 
with that of the Intel386™ 
microprocessor 
and im- 
plements 
the basic features 
needed 
for page-orient- 
ed virtual-memory 
systems 
and 
page-level 
protec- 
tion. 


The address translation 
is optional. 
Address 
transla- 
tion is in effect 
only when the ATE bit of dlrbase 
is 
set. This bit is typically 
set by the.operatinq 
system 
during 
software 
initialization. 
The ATE bit must 
be 
set if the operating 
system 
is to implement 
page-ori- 
ented protection 
or page-oriented 
virtual memory. 
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Figure 2.9. Format of a Virtual Address 


Address translation is disabled when the processor 
is reset. It is enabled when a store to dirbase 
sets 
the ATE bit. It is disabled again when a store clears 
the ATE bit. 


2.4.1 PAGE FRAME 


A page 
frame 
is a 4-Kbyte unit of contiguous ad- 
dresses of physical main memory. Page frames be- 
gin on 4-Kbyte boundaries and are fixed in size. A 
page 
is the collection of data that occupies a page 
frame when that data is present in main memory. 
The data may also occupy some location in second- 
ary storage when there is not sufficient space in 
main memory. 


2.4.2 VIRTUAL 
ADDRESS 


A virtual address refers indirectly to a physical ad- 
dress by specifying a page table, a page within that 


table, and an offset within that page. Figure 2.9 
shows the format of a virtual address. 


Figure 2.10 shows how the i860 XR microprocessor 
converts the DIR, PAGE, and OFFSET fields of a 
virtual address into the physical address by consult- 
ing two levels of page tables. The addressing mech- 
anism uses the DIR field as an index into a page 
directory, uses the PAGE field as an index into the 
page table determined by the page directory, and 
uses the OFFSET field to address a byte within the 
page determined by the page table. 


2.4.3 PAGE TABLES 


A page table is simply an array of 32-bit page specifi- 
ers. A page table is itself a page, and therefore con- 
tains 4 Kbytes of memory or at most 1K 32-bit en- 
tries. 


I DIR IPAGE I OffSET I 
PAGE fRAME 
I 
PHYSICAL 
ADDRESS 


PAGE DIRECTORY 
PAGE TABLE 


-. 
PG TBl 
ENTRY 
I.-..t 
DIR ENTRY 
I-- 


DTB 
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Figure 2.10. Address 
Translation 
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Two levels of tables are used to address a page of 
memory. At the higher level is a page directory. The 
page directory addresses up to 1K page tables of 
the second level. A page table of the second level 
addresses up to 1K pages. All the tables addressed 
by one page directory, therefore, can address 1M 
pages (220).Because each page contains 4 Kbytes 
(212 bytes), the tables of one page directory can 
span the entire physical address space of the i860 
XR microprocessor (220 X 212 = 232). 


The physical address of the current page directory is 
stored in DTB field of the dlrbase 
register. Memory 
management software has the option of using one 
page directory for all processes, one page directory 
for each process, or some combination of the two. 


2.4.4 PAGE-TABLE 
ENTRIES 


Page-table entries (PTEs) in either level of page ta- 
bles have the same format. Figure 2.11 illustrates 
this format. 


2.4.4.1 Page Frame Address 


The page frame address specifies the physical start- 
ing address of a page. Because pages are located 
on 4K boundaries, the low-order 12 bits are always 
zero. In a page directory, the page frame address is 
the address of a page table. In a second-level page 
table, the page frame address is the address of the 
page frame that contains the desired memory oper- 
and. 


2.4.4.2 Present 
Bit 


The P (present) bit indicates whether a page table 
entry can be used in address translation. P = 1 indi· 


infel~ 


cates that the entry can be used. When P = 0 in 
either level of page tables, the entry is not valid for 
address translation, and the rest of the entry is avail- 
able for software use; none of the other bits in the 
entry is tested by the hardware. If P = 0 in either 
level of page tables when an attempt is made to use 
a page-table entry for address translation, the proc- 
essor signals either a data-access fault or an in- 
struction-access fault. In software systems that sup- 
port paged virtual memory, the trap handler can 
bring the required page into physical memory. 


Note that there is no P bit for the page directory 
itself. The page directory may be not-present while 
the associated process is suspended, but the oper- 
ating system must ensure that the page directory 
indicated by the dlrbase 
image associated with the 
process is present in physical memory before the 
process is dispatched. 


2.4.4.3 Wrltable 
and User Bits 


The W (writable) and U (user) bits are used for page- 
level protection, which the i860 XR microprocessor 
performs at the same time as address translation. 
The concept of privilege for pages is implemented 
by assigning each page to one of two levels: 
1. Supervisor level (U = O)-for 
the operating sys- 
tem and other systems software and related data. 


2. User level (U = 1)-for 
applications procedures 
and data. 


The U bit of the psr indicates whether the i860 XR 
microprocessor is executing at user or supervisor 
level. The i860 XR microprocessor maintains the U 
bit of psr as follows: 


PRESENT------------------------, 
WRITABLE----------------------, 
USER----------------------, 
WRITE-THROUGH 
------------------, 
CACHEDISABLE------------------, 
ACCESSED--------------------, 
DIRTY-------------------, 


(RESERVED)----------------,1 
AVAILABLEfOR SYSTEhtSPROGRAIoI 
USE------1 


31 
12 
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NOTE: 
X indicates 
Intel reserved. 
Do not use. 


Figure 2.11. Format of a Page Table Entry 
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• The i860 XR microprocessor clears the psr U bit 
to indicate supervisor level when a trap occurs 
(including when the trap instruction causes the 
trap). The prior value of U is copied into PU. 


• The i860 XR microprocessor copies the psr PU 
bit into the U bit when an indirect branch is exe- 
cuted and one of the trap bits is set. If PU was 
one, the i860 XR microprocessor enters user 
level. 


With the U bit of psr and the Wand 
U bits of the 
page table entries, the i860 XR microprocessor im- 
plements the following protection rules: 


• When at user level, a read or write of a supervi- 


sor-level page causes a trap. 


• When at user level, a write to a page whose W bit 
is clear causes a trap. 
• When at user level, st.c to certain control regis- 
ters is ignored. 


When the i860 XR microprocessor is executing at 
supervisor level, all pages are addressable, but, 
when it is executing at user level, only pages that 
belong to the user-level are addressable. 


When the i860 XR microprocessor is executing at 
supervisor level, all pages are readable. Whether a 
page is writable depends upon the write-protection 
mode controlled by WP of epsr: 


WP = 0 
All pages are writable. 
WP = 1 
A write to a page whose W bit is 
clear causes a trap. 


When the i860 XR microprocessor is executing at 
user level, only pages that belong to user level and 
are marked writable are actually writable; pages that 
belong to supervisor level are neither readable nor 
writable from user level. 


2.4.4.4 Write-Through 
Bit 


The i860 XR microprocessor does not implement a 
write-through caching policy for the on-chip data 
cache; however, the WT (write-through) bit in the 
second-level page-table entry does determine inter- 
nal caching policy. If WT is set in a PTE, on-chip 
caching of data from the corresponding.page is in- 
hibited. The i860 XR CPU may place pages having 
WT = 
1 into the instruction cache. Future imple- 
mentations of the i860 XR architecture may adhere 
to a write-through data caching policy. Therefore, 
they may cache pages having the WT bit of the PTE 
set. If WT is clear, the normal write-back policy is 
applied to data from the page in the on-chip caches. 
The WT bit of page directory entries is not refer- 
enced by the processor, but is reserved. 


The WT bit is independent of the CD bit; therefore, 
data may be placed in a second-level coherent 
cache, but kept out of the on-chip caches. 
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2.4.4.5 Cache Disable Bit 


If the CD (cache disable) bit in the second-level 
page-table entry is set, data from the associated 
page is not placed in instruction or data caches. 
Clearing CD permits the cache hardware to place 
data from the associated page into caches. The CD 
bit of page directory entries is not referenced by the 
processor, but is reserved. 


To control external caches, the i860 XR microproc- 
essor outputs on its PTB pin either the CD or WT bit. 
The PBM bit of epsr determines which bit is output. 


2.4.4.6 Accessed and Dirty Bits 
, 


The A (aceessed) 
and D (dirty) bits provide data 
about page usage in both levels of the page tables. 


The i860 XR microprocessor sets the corresponding 
accessed bits in both levels of page tables before a 
read or write operation to a page. The processor 
tests the dirty bit in the second-level page table be- 
fore a write to an address covered by that page table 
entry, and, under certain conditions, causes traps. 
The trap handler then has the opportunity to main- 
tain appropriate values in the dirty bits. The dirty bit 
in directory entries is not tested by the i860 XR mi- 
croprocessor. The precise algorithm for using these 
bits is specified in Section 2.4.5. 


An operating system that supports paged virtual 
memory can use these bits to determine what pages 
to eliminate from physical memory when the de- 
mand for memory exceeds the physical memory 
available. The D and A bits in the PTE (page-table 
entry) are normally initialized to zero by the operat- 
ing system. The processor sets the A bit when a 
page is accessed either by a read or write operation. 
When a data- or instruction-access fault occurs, the 
trap handler sets the D bit if an allowable write is 
being performed, then re-executes the instruction. 


The operating system is responsible for coordinating 
its updates to the accessed and dirty bits with up- 
dates by the CPU and by other processors that may 
share the page tables. The i860 XR microprocessor 
automatically asserts the LOCK# signal while set- 
ting the A bit. If an A-bit of a PTE is found not set 
during a locked sequence (created by the lock in- 
struction), a trap will occur and the processor will not 
update the A-bit. 


2.4.4.7 Combining Protection of Both Levels of 


Page Tables 


For anyone 
page, the protection attributes of its 


page directory entry may differ from those of its 
page table entry. The i860 XR microprocessor com- 
putes the effective protection attributes for a page 
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by examining the protection attributes in both the 
directory and the page table. Table 2.6 shows the 
effective protection provided by the possible combi- 
nations of protection attributes. 


2.4.5 ADDRESS TRANSLATION ALGORITHM 


The algorithm below defines the translation of each 
virtual address to a physical address. Let DIR, 
PAGE, and OFFSET be the fields of the virtual ad- 
dress; let PFA1 and PFA2 be the page frame ad- 
dress fields of the first and second level page tables 
respectively; DTB is the page directory table base 
address stored in the dirbase register. 
1. Read the PTE (page table entry) at the physical 
address formed by DTB:DIR:OO. 


2. If P in the PTE is zero, generate a data- or instruc- 
tion-access fault. 


3. If W in the PTE is zero, the operation is a write, 


and either the U-bit of the PSR is set or WP = 1, 
generate a data or instruction access fault. 


4. If the U·bit in the PTE is zero and the U-bit in the 
psr is set, generate a data or instruction access 
fault. 


5. If A in the PTE is zero, and if the TLB miss oc- 


curred while the bus was locked, generate a 


data or instruction access fault. (The trap allows 
software to set A to one and restart the se- 
quence. This avoids ambiguity in determining 
what address corresponds to a locked sema- 
phore for external bus hardware use.) 
6. If A in the PTE is zero, and if the TLB miss oc- 


curred while the bus was not locked, assert 
LOCK;#. Re-fetch and check the PTE, set A, and 
store the PTE. Deassert LOCK;# during the store. 


7. Locate the PTE at the physical address formed by 
PFA1:PAGE:00. 
8. Perform the P, W, U, and A checks as in steps 2 
through 6 with the second-level PTE. 


9. If D in the PTE is clear and the operation is a 
write, generate a data or instruction access fault. 


10. Form the physical address as PFA2:0FFSET. 


The i860 XR microprocessor looks only in external 
memory for Page Directories and Page Tables, in 
the translation process. The data cache is not 
searched. Therefore, any code which modifies Page 
Directories or Page Tables must keep them out of 
the cache. The tables should be kept in non-cache- 
able memory, or flushed from the cache. 


Table 2.6. Combining Directory and Page Protections 


Page Directory 
Page Table 
Combined Protection 


Entry 
Entry 
User 
Supervisor 
Access 
Access 


U-blt 
W-blt 
U-blt 
W-blt 
WP= 
X 
WP = 0 
WP = 1 


0 
0 
0 
0 
N 
R/W 
R 
0 
0 
0 
1 
N 
R/W 
R 
0 
0 
1 
0 
N 
R/W 
R 
0 
0 
1 
1 
N 
R/W 
R 


0 
1 
0 
0 
N 
R/W 
R 
0 
1 
0 
1 
N 
R/W 
R/W 
0 
1 
1 
0 
N 
R/W 
R 
0 
1 
1 
1 
N 
R/W 
R/W 


1 
0 
0 
0 
N 
R/W 
R 
1 
0 
0 
1 
N 
R/W 
R 
1 
0 
. 1 
0 
R 
R/W 
R 
1 
0 
1 
1 
R 
R/W 
R 


1 
1 
0 
0 
N 
R/W 
R 
1 
1 
0 
1 
N 
R/W 
R/W 
1 
1 
1 
0 
R 
R/W 
R 
1 
1 
1 
1 
R/W 
R/W 
R/W 


NOTES: 
N = No access 
allowed 
R = Read access 
only 
R/W 
= Both reads and writes 
allowed 
X = Don't 
care 
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The i860 XR microprocessor expects Page Directo- 
ries and Page Tables to be in little endian format. 
The operating system must maintain these tables in 
little endian format by either setting BE = 0 when 
manipulating the tables or by complementing bit 2 of 
the address when loading or storing entries. 


2.4.6 ADDRESS TRANSLATION FAULTS 


The address translation fault is one instance of the 
data-access fault. The instruction causing the fault 
can be re-executed upon returning from the trap 
handler. 


2.4.7 PAGE TRANSLATION CACHE 


For greatest efficiency in address translation, the 
i860 XR microprocessor stores the most recently 
used page-table data in an on-chip cache called the 
TLB (translation lookaside buffer). Only if the neces- 
sary paging information is not in the cache must 
both levels of page tables be referenced. 


2.5 Caching and Cache Flushing 


The i860 XR microprocessor has the ability to cache 
instruction, data, and address-translation informa- 
tion in on-chip caches. Caching uses virtual-address 
tags. The effects of mapping two different virtual ad- 
dresses in the same address space to the same 
physical address are undefined. 


Instruction, data, and address-translation caching on 
the i860 XR microprocessor are not transparent. Be- 
cause the data cache uses a write-back protocol, 
writes do not immediately update memory, and 
writes to memory by other bus devices do not up- 
date the cache. Changes to page tables do not auto- 
matically update the TLB, and changes to instruc- 
tions do not automatically update the instruction 
cache. Under certain circumstances, such as 1/0 
references, 
self-modifying 
code, 
page-table 
up- 
dates, or shared data in a multiprocessing system, it 
is necessary to bypass or to flush the caches. The 
i860 XR 
microprocessor 
provides the following 
methods for doing this: 


• Bypassing 
Instruction 
and Data Caches. 
If 
deasserted during cache-miss processing, the 
KEN., pin disables instruction and data caching 
of the referenced data. If the CD bit of the associ- 
ated second-level PTE is set, caching of data and 
instructions is disabled. The i860 XR CPU may 
place pages having WT = 1 into the instruction 


i860TM XR MICROPROCESSOR 


cache. Future implementations of the i860 XR ar- 
chitecture may adhere to a write-through data 
cache policy. Thus, they may cache pages having 
the WT bit of the PTE set. The value of the CD bit 
or the WT bit is output on the PTB pin for use by 
external caches. 


• Invalidating 
Instruction 
and Address-Transla- 
tion Caches. Storing to the dlrbase register with 
the ITI bit set invalidates the contents of the in- 
struction and address-translation caches. This bit 
should be set when modifying a page table, when 
modifying a page containing instructions, or when 
changing the DTB field of dirbase or the WP bit 
of the epsr. Note that in order to make the in- 
struction or address-translation caches consist- 
ent with the data cache, the data cache must be 
flushed before invalidating the other caches. 


NOTE: 
The mapping of the page containing the 
currently 
executing 
instruction 
and 
the 
next six instructions should not be differ- 
ent in the new page tables when st.c dlr- 
base changes DTB or activates ITI. The 
six instructions following the st.c should 
be nops and should lie in the same page 
as the st.c. 
• Flushing the Data Cache. The data cache is 
flushed by a software routine using the flush in- 
struction. The data cache must be flushed prior to 
invalidating the instruction or address-translation 
caches (as controlled by the ITI bit of dlrbase) or 
enabling or disabling address translation (via the 
ATE bit). The data cache does not need flushing 
if the program is modifying only the P, U, W, A, or 
o bits of a PTE (as long as the Page Frame Ad- 
dress is not changed and the PTE itself was not 
in the data cache.) The i860 XR CPU does not 
check these protection bits on cache line write- 
back. Thus, a trap handler can service a OAT for 
D-bit-zero by setting 0 = 1 and then ITI = 1. In 
the case of setting the P or A bits active, there is 
no need to invalidate or flush any caches be- 
cause the processor does not load entries into 
the TLB that have P = 0 or A = O. The i860 XR 
microprocessor searches only external memory 
for Page Directories and Page Tables in the 
translation 
process. 
The 
data 
cache 
is 
not 


searched. Therefore, Page Tables and Directo- 
ries should be kept in non-cacheable memory, or 
flushed from the cache by any code which ac- 
cesses them. 


; 
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2.6 Instruction Set 


Table 2.7 shows the complete set of instructions 
grouped by function within processing unit. Refer to 
Section 8 for an algorithmic definition of each in- 
struction. 


The architecture of the i860 XR microprocessor 
uses parallelism to increase the rate at which opera- 
tions may be introduced into the unit. Parallelism in 
the i860 XR microprocessor is not transparent; rath- 
er, programmers have complete control over paral- 
lelism and therefore can achieve maximum perform- 
ance for a variety of computational problems. 


2.6.1 PIPELlNED AND SCALAR OPERATIONS 


One type of parallelism used within the floating-point 
unit is "pipelining". The pipelined architecture treats 
each operation as a series of more primitive opera- 
tions (called "stages") that can be executed in par- 
allel. Consider just the floating-point adder unit as an 
example. Let A represent the operation of the adder. 
Let the stages be represented by A1' A2' and A3. 
The stages are designed such that AI+ 1for one ad- 
der instruction can execute in parallel with AI for the 
next adder instruction. Furthermore, each AI can be 
executed in just one clock. The pipelining within the 
multiplier and graphics units can be described simi- 
larly, except that the number of stages may be differ- 
ent. 


Figure 2.12 
illustrates three-stage 
pipelining as 
found in the floating-point adder (also in the floating- 
point multiplier when single-precision input operands 
are employed). The columns of the figure represent 
the three stages of the pipeline. Each stage holds 
intermediate results and also (when introduced into 
first stage by software) holds status information per- 
taining to those results. The figure assumes that the 
instruction stream consists of a series of consecu- 
tive floating-point instructions, all of one type (Le.all 
adder instructions or all single-precision multiplier in- 
structions). The instructions are represented as I, 
1+ 1, etc. The rows of the figure represent the states 
of the unit at successive clock cycles. Each time a 
pipelined operation is performed, the result of the 
last stage of the pipeline is stored in the destination 
register tdest, the pipeline is advanced one stage, 
and the input operands tsrct and tsrce are trans- 
ferred to the first stage of the pipeline. 
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In the i860 XR microprocessor, the number of pipe- 
line stages ranges from one to three. A pipelined 
operation with a three-stage pipeline stores the re- 
sult of the third prior operation. A pipelined operation 
with a two-stage pipeline stores the result of the sec- 
ond prior operation. A pipelined operation with a 
one-stage pipeline stores the result of the prior oper- 
ation. 


There are four floating-point pipelines: one for the 
multiplier, one for the adder, one for the graphics 
unit, and one for floating-point loads. The adder 
pipeline has three stages. The number of stages in 
the multiplier pipeline depends on the precision of 
the source operands in the pipeline. Single precision 
has three stages and double precision has two 
stages. The graphics unit has one stage for all preci- 
sions. The load pipeline has three stages for all pre- 
cisions. 


Changing the FZ (flush zero), RM (rounding mode), 
or RR (result register) bits of fsr while there are re- 
sults in either the multiplier or adder pipeline produc- 
es effects that are not defined. 


2.6.1.1 Scalar Mode 


In addition to the pipelined execution mode, the i860 
XR microprocessor also can execute floating-point 
instructions in "scalar" mode. Most floating-point in- 
structions have both pipelined and scalar variants, 
distinguished by a bit in the instruction encoding. In 
scalar mode, the floating-point unit does not start a 
new operation until the previous floating-point oper- 
ation is completed. The scalar operation passes 
through all stages of its pipeline before a new opera- 
tion is introduced, and the result is stored automati- 
cally. Scalar mode is used when the next operation 
depends on results from the previous few floating- 
point operations (or when the compiler or program- 
mer does not want to deal with pipelining). 


2.6.1.2 Plpellnlng Status Information 


Result status information in the fsr consists of the 
M, AI, Aa, AU, and AE bits, in the case of the ad- 
der, and the MA, MI, MO, and MU bits, in the case of 
the multiplier. This information arrives at the fsr via 
the pipeline in one of two ways: 
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Table 
2.7. Instruction 
Set 


Core Unit 


Mnemonic 
Description 


Load and Store 
Instructions 


Id.x 
Load integer 
st.x 
Store integer 
fld.y 
F·P load 
pfld.z 
Pipelined 
F·P load 
fst.y 
F·P store 
pst.d 
Pixel store 


Register 
to Register 
Moves 
. 


ixfr 
Transfer 
integer to F·P register 


Integer 
Arithmetic 
Instructions 


addu 
Add unsigned 
adds 
Add signed 
subu 
Subtract 
unsigned 
subs 
Subtract 
signed 


Shift 
Instructions 


shl 
Shift left 
shr 
Shift right 
shra 
Shift right arithmetic 
shrd 
Shift right double 


Logical 
Instructions 


and 
Logical AND 
andh 
Logical AND high 
andnot 
Logical AND NOT 
andnoth 
Logical AND NOT high 
or 
Logical OR 
orh 
Logical OR high 
xor 
Logical exclusive 
OR 
xorh 
Logical exclusive 
OR high 


Control- 
Transfer 
Instructions 


trap 
Software 
trap 
intovr 
Software 
trap on integer overflow 
br 
Branch direct 
bri 
Branch indirect 
bc 
Branch on CC 
bc.t 
Branch on CC taken 
bnc 
Branch on not CC 
bnc.t 
Branch on not CC taken 
bte 
Branch if equal 
btne 
Branch if not equal 
bla 
Branch on LCC and add 
call 
Subroutine 
call 
calli 
Indirect subroutine 
call 


System 
Control 
Instructions 


flush 
Cache flush 
Id.c 
Load from control 
register 
st.c 
Store to control 
register 
lock 
Begin interlocked 
sequence 
unlock 
End interlocked 
sequence 


floating-Point 
Unit 


Mnemonic 
Description 


Register 
to Register 
Moves 


fxfr 
Transfer 
F-P to integer register 


f-P 
Multiplier 
Instruction 


fmul.p 
F-P multiply 
pfmul.p 
Pipelined 
F-P multiply 
pfmul3.dd 
3-Stage pipelined 
F-P multiply 
fmlow.p 
F-P multiply low 
frcp.p 
F-P reciprocal 
frsqr.p 
F-P reciprocal 
square root 


f-P Adder 
Instructions 


fadd.p 
F-P add 
pfadd.p 
Pipelined 
F-P add 


famov.r 
F-P adder move 
pfamov.r 
Pipelined 
F-P adder move 


fsub.p 
F-P subtract 
pfsub.p 
Pipelined 
F-P subtract 
pfgt.p 
Pipelined 
F-P greater-than 
compare 


pfeq.p 
Pipelined 
F-P equal compare 


fix.p 
F-P to integer conversion 
pfix.p 
Pipelined 
F-P to integer conversion 
ftrunc.p 
F-P to integer truncation 
pftrunc.p 
Pipelined 
F-P to integer truncation 


Dual-Operation 
Instructions 


pfam.p 
Pipelined 
F-P add and multiply 
pfsm.p 
Pipelined 
F-P subtract 
and multiply 


pfmam.p 
Pipelined 
F-P multiply with add 
pfmsm.p 
Pipelined 
F-P multiply with subtract 


Long 
Integer 
Instructions 


fisub.z 
Long·integer 
subtract 
pfisub.z 
Pipelined 
lonq-inteqer 
subtract 
fiadd.z 
Long-integer 
add 
pfiadd.z 
Pipe lined lonq-inteqer 
add 


Graphics 
Instructions 


fzchks 
16-bit Z-buffer 
check 
pfzchks 
Pipelined 
16-bit Z-buffer 
check 


fzchkl 
32-bit Z-buffer 
check 
pfzchkl 
Pipelined 32-bit Z-buffer 
check 
faddp 
Add with pixel merge 
pfaddp 
Pipelined add with pixel merge 
faddz 
Add with Z merge 
pfaddz 
Pipe lined add with Z merge 
form 
OR with MERGE register 
pform 
Pipelined OR with MERGE register 


Assembler 
Pseudo-Operatlons 


Mnemonic 
Description 


mov 
Integer register-register 
move 
fmov.r 
F-P reg-reg move 
pfmov.r 
Pipelined 
F-P reg-reg move 
nop 
Core no-operation 
fnop 
F-P no-operation 
pfle.p 
Pipelined 
F-P less-than 
or equal 
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Figure 2.12. Plpellned Instruction 
Execution 


1. It is calculated by the last stage of the pipeline. 
This is the normal case. 
2. It is propagated from the first stage of the pipe- 
line. This method is used when restoring the state 
of the pipeline after a preemption. When a store 
instruction updates the fsr and the value of the 
U bit in the word being written into the fsr is set, 
the store updates the result status bits in the first 
stage of both the adder and multiplier pipelines. 
When software changes the result-status bits of 
the first stage of a particular unit (multiplier or ad- 
der), the updated result-status bits are propagat- 
ed one stage for each pipelined floating-point op- 
eration for that unit. In this case, each stage of the 
adder and multiplier pipelines holds its own copy 
of the relevant bits of the f8r. When they reach 
the last stage, they override the normal result- 
status bits computed from the last stage result. 
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At the next floating-point instruction (or at certain 
core instructions), after the result reaches the last 
stage, the i860 XR microprocessor traps if any of the 
status bits of the fsr indicate exceptions. Note that 
the instruction that creates the exceptional condition 
is not the instruction at which the trap occurs. 


2_6_1.3Precision In the Pipelines 


In pipelined mode, when a floating-point operation is 
initiated, the result of an earlier pipelined floating- 
point operation is returned. The result precision of 
the current instruction applies to the operation being 
initiated. The precision of the value stored in fdest is 
that which was specified by the instruction that initia- 
ted that operation. 
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Figure 2.13. Dual-Instruction 
Mode Transitions 


If Idest is the same as'tsrct or Isrc2, the value being 
stored in Idest is used as the input operand. In this 
case, the precision of Idest must be the same as the 
source precision. 


The multiplier pipeline has two stages when the 
source operand is double-precision and three stages 
when the precision of the source operand is single. 
This means that a pipelined multiplier operation 
stores the result of the second previous multiplier 
operation for double-precision inputs and third previ- 
ous for single-precision inputs (except when chanq- 
ing precisions). 


2.6.1.4 Transition 
between 
Scalar and Pipe lined 
Operations 


When a scalar operation is executed, it passes 
through all stages of the pipeline; therefore, any un- 
stored results in the affected pipeline are lost. To 
avoid losing information, the last pipelined opera- 
tions before a scalar operation should be dummy 
pipelined operations that unload unstored results 
from the affected pipeline. 


After a scalar operation, the values of all pipeline 
stages of the affected unit (except the last) are un- 
defined. No spurious result-exception traps result 
when the undefined values are subsequently stored 
by pipelined operations; however, the values should 
not be referenced as source operands. 


For best performance a scalar operation should not 
immediately precede a pipelined operatlon whose 
Idest is nonzero. 


2.6.2 DUAL-INSTRUCTION 
MODE 


Another form of parallelism results from the fact that 
the i860 XR microprocessor can execute both a 
floating-point and a core instruction simultaneously. 
Such parallel execution is called dual-instruction 
mode. When executing in dual-instruction mode, the 
instruction sequence consists of 64-bit aligned in- 
structions with a floating-point instruction in the low- 
er 32 bits and a core instruction in the upper 32 bits. 
Table 2.7 identifies which instructions are executed 
by the core unit and which by the fI()ating-point unit. 
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Programmers 
specify 
dual-instruction 
mode 
either 
by including 
in the mnemonic 
of a floating-point 
in- 
struction 
a d. prefix or by using the Assembler 
direc- 
tives .dual 
.... 
enddual. 
Both of the specifications 
cause 
the 
O-bit of floating-point 
instructions 
to be 
set. If the 
i860 XR microprocessor 
is executing 
in 
single-instruction 
mode 
and encounters 
a floating- 
point instruction 
with the O-bit set, one more 32-bit 
instruction 
is executed 
before 
dual-mode 
execution 
begins. If the i860 XR microprocessor 
is executing 
in 
dual-instruction 
mode and a floating-point 
instruction 


is encountered 
with a clear O-bit, then one more pair 
of instructions 
is executed 
before resuming single-in- 
struction 
mode. Figure 2.13 illustrates 
two variations 
of this 
sequence 
of events: 
one for extended 
se- 
quences 
of dual-instructions 
and one for a single in- 
struction 
pair. 


When 
a 64-bit 
dual-instruction 
pair sequentially 
fol- 
lows a delayed 
branch 
instruction 
in dual-instruction 
mode, both 32-bit instructions 
are executed. 


2.6.3 DUAL-OPERATION 
INSTRUCTIONS 


Special 
dual-operation 
floating-point 
instructions 
(add-and-multiply, 
subtract-and-multiply) 
use 
both 
the 
multiplier 
and 
adder 
units 
within 
the 
floating- 
point unit in parallel to efficiently 
execute 
such com- 
mon 
tasks 
as evaluating 
systems 
of 
linear 
equa- 
tions, performing 
the Fast Fourier 
Transform 
(FFT), 
and performing 
graphics 
transformations. 


The 
instructions 
pfam 
tsrct, fsre2, tdest (add and 
multiply), 
pfsm 
tsrct, fsre2, tdest (subtract 
and mul- 
tiply), 
pfmam 
tscrt, fsre2, tdest (multiply 
and add), 
and pfmsm 
tsrct, fsre2, tdest (multiply and subtract) 
initiate 
both an adder operation 
and a multiplier 
op- 
eration. 
Six operands 
are required, 
but the instruc- 
tion format 
specifies 
only three operands; 
therefore, 
there are special 
provisions 
for speci~,ing 
the oper- 
ands. These 
special 
provisions 
consist 
of: 


• 
Three 
special 
registers 
(KR, KI, and T), that can 
store values 
from 
one dual-operation 
instruction 
and supply 
them 
as inputs 
to subsequent 
dual- 
operation 
instructions. 


1. The constant 
registers 
KR and KI can store the 
value 
of fsret 
and 
subsequently 
supply 
that 
value to the multiplier 
pipeline 
in place of tsrct, 


2. The transfer 
register T can store the last stage 
result 
of 
the 
multiplier 
pipeline 
and 
subse- 
quently 
supply that value to the adder pipeline 
in place of fsre1. 


• 
A four-bit 
data-path 
control 
field 
in the opcode 
(OPC) that specifies 
the operands 
and loading of 
the special 
registers. 


1.0perand-1 
of the multiplier 
can be KR, KI, or 
fsret. 


2. Operand-2 
of the multiplier 
can be fsre2 or the 
last stage result of the adder pipeline. 
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3.0perand-1 
of 
the 
adder 
can 
be fsret, 
the 
T-register, 
or the last stage result of the adder 
pipeline. 


4. Operand-2 
of the adder can be fsre2, the last 
stage 
result 
of the 
multiplier 
pipeline, 
or the 
last stage result of the adder pipeline. 


Figure 
2.14 shows 
all the possible 
data paths 
sur- 
rounding 
the 
adder 
and 
multiplier. 
A ope field 
in 
these 
instructions 
select 
different 
data paths. Table 
8.8 shows 
the various 
encodings 
of the ope field. 
Refer to Dual Operation 
Instructions 
section 
in the 
i860 Microprocessor 
Programmer's 
Reference 
Man- 
ual for pictorial 
description. 


SRC1 
SRC2 
RDEST 


MULnPLlER 
UNIT 


ADDER UNIT 


RESULT 
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Figure 
2.14. Dual-Operation 
Data Paths 


Note 
that 
the 
mnemonics 
pfam.p, 
pfsm.p, 


pfmam.p, 
and pfmsm.p 
are never used as such in 
the assembly 
language; 
these 
mnemonics 
are used 
here 
to 
designate 
classes 
of 
related 
instructions. 
Each value of ope has a unique mnemonic 
associ- 
ated with it. 


2.7 Addressing Modes 


Data access 
is limited to load and store instructions. 
Memory 
addresses 
are computed 
from two fields of 
load and store instructions: 
isrct and isre2. 
1. isre1 either contains 
the identifier 
of a 32-bit inte- 
ger register 
or contains 
an immediate 
16-bit ad- 
dress offset. 


. 2. isre2 always 
specifies 
a register. 
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Table 2.8. Types of Traps 


Type 
Indication 
Caused by 


PSR,EPSR 
FSR 
Condition 
Instruction 


Instruction 
IT 
OF 
Software traps 
trap, Intovr 
Fault 
IL 
Missing unlock 
Any 


Floating 
SE 
Floating-point source exception 
Any M- or A-unit except fmlow 


Point 
Floating-point result exception 
Any M- or A-unit except fmlow, pfgt, 


Fault 
FT 
AO,MO 
overflow 
and pfeq. Reported on any F-P 


AU,MU 
underflow 
instruction plus pst, fst, and 


AI,MI 
inexact result 
sometimes fld, pfld, Ixfr 


Instruction 
IAT 
Address translation exception 
Any 
Access Fault 
during instruction fetch 


Data Access 
Load/store address translation 
Any load/store 
Fault 
exception 
OAP 
Misaligned operand address 
Any load/store 
Operand address matches 
Any load/store 
db register 


Interrupt 
IN 
External interrupt 


Reset 
No trap bits set 
Hardware RESETsignal 


NOTES: 
-These 
cases 
can be distinguished 
by examining 
the operand 
addresses. 
The IL bit of the epsr must be checked 
by the trap handler 
to tell if the bus is currently 
in a locked 
sequence. 


Because either isrc1 or isrc2 may be null (zero), a 
variety of useful addressing modes result: 


offset + register 
Useful for accessing fields within 
a record, where register 
points 
to the beginning of the record. 
Useful for accessing items in a 
stack frame, where register 
is 
r3, the register used for pointing 
to the beginning of the stack 
frame. 


register + register 
Useful for twO-dimensional ar- 
rays or for array access within 
the stack frame. 


register 
Useful as the end result of any 
arbitrary address calculation. 


offset 
Absolute address into the first or 
last 32K of the logical address 
space. 


In addition, the floating-point load and store instruc- 
tions may select autoincrement addressing. In this 
mode isrc2 is replaced by the sum of isrc1 and isrc2 
after performing the load or store. This mode makes 
stepping through arrays more efficient, because it 
eliminates one address-calculation instruction. 


2.8 Traps and Interrupts 


Traps are caused by exceptional conditions detect- 
ed in programs or by external interrupts. Traps 
cause interruption of normal program flow to exe- 


cute a special program known EIS a trap handler. 
Traps are divided into the types shown in Table 2.8. 
Interrupts and traps start execution in single instruc- 
tion mode at virtual address OxFFFFFFOOin supervi- 
sor level (U = 0). 


2.8.1 TRAP HANDLER INVOCATION 


This section applies to traps other than reset. When 
a trap occurs, execution of the current instruction is 
aborted. The instruction is restartable. The proces- 
sor takes the following steps while transferring con- 
trol to the trap handler: 
1. Copies U (user mode) of the psr into PU (previous 


U). 


2. Copies IM (interrupt mode) into PIM (previous IM). 
3. Sets U to zero (supervisor mode). 
4. Sets IM to zero (interrupts disabled). 
5. If the processor is in dual instruction mode, it sets 


DIM; otherwise it clears DIM. 


6. If the processor is in single-instruction mode and 


the next instruction will be executed in dual- 
instruction mode or if the processor is in dual-in- 
struction mode and the next instruction will be 
executed in single-instruction mode, OS is set; 
otherwise, it is cleared. 


7. The appropriate trap type bits in psr are set (IT, 


IN, IAT, OAT, FT). Several bits may be set if the 
corresponding 
trap 
conditions 
occur 
simulta- 


neously. 
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8. An address is placed in the fault instruction regis- 
ter (fir) to help locate the trapped instruction. In 
single-instruction mode, the address in fir is the 
address of the trapped instruction itself. In dual-in- 
struction mode, the address in fir is that of the 
floating-point half of the dual instruction. If an in- 
struction or data access fault occurred, the asso- 
ciated core instruction is the high-order half of the 
dual instruction (fir 
+ 
4). In dual-instruction 
mode, when a data access fault occurs in the ab- 
sence of other trap conditions, the floating-point 
half of the dual instruction will already have been 
executed. 


The processor begins executing the trap handler 
by 
transferring 
execution 
to 
virtual 
address 
OxFFFFFFOO.The trap handler begins execution in 
single-instruction mode. The trap handler must ex- 
amine the trap-type bits in psr (IT, IN, IAT, DAT, FT) 
to determine the cause or causes of the trap. 


2.8.2 INSTRUCTION FAULT 


This fault is caused by any of the following condi- 
tions. In all cases the processor sets the IT bit be- 
fore entering the trap handler. 
1. By the trap instruction. When trap is executed in 
dual-instruction mode, the floating-point compan- 
ion of the trap instruction is not executed before 
the trap is taken. 
2. By the Intovr instruction. The trap occurs only if 
OF in epsr is set when Intovr is executed. The 
trap handler should clear OF before returning. 
When Intovr 
causes a trap in dual-instruction 
mode, the floating-point companion of the Intovr 
instruction is completely executed before the trap 
is taken. 
3. By violation of lock/unlock protocol, explained be- 
low. (Note that trap and Intovr 
should not be 
used within a locked sequence; otherwise, it 
would be difficult to distinguish between' this and 
the prior cases.) 
The lock protocol requires the following sequence 
of activities: 
1. lock 
2. Any load or store instruction that misses the 
cache 
3. unlock 
4. Any load or store instruction (regardless of 
whether it misses the cache) 


There may be other instructions between any of 
these steps. The bus is locked after step 2, and re- 
mains locked until step 4. Step 4 must follow step 1 
by 30 instructions or less, otherwise the instruction 
trap occurs. In case of a trap, IL is also set. If the 
load or store instruction in step 2 hits the cache, the 
sequence is legal, but the bus is not locked. 
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2.8.3 FLOATING-POINT FAULT 
The floating-point fault is reported on floating-point 
instructions, pst, fst, and sometimes fld, pfld, Ixfr. 
The floating-point faults of the i860 XR microproces- 
sor support the floating-point exceptions defined by 
the IEEE standard as well as some other useful 
classes of exceptions. The i860 XR microprocessor 
divides these into two classes: source exceptions 
and result exceptions. The nurnertcs library supplied 
by Intel provides the IEEE standard default handling 
for all these exceptions. 


2.8.3.1 Source Exception Faults 
When used as inputs to the multiplier or adder, all 
exceptional operands, including infinities, denormal- 
ized numbers and NaNs, cause a floating-point fault 
and set SE in the fsr. Source exceptions are report- 
ed on the instruction that initiates the operation. For 
pipelined operations, the pipeline is not advanced. 


The SE value is undefined for faults on fld, pfld, fst, 
pst, and Ixfr instructions when in single-instruction 
mode'or when in dual-instruction mode and the com- 
panion instruction is not a multiplier or adder opera- 
tion. 


2.8.3.2 Result Exception Faults 


The class of result exceptions includes any of the 
following conditions: 
• Overflow. 
The absolute value of the rounded 
true result would exceed the largElstpositive finite 
number in the destination format. 


• Underflow 
(when FZ is clear). Tile absolute val- 
ue of the rounded true result would be smaller 
than the smallest positive finite number in the 
destination format. 


• Inexact result (when TI is set). The result is not 
exactly representable in the destination format. 
For example, the fraction % cannot be precisely 
represented in binary form. This exception occurs 
frequently and indicates that some (generally ac- 
ceptable) accuracy has been lost. 


The point at which a result exception is reported de- 
pends upon whether pipelined operations are being 
used: 
• Scalar (nonplpellned) 
operations. 
Result ex- 
ceptions are reported on the next floating-point, 
fst.x, or pst.x (and sometimes fld, pfld, Ixfr) in- 
struction after the scalar operation. When a trap 
occurs, the last stage of the affl3cted unit con- 
tains the result of the scalar operation, 
• Plpellned operations. 
Result exceptions are re- 
ported when the result is in the last stage and the 
next floating-point, fst.x or pst.x (and sometimes 
fld, pfld, Ixfr) instruction is executed. When a 
trap occurs, the pipeline is not advanced, and the 
last stage results (that caused the trap) remain 
unchanged. 


When no trap occurs (either because 
FTE is clear or 
because 
no exception 
occurred), 
the pipeline 
is ad- 
vanced 
normally 
by the new floating-point 
operation. 


The result-status 
bits of the affected 
unit are unde- 
fined until the point that result exceptions 
are report- 
ed. At this point, the last stage result-status 
bits (bits 
29 ..22 and 16..9 of the fsr) reflect 
the values 
in the 
last stages of both the adder and multiplier. 
For ex- 
ample, 
if the 
last stage 
result 
in the 
multiplier 
has 
overflowed 
and a pipelined 
floating-point 
pfadd is 
started, 
a trap occurs 
and MO is set. 


For scalar operations, 
the RR bits of fsr specify 
the 
register 
in which the result was stored. 
RR is updat- 
ed when the scalar instruction 
is initiated. 
The trap, 
however, 
occurs 
on a subsequent 
instruction. 
Pro- 
grammers 
must 
prevent 
intervening 
stores 
to fsr 
from modifying 
the RR bits. Prevention 
may take one 
of the following 
forms: 


• 
Before 
any store to fsr when a result exception 
may be pending, 
execute 
a dummy floating-point 
operation 
to trigger the result-exception 
trap. 


• 
Always 
read 
from fsr before 
storing 
to it, and. 


mask 
updates 
so 
that 
the 
RR 
bits 
are 
not 
changed. 


For pipelined 
operations, 
RR is cleared 
and the re- 
sult is in the last stage of the pipeline 
of the appro- 
priate unit. The trap handler 
must flush the pipeline, 
saving the results and the status bits. 


In either 
pipelined 
or scalar 
mode, the trap handler 
must 
then 
compute 
the 
trapping 
result. 
In either 
case, 
the result 
has the same 
fraction 
as the true 
result 
and has an exponent 
which 
is the low-order 
bits of the true result. The trap handler 
can inspect 
the 
result, 
compute 
the 
result 
appropriate 
for that 
instruction 
(a NaN or an infinity, 
for example), 
and 
store the correct 
result. The result is either stored in 
the register 
specified 
by RR (if nonzero) 
or (if RR = 
0) the trap handler 
must reload the pipeline with the 
saved results 
and status bits. 


Result 
exceptions 
may be reported 
for both the ad- 
der 
and 
multiplier 
units 
at the 
same 
time. 
In this 
case, the trap handler 
should fix up the last stage of 
both pipelines. 


2.8.4 INSTRUCTION 
ACCESS 
FAULT 


This 
trap 
occurs 
during 
address 
translation 
for in- 


struction 
fetches 
in any of these 
cases: 


• 
The address 
fetched 
is in a page whose 
P (pres- 
ent) bit in the page table is clear (not present). 


• 
The 
address 
fetched 
is in a supervisor 
mode 
page, but the processor 
is in user mode. 


• 
The address fetched 
is in a page whose 
PTE has 
A = 0, and the access 
occurs 
during 
a locked 
sequence 
(i.e., between 
lock 
and unlOCk). 
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Note 
that 
several 
instructions 
are fetched 
at one 
time, 
either 
due to instruction 
prefetching 
or to in- 
struction 
caching. 
Therefore, 
a trap 
handler 
can 
change 
from supervisor 
to user mode and continue 
to execute 
instructions 
fetched 
from 
a supervisor 
page. An instruction 
access 
trap occurs 
only when 
the next group of instructions 
is fetched 
from a su- 
pervisor page (up to eight instructions 
later). If, in the 
meantime, 
the handler 
branches 
to a user page, no 
instruction 
access 
trap occurs. 
No protection 
viola- 
tion results, 
because 
the processor 
does not permit 
data accesses 
to supervisor 
pages while running 
in 
user mode. 


2.8.5 DATA 
ACCESS 
FAULT 


This trap results from an abnormal 
condition 
detect- 
ed during data operand 
fetch 
or store. Such an ex- 
ception 
can be due only to one of the following 
caus- 
es: 


• 
An 
attempt 
is being 
made 
to 
write 
to 
a page 
whose 
D (Dirty) bit is clear. 


• 
A memory 
operand 
is misaligned 
(is not located 
at an address 
that is a multiple 
of the length 
of 
the data). 


• 
The address 
stored 
in the db register 
is equal to 
one of the addresses 
spanned 
by the operand. 


• 
The operand 
is in a not-present 
page. 


• 
An attempt 
is being made from user level to write 
to a read-only 
page or to access a supervisor-lev- 
el page. 


• 
The operand 
was in a page whose 
PTE had A = 
0, and the access 
occurred 
during 
a locked 
se- 
quence. 
(i.e., between 
lock 
and unlOCk.) 


• 
Write protection 
(determined 
by epsr bit WP = 1) 
is violated 
in supervisor 
mode. 


2.8.6 INTERRUPT 
TRAP 


An interrupt 
is an event that is signaled 
from an ex- 
ternal 
source. 
If the processor 
is executing 
with in- 
terrupts 
enabled 
(IM set in the psr), the processor 
sets the interrupt 
bit IN in the psr, and generates 
an 
interrupt 
trap. Vectored 
interrupts 
are implemented 
by interrupt 
controllers 
and software. 


2.8.7 RESET TRAP 


When 
the i860 XR microprocessor 
is reset, 
execu- 
tion begins in single-instruction 
mode at physical 
ad- 
dress OxFFFFFFOO. This is the same address 
as for 
other traps. The reset trap can be distinguished 
from 
other traps by the fact that no trap bits are set. The 
instruction 
cache 
is flushed. 
The bits DPS, Bl, 
and 
ATE in dirbase 
are cleared. 
CS8 is initialized 
by the 
value at the INT pin at the end of reset. The read- 
only fields of the espr are set to identify the proces- 
sor, while the Il, WP, and PBM bits are cleared. 
The 


•• 
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bits U, IM, BR, and BW in psr are cleared, 
as are the 


trap bits FT, OAT, IAT, IN, and IT. All other 
bits of 
psr and all other register 
contents 
are undefined. 


Refer to Table 2.9 for a summary 
of these initial set- 
tings. 


Table 
2.9. Register 
and Cache 
Values 
after 
Reset 


Registers 
Initial 
Value 


Integer Registers 
Undefined 
Floating-Point 
Undefined 
Registers 
psr 
U, IM, BR, BW, FT, OAT, IAT, IN, 
IT = 0; others are undefined 
epsr 
u, WP, PBM, BE = 0; 
Processor 
Type, Stepping 
Number, OCS are read 
only; others are undefined 
db 
Undefined 
dlrbase 
OPS, Bl, ATE = 0; others 
are undefined 
fir 
Undefined 
fsr 
Undefined 
KR,KI, 
T, 
Undefined 
MERGE 


Caches 
Initial 
Value 


Instruction 
Cache 
Flushed 
Data Cache 
Undefined 
TlB 
Flushed 


The 
software 
must 
ensure 
that 
the 
data 
cache 
is 
flushed 
and control 
registers 
are properly 
initialized 
before 
performing 
operations 
that 
depend 
on the 
values of the cache or registers. The data cache has 
no "validity" 
bits, so memory 
accesses 
before 
the 


flush may result in false data cache 
hits. 


Reset code must initialize the floating-point 
pipeline 
state to zero with floating-point 
traps disabled 
to en- 


sure that no spurious 
floating-point 
traps are gener- 
ated. 


After 
a RESET 
the 
i860 XR microprocessor 
starts 
execution 
at supervisor 
level (U =0). Before branch- 


ing to the first user-level 
instruction, 
the RESET trap 
handler 
or subsequent 
initialization 
code 
has to set 
PU and a trap bit so that an indirect 
branch 
instruc- 


tion will copy PU to U, thereby changing 
to user level. 


2.9 Debugging 


The 
i860 
XR 
microprocessor 
supports 
debugging 
with both data and instruction 
breakpoints. 
The fea- 


tures of the i860 XR architecture 
that support debug- 
ging include: 


• 
db (data breakpoint 
register) which permits speci- 
fication 
of a data addresses 
that the i860 XR mi- 


croprocessor 
will monitor. 
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• 
BR (break read) and BW (break write) bits of the 
psr, 
which 
enable 
trapping 
of 
either 
reads 
or 
writes 
(respectively) 
to the address 
in db. 


• 
OAT (data access 
trap) bit of the psr, 
which 
al- 


lows the trap handler 
to determine 
when 
a data 
breakpoint 
was the cause of the trap. 


• 
trap 
instruction 
that 
can be used to set break- 


points 
in code. Any number 
of code breakpoints 
can 
be set. The 
values 
of the 
isrc1 
and 
isrc2 
fields 
help 
identify 
which 
breakpoint 
has 
oc- 
curred. 


• 
IT (instruction 
trap) 
bit of the psr, 
which 
allows 
the 
trap 
handler 
to determine 
when 
a trap 
in- 


struction 
was the cause of the trap. 


3.0 HARDWARE INTERFACE 


In the following 
description 
of hardware 
interface, 


the # symbol 
at the end of a signal name indicates 
that 
the active 
or asserted 
state 
occurs 
when 
the 
signal is at a low voltage. 
When no if is present after 


the signal name, the signal is asserted 
when at the 
high voltage 
level. 


3.1 Signal Description 


Table 3.1 identifies 
functional 
groupings 
of the pins, 


lists every pin by its identifier, 
gives a brief descrip- 


tion of its function, 
and lists some of its characteris- 


tics. All output 
pins are tristate, 
except 
HlOA 
and 
BREQ. 
All 
inputs 
are 
synchronous, 
except 
HOLD 
and INT. 


3.1.1 CLOCK 
(ClK) 


The ClK 
input determines 
execution 
rate and timing 
of the i860 XR microprocessor. 
Timing 
of other 
sig- 


nals is specified 
relative 
to the 
rising 
edge 
of this 
signal. 
The 
i860 
XR 
microprocessor 
can 
utilize 
a 
clock 
rate of 25 MHz, 33.3 
MHz or 40 MHz. 
The 
internal 
operating 
frequency 
is the same as the ex- 
ternal clock. 


3.1.2 SYSTEM 
RESET 
(RESET) 


Asserting 
RESET for atleast 
16 ClK 
periods causes 


initialization 
of the i860 XR microprocessor. 
Refer to 
section 
3.2 "Initialization" 
for more details related to 
RESET. 


3.1.3 BUS HOLD 
(HOLD) 
AND 
BUl; 
HOLD 
ACKNOWLEDGE 
(HlDA) 


These pins are used for i860 XR microprocessor 
bus 
arbitration. 
At some 
clock 
after the HOLD signal 
is 
asserted, 
the i860 XR microprocessor 
releases 
con- 


1860TM XR MICROPROCESSOR 


Table 3.1. Pin Summary 


Pin 
Function 
Active 
Inputl 
Name 
State 
Output 
Execution Control Pins 
CLK 
CLocK 
I 
RESET 
System reset 
High 
I 
HOLD 
Bus hold 
High 
I 
HLDA 
Bus hold acknowledge 
High 
'0 
BREQ 
Bus request 
High 
0 
INT/CS8 
Interrupt, code-size 
High 
I 
Bus Interface Pins 
A31-A3 
Address bus 
High 
0 
BE7#-BEO# 
Byte Enables 
Low 
0 
D63-DO 
Data bus 
High 
I/O 
LOCK# 
Bus lock 
Low 
0 
W/R# 
Write/Read bus cycle 
High/Low 
0 
NENE# 
NExt NEar 
Low 
0 
NA# 
Next Address request 
Low 
I 
READY# 
Transfer Acknowledge 
Low 
I 
ADS# 
ADdress Status 
Low 
0 
Cache Interface Pins 
KEN# 
Cache ENable 
Low 
I 
PTB 
Page Table Bit 
High 
0 
Testability Pins 
SHI 
Boundary Scan Shift Input 
High 
I 
BSCN 
Boundary Scan Enable 
High 
I 
SCAN 
Shift Scan Path 
High 
I 
Intel-Reserved Configuration 
Pins 
CC1-CCO 
Configuration 
High 
I 
Power and Ground Pins 
Vcc 
System power 
Vss 
System ground 


A # after a pin name Indicates 
that the signal IS active 
when 
at the low voltage 
level. 


trol of the local bus and puts all bus interface out- 
puts (except BREQ and HLDA) into a floating state, 
then asserts HLDA-all 
during the same clock peri- 
od. It maintains this state until HOLD is deasserted. 
Instruction execution stops only if required instruc- 
tions or data cannot be read from the on-chip in- 
struction and data caches. 


3.1.4 BUS REQUEST (BREQ) 


This signal is asserted when the i860 XR microproc- 
essor has a pending memory request, even when 
HLDA is asserted. This allows an external bus arbi- 
ter to implement an "on demand only" policy for 
granting the bus to the i860 XR microprocessor. 
BREQ is asserted the clock after the i860 XR micro- 
processor realizes an internal request for the bus. In 
normal operation, BREQ goes low the clock after 
ADS# goes low for the final pending bus cycle. (Re- 
fer to Figure 4.10 for timing information.) During data 
or instuction cache fills, however, BREQ may be 
deasserted for one or more clocks, due to cache 
and TLB logic. 


The time required to acknowledge a hold request is 
one clock plus the number of clocks needed to finish 
any outstanding bus cycles. HOLD is recognized 
even while RESET or LOCK# is asserted. 


When leaving a bus hold, the i860 XR microproces- 
sor deactivates HLDA and, in the same clock period, 
initiates a pending bus cycle, if any. 


Hold is an asynchronous input. 
3.1.5 INTERRUPT/CODE-SIZE (INT/CS8) 


This input allows interruption of the current instruc- 
tion stream. If interrupts are enabled (IM set in psr) 
when INT is asserted, the i860 XR microprocessor 
fetches 
the 
next 
instruction 
from 
address 
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OxFFFFFFOO.To assure that an interrupt is recog- 
nized, INT should remain asserted until the software 
acknowledges the interrupt (by writing, for example, 
to a memory-mapped port of an interrupt controller). 
When the bus is not locked, the maximum time be- 
tween the assertion of INT and the execution of the 
first instruction of the trap handler is ten clocks, plus 
the time for four sets of four pipelined read cycles 
and two sets of four pipelined writes (instruction- 
and data-cache misses and write-back cycles to up- 
date memory), plus the time for twenty nonpipelined . 
read cycles (six TLB misses, with eight refetches 
when the A-bit is zero), plus the time for eight non- 
pipelined writes (updates to the A-bit). 


If the bus is locked from a lock instruction, the INT 
pin is ignored and the INT bit of epsr is always zero. 
The lock instruction can only assert LOCK# for 30- 
33 instructions before trapping. 


If INT is asserted during the clock before the falling 
edge of RESET, the eight-bit code-size mode is se- 
lected. For more about this mode, refer to section 
3.2 "Initialization". 


INT is an asynchronous input. 


3.1.6 ADDRESS PINS (A31-A3) 
AND BYTE 
ENABLES (BEU-BEO#) 


The 29-bit address bus (A31-A3) identifies address- 
es to a 64-bit location. Separate byte-enable signals 
(BE7#-BEO#) 
identify which bytes should be ac- 
cessed within the 64-bit location. In all noncachea- 
ble 
read 
cycles 
(KEN# 
deasserted), 
the 
byte 
enables match the length and address of the re- 
quested data. Cacheable read cycles (KEN# assert- 
ed), however, result in four 64-bit memory cycles to 
fill an entire 32-byte cache line. The BEn# pins acti- 
vated are those that represent the operand of the 
load instruction that caused the line fill, and these 
same BEn# pins remain activated for all four cycles 
of the line fill. All 64 bits must be returned for each 
cycle without regard for the BEn# signals. In all 
write cycles (noncacheable writes as well as cache 
line write-backs) the BEn# 
signals indicate the 
bytes that must be written. 


Instruction fetches (W/R# 
is low) are distinguished 


from data accesses by the unique combinations of 
BE7#-BEO# 
defined in Table 3.2. For an eight-bit 
code 
fetch 
in eight-bit code-size 
(CS8) mode, 
BE2#-BEO# 
are redefined to be A2-AO of the ad- 
dress. In this case BE7#-BE3# 
form the code 
shown in Table 3.2 that identifies an instruction 
fetch. The A2 in the table does not represent a phys- 
ical pin, just a conceptual internal address line value. 
The "x"under A2 for CS8 mode means "not applica- 
ble", or "don't care". All other combinations of byte 
enables indicate data accesses. 
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The address and' byte-enable pins are driven until 
either NA# or REAOY# is asserted. 


3.1.7 DATA PINS (063-00) 


The bus interface has 64 bidirectional data pins 
(063-00) 
to transfer data in eight- to 64-bit quanti- 
ties. Pins 07-00 
transfer the least significant byte; 
pins 063-056 
transfer the most significant byte. 


In read bus cycles, all 64 bits of the data bus are 
latched, even in CS8-mode instruction fetches when 
only the low-order eight bits are used. 


In write bus cycles, the point at which data is driven 
onto the bus depends on the type of the preceding 
cycle. If there was no preceding cycle (i.e. the bus 
was idle), data is driven with the address. If the pre- 
ceding cycle was a write, data is driven as soon as 
READY# is returned from the previous cycle. If the 
preceding cycle was a read, data is driven one clock 
after READY# is returned from the previous cycle, 
thereby allowing time for the bus to be turned 
around. Data continues to be driven until READY# 
for the current cycle is returned. 


3.1.8 BUS LOCK (LOCK#) 


This signal is used to provide atomic (indivisible) 
read-modify-write sequences in multiprocessor sys- 
tems. A multiprocessor bus arbiter must permit only 
one processor a locked access to the address which 
is on the bus when LOCK# first activates. The sys- 
tem must maintain the lock of that location until 
LOCK# deactivates. 


The i860 XR microprocessor coordinates the exter- 
nal LOCK# signal with the software-controlled BL 
bit of the dlrbase 
register. Programmers do not 
have to be concerned about the fact that bus activity 
is not always synchronous with instruction execu- 
tion. LOCK# is asserted with AOS# for the address 
operand of the first load or store instruction execut- 
ed after the BL bit is set by the lock instruction. 
Pending bus cycles are locked according to the val- 
ue of the BL bit when the instruction was executed. 
Even if the BL bit is changed between the time that 
an instruction generates an internal bus request and 
the time that the cycle appears on the bus, the i860 
XR microprocessor still asserts LOCK# for that bus 
cycle. 


If ADS# is active when LOCK# deactivates, then 
that request should complete before the hardware 
relinquishes the lock. If AOS# is not active, the lock- 
ing of the location can immediately end when 
LOCK# deactivates. Of course the simplest arbitra- 
tion hardware can just lock the entire bus against all 
other accesses during LOCK# assertion through 
ROY# of the cycle in which LOCK# goes inactive. 
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Table 3.2. Identifying 
Instruction 
Fetches 


Code 
A2 
BE7# 
BE6# 
BE5# 
BE4# 
BE3# 
BE2# 
BE1# 
BEO# 
Fetch 
Normal 
0 
1 
1 
1 
1 
1 
0 
1 
0 
(Non-CS8) 
Normal 
1 
1 
0 
1 
0 
1 
1 
1 
1 
(Non-CS8) 


CS8 
x 
1 
0 
1 
0 
1 
Low-order address bits 
Mode 


When the BL bit is deasserted with the unlock in- 
struction, LOCK# is deasserted with the next load 
or store but after any pending bus cycles. Between 
locked sequences, at least one cycle of no LOCK# 
is guaranteed by the behavior of the unlock instruc- 


. tion. LOCK# deassertion may occur independently 
of ADS# for the case of a trap or a cache hit after 
unlock. 


The i860 XR microprocessor also asserts LOCK# 
during TLB miss processing for updates of the ac- 
cessed bit in page-table entries. The maximum time 
that LOCK# can be asserted in this case is five 
clocks plus the time required to perform a read-mod- 
ify-write sequence. Instruction fetches do not alter 
the LOCK# pin. 


Between lock and unlock instructions, the INT pin is 
ignored and the INT bit of epsr is zero when read by 
Id.c epsr. The time that interrupts are disabled is 
limited by the lock protocol outlined in Section 2.8.2. 


3.1.9 WRITE/READ BUS CYCLE (W/R#) 


This pin specifies whether a bus cycle is a read 
(LOW) or write (HIGH) cycle. It is driven until either 
NA# or READY# is asserted. 


3.1.10 NEXT NEAR (NENE#) 


This signal allows higher-speed reads and writes in 
the case of consecutive reads and writes that ac- 
cess static column or page-mode DRAMs. The i860 
XR microprocessor asserts NENE# when the cur- 
rent address is in the same DRAM page as the pre- 
vious bus cycle. The"i860 XR microprocessor deter- 
mines the DRAM page size by inspecting the DPS 
field in the dlrbase 
register. The page size can 
range from 29 to 216 64-bit words. supporting DRAM 
sizes from 256K x 1, 256K x 4, and up. NENE# is 
never asserted on the next bus cycle after HLDA is 
deasserted. 


3.1.11 NEXT ADDRESS REQUEST (NA#) 


NA # makes address pipelining possible. The sys- 
tem asserts NA# for at least one clock to indicate 
that it is ready to accept the next address from the 
i860 XR microprocessor. NA# may be asserted be- 


fore the current cycle ends. (If the system does not 
implement pipelining, NA# does not have to be acti- 
vated.) The i860 XR microprocessor samples NA# 
every clock, starting one clock after the prior activa- 
tion of ADS#. When NA# is active, the i860 XR 
microprocessor is free to drive address and bus-cy- • 
cle definition for the next pending bus cycle. The 
i860 XR microprocessor remembers that NA# was 
asserted when no internal request is pending; there- 
fore, NA# can be deactivated after the next rising 
edge of the CLK signal. Up to three bus cycles can 
be outstanding simultaneously. 


3.1.12 TRANSFER ACKNOWLEDGE (READY #) 


The system must assert the READY# signal during 
read cycles when valid data is on the data pins and 
during write cycles when the system has accepted 
data from the data pins. READY# must be asserted 
for at least one clock. Sampling of READY# begins 
in the clock after an ADS# or in the second clock 
after a prior READY#. 


3.1.13 ADDRESS STATUS (ADS#) 


The i860 XR microprocessor asserts ADS# during 
the first clock of each bus cycle to identify the clock 
period during which it begins to assert outputs on 
the address bus. This signal is held active for one 
clock. 


3.1.14 CACHE ENABLE (KEN#) 


The i860 XR microprocessor samples KEN# to de- 
termine whether the data being read for the current 
cache-miss cycle is to be cached. This pin is inter- 
nally NORed with the CD and WT bits to control 
cacheability on a page by page basis (refer to Table 
3.3). 


If the address is one that is permitted to be in the 
cache, KEN# must be continuously asserted during 
the sampling period starting from the second rising 
clock edge after ADS# 
is asserted, through the 
clock NA# or READY# is asserted. The entire 64 
bits of the data bus will be used for the read, regard- 
less of the state of the byte-enable pins. Three addi- 
tional 64-bit bus cycles will be generated to fill the 
rest of the 32-byte cache block. 
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If KEN11 is found deasserted at any clock from the 
clock after ADS 11 through the clock of the first NAil 
or READY11, the data being read will not be cached 
and two scenarios can occur: 1) if the cycle is due to 
data-cache miss, no subsequent cache-fill cycles 
will be generated; 2) if the cycle is due to an instruc- 
tion-cache miss, additional cycle(s) will be generat- 
ed until the address reaches a 32-byte boundary. To 
avoid caching a line, external hardware must deas- 
sert KEN11 
during or before the first 
NA11 
or 
READYII. 


3.1.15 PAGE TABLE BIT (PTB) 


Depending on the setting of the PBM (page-table bit 
mode) bit of the epsr, the PTB reflects the value of 
either the CD (cache disable) bit or the WT (write 
through) bit of the page-table entry used for the cur- 
rent cycle. When paging is disabled, PTB remains 
inactive. 


Table 3.3. Cacheablllty based on 
KEN 11 and CD OR WT 


CDORWT 
KEN 11 
Meaning 


0 
0 
Cacheable access 
0 
1 
Noncacheable access 
1 
0 
Noncacheable page 
1 
1 
Noncacheable page 


3.1.16 BOUNDARY SCAN SHIFT INPUT (SHI) 


This pin is used with the testability features. Refer to 
section 3.3. 
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3.1.17 BOUNDARY SCAN ENABLE (BSCN) 


This pin is used with the testability features. Refer to 
section 3.3. 


3.1.18 SHIFT SCAN PATH (SCAN) 


This pin is used with the testability features. Refer to 
section 3.3. 


3.1.19 CONFIGURATION (CC1-CCO) 


These two pins are reserved by Intel. Strap both pins 
LOW. 


3.1.20 SYSTEM POWER (Vcc) AND GROUND 
(Vss) 


The i860 XR microprocessor has 48 pins for power 
and ground. All pins must be connected to the ap- 
propriate low-inductance power and ground signals 
in the system. 


3.2 Initialization 


Initialization of 
the 
i860 
XR 
microprocessor 
is 
caused by assertion of the RESET signal for at least 
16 clocks. Table 3.4 shows the status of output pins 
during the time that RESET is asserted. Note that 
HOLD requests are honored during RESET and that 
the status of output pins depends on whether a 
HOLD request is being acknowledged. 


Table 3.4. Output Pin Status during Reset 


Pin Value 


Pin Name 
HOLD 
HOLD 
Not 
Acknowledged 
Acknowledged 


ADS#, 
LOCK# 
HIGH 
Tri-State 
OFF 


W/R#, 
PTB 
LOW 
Tri·State 
OFF 


BREQ 
LOW 
LOW 


HLDA 
LOW 
HIGH 


063-00 
Tri-State 
OFF 
Tri-State 
OFF 


A31-A3, 
BE7#-BEO#, 
Undefined 
Tri-State 
OFF 
NENE# 


After a reset, the i860 XR microprocessor 
begins ex- 
ecuting 
at physical 
address 
OxFFFFFFOO. The pro- 
gram-visible 
state of the i860 XR microprocessor 
af- 


ter reset is detailed 
in section 
2.8.7. 


Eight-bit 
code-size 
mode is selected 
when INT ICS8 


is asserted 
during the clock 
before 
the falling 
edge 
of 
RESET. 
While 
in eight-bit 
code-size 
mode, 
in- 
struction 
cache 
misses 
are byte reads 
(transferred 
on 
07-00 
of 
the 
data 
bus) 
instead 
of 
eight-byte 


reads. This allows the i860 XR microprocessor 
to be 
bootstrapped 
from 
an eight-bit 
EPROM. 
For these 
code 
reads, 
byte enables 
BE2#-BEO# 
are rede- 
fined to be the low order three 
bits of the address, 
so that a complete 
byte address 
is available. 
These 


reads 
update 
the instruction 
cache 
if KEN # 
is as- 
serted (refer to section 
3.1.14) and are not pipe lined 
even if NA # is asserted. 
While in this mode, instruc- 


tions must reside in an eight-bit 
wide memory, 
while 
data must reside in a separate 
64-bit wide memory. 
After the code has been loaded 
into 64-bit memory, 


initialization 
code can initiate 64-bit code fetches 
by 
clearing 
the CS8 bit of the dlrbase 
register 
(refer to 
section 
2). Once 
eight-bit 
code-size 
mode 
is dis- 
abled by software, 
it cannot 
be reenabled 
except 
by 


resetting 
the i860 XR microprocessor. 


3.3 Testability 


The i860 XR microprocessor 
has a boundary scan 
mode that may be used in component- 
or board-lev- 
el testing 
to test 
the 
signal 
traces 
leading 
to and 


from 
the 
i860 
XR microprocessor. 
Boundary 
scan 
mode provides 
a simple serial interface 
that makes it 
possible 
to test 
all signal 
traces 
with 
only 
a few 
probes. 
Probes 
need 
be connected 
only 
to CLK, 
BSCN, SCAN, SHI, BREQ, RESET, and HOLD. 


The 
pins 
BSCN 
and 
SCAN 
control 
the 
boundary 
scan 
mode (refer to Table 
3.5). When 
BSCN is as- 
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serted, the i860 XR microprocessor 
enters boundary 


scan mode on the next rising clock edge. Boundary 
scan 
mode 
can be activated 
even while 
RESET 
is 


active. When BSCN is deasserted 
while in boundary 


scan 
mode, 
the 
i860. XR 
microprocessor 
leaves 


boundary 
scan mode on the next rising clock 
edge. 


After leaving boundary 
scan mode, the internal 
state 
is undefined; 
therefore, 
RESET should 
be asserted. 


Table 3.5. Test Mode Selection 


BSCN 
SCAN 
Testability 
Mode 


La 
La 
No testability 
mode selected 


La 
HI 
(Reserved 
for Intel) 


HI 
La 
Boundary 
scan mode, normal 


HI 
HI 
Boundary 
scan mode, shift 


SHI as input; BREQ as 
output 


For testing purposes, 
each signal pin has associated 
with it an internal 
latch. Table 
3.6 indentifies 
these 


latches 
by name and classifies 
them 
as input, 
out- 


put, or control. 
The input 
and output 
latches 
carry 


the name of the corresponding 
pins. 


Table 3.6. Test Mode Latches 


Input 
Output 
Associated 


Control 
Latch 
Latch 
Latch 


SHI 
BSCN 
SCAN 
RESET 
00-063 
00-063 
DATAt 


CC1-CCO 
A31-A3 
ADDRt 


NENE# 
NENEt 


PTB# 
PTBt 


W/R# 
W/Rt 


ADS# 
ADSt 
HLDA 
LOCK # 
LOCKt 


READY# 
KEN# 
NA# 
INT/CS8 
HOLD 
BE7#-BEO# 
BEt 


BREQ 


Within boundary 
scan mode the i860 XR microproc- 


essor 
operates 
in one 
of 
two 
submodes: 
normal 


mode or shift mode, depending 
on the value of the 


SCAN input. A typical 
test sequence 
is ... 
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1. Enter shift mode to assign values to the latches 
that correspond with the pins. 
2. Enter normal mode. In normal mode the i860 XR 
microprocessor transfers the latched values to 
the output pins and latches the values that are 
being driven onto the input pins. 
3. Reenter shift mode to read the new values of the 
input pins. 


3.3.1 NORMAL MODE 


When SCAN is deasserted, the normal mode is se- 
lected. 
For 
each 
input 
pin 
(RESET, 
HOLD, 
INT/CS8, 
NA#, 
READY#, 
KEN#, 
SHI, BSCN, 
SCAN, CC1, and CCO),the corresponding latch is 
loaded with the value that is being driven onto the 
pin. 


The tristate output pins (A31-A3, 
BE7#-BEO#, 
W/R#, 
NENE#, ADS#, LOCK#, and PTB) are en- 
abled by the control latches ADDRt (for A31-A3), 
BEt, W/Rt, 
NENEt, ADSt, LOCKt, and PTBt.lf a con- 
trol latch is set, the corresponding output latches 
drive their output pins; otherwise the pins are not 
driven. 


The I/O pins (D63-DO) are enabled by the control 
latch DATAt, which is similar to the other control 
latches. In addition, when DATAt is not set, the data 
pins are treated as input pins and their values are 
latched. 


3.3.2 SHIFT MODE 


When SCAN is asserted, the shift mode is selected. 
In shift mode, the pins are organized into a boundary 
scan chain. The scan chain is configured as a shift 
register that is shifted on the rising edge of CLK. The 
SHI pin is connected to the input of one end of the 
boundary scan chain. The value of the most signifi- 
cant bit of the scan chain is output on the BREQ pin. 
To avoid glitches while the values are being shifted 
along the chain, the tester should assert both the 
RESET and HOLD pins. Then all tristate outputs are 
disabled. The order of the pins within the chain is 
shown in Figure 3.1. 


A tester causes entry into this mode for one of two 
purposes: 
1. To assign values to output latches to be driven 
onto output pins upon subsequent entry into nor- 
mal mode. 
2. To read the values of input pins previously latched 
in normal mode. 


4.0 BUS OPERATION 


A bus cycle begins when ADS# 
is activated and 
ends when READY#. is sampled active. READY# is 
sampled one clock after assertion of ADS# and 
thereafter until it becomes active. New cycles can 
start as often as every other clock until three cycles 
are outstanding. A bus cycle is considered outstand- 
ing as long as READY# has not been asserted to 
terminate that cycle. After READY# becomes ac- 
tive, it is not sampled again for the following (out- 
standing) cycle until the second clock after the one 
during which it became active. READY# is assumed 
to be inactive when it is not sampled. 


With regard to how a bus cycle is generated by the 
i860 XR microprocessor, there are two types of cy- 
cles: pipelined and nonpipelined. Both types of cy- 
cles can be either read or write cycles. A pipelined 
cycle is one that starts while one or two other bus 
cycles are outstanding. A nonpipelined cycle is one 
that starts when no other bus cycles are outstand- 
ing. 


4.1 Pipelining 


A m-n read or write cycle is a cycle with a total cycle 
time of m clocks and a cycle-to-cycle time of n 
clocks (m ~ n). Total cycle time extends from the 
clock in which ADS# is activated to the clock in 
which READY# becomes active, whereas cycle-to- 
cycle time extends from the time that READY# is 
sampled active for the previous cycle to the time 
that it is sampled active again for the current cycle. 
When m = n, a nonpipelined cycle is implied; m > n 
implies a pipelined cycle. 


I 
2 
3 
4 
5 
6 
69 
SHI 
BSCN 
SCAN 
--+ 
RESET 
--+ 
DATAt 
DO 
--+ 
--+ 
063 
--+ 


70 
71 
72 
lOO 
101 
102 
103 
104 
CCl 
--+ 
cco 
--+ 
A31 
--+ 
--+ 
A3 
--+ 
ADDRt 
NENEt 
NENE# 
--+ 
PTBt 
--+ 


105 
106 
107 
lOB 
109 
110 
III 
112 
113 
PTB# 
--+ 
W/Rt 
--+ 
W/R# 
ADSt 
ADS# 
--+ 
HLDA 
--+ 
LOCKt 
--+ 
LOCK# 
READY # 
--+ 


114 
lIS 
116 
117 
t ts 
119 
126 
127 
KEN# 
NA# 
--+ 
INT/CSB 
HOLD 
--+ 
BEt 
--+ 
BE7# 
--+ 
BEO# 
--+ 
BREa 


Figure 3.1. Order of Boundary Scan Chain 
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Pipelining may occur for the next bus cycle any time 
the current bus cycle requires more than two clock 
periods to finish (m > 2). If a bus request is pending, 
the next cycle will be initiated when NA# is sampled 
active, even if the current cycle has not terminated. 
In this case, pipelining occurs. NA# is not recog- 
nized unitl after ADS# has become inactive. 


To allow high transfer rates in large memory sys- 
tems, two-level pipelining is supported (i.e., there 
may be up to three cycles in progress at one time). 
Pipelining enables a new word of data to be trans- 
ferred every two clocks, even though the total cycle 
time may be up to six clocks. 
. 


4.2 
Bus State Machine 


The operation of the bus is described in terms of a 
bus state machine using a state transition diagram. 
Figure 4.1 illustrates the i860 XR microprocessor 
bus state machine. A bus cycle is composed of two 
or more states. Each bus state lasts for one Cl.K 
period. 


The i860 XR microprocessor supports up to two lev- 
els of address pipelining. Once it has started the first 
bus cycle, it can generate up to two more cycles as 
long as READY# remains inactive. To start a new 
bus cycle while other cycles are still outstanding, 
NA# must be active for at least one clock cycle 
starting with the clock after the previous ADS#. 
NA# is latched internally. 


States Tj and Tjk' for j = {1,2,31 and k = {1,21, are 
used to describe the state of the i860 XR microproc- 
essor Bus State Machine. Index j indicates the num- 
ber of outstanding bus cycles while index k distin- 
guishes the intermediate states for the j-th outstand- 
ing cycle. Therefore there can be up to three out- 
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standing cycles, and there are two possible interme- 
diate states for each level of pipelining. Tj1 is the 
next state after Tj, as long as j cycles are outstand- 
ing. Tj2 is entered when NA# is active but the i860 
XR microprocessor is not ready to start a new cycle. 


Five conditions have to be met to start a new cycle 
while one or more cycles are already pending: 
1. READY# inactive 


2. NA# having been active 
3. An internal request pending (BREQ active) 
4. HOLD not active 
5. Fewer than three cycles outstanding 


Note that BREQ is asserted on the clock after the 
i860 XR microprocessor realizes an internal request 
for the bus. 


Upon hardware RESET, the bus control logic enters 
the idle state TI and awaits an internal request for a 
bus cycle. If a bus cycle is requested while there is 
no hold request from the system, a bus cycle begins, 
advancing to state T1. On the next cycle, the state 
machine automatically advances to state T11. If 
READY# is active in state T11' the bus control logic 
returns either to TI' if no new cycle is started, or to 
T1, if a new cycle request is pending internally. In 
fact, if an internal bus request is pending each time 
READY# is active, the state machine continues to 
cycle between T11and T1. 


However, if READY# is not active but the next ad- 
dress request is pending (as indicated by an active 
NA#), the state machine advances either to state 
T2 (if an internal bus request is pending, signifying 
that two bus cycles are now outstanding), or to state 
T12 (if no bus internal request is pending, signifying 
NA# has been found active). Transitions from state 
T12 are similar to those from T11. 
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NOTES: 
READY# 
Once 
READY# 
has been sampled 
active, 
it is 
not sampled 
again until two clocks 
later 
Not sampled 
during ADS# 
active 
clock 
Active 
in r., T2 and T3 
Active 
in TH 
HOLD 
in this figure 
is the internally 
synchro- 
nized version 
of the external 
signal 
HOLD 
Internal 
Bus Request 
Pending 
(BREQ 
assert- 
ed) 


NA# 
ADS# 
HLDA 
HOLD 


REQUEST 


HOLD 
ASSERTED 
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Figure 4.1. Bus State Machine 


If two bus cycles are already outstanding (as indicat- 
ed by T2k for k = !1,2J) and NA 11 is latched active 
but READY 11 is not active, one more bus request 
causes entry into state T3. Transitions from this 
state are similar to those from T2. 


machine continues to oscillate between Tj1 and Tj, 
for j = !2,31. 


When NA 11 is sampled active while there is a pend- 
ing bus request, ADSII 
is activated in the next clock 


period (provided no more than two cycles are al- 
ready outstanding). 
In general, if there is an internal bus request each 
time both READYII 
and NAil 
are active, the state 
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Internal pending bus requests start new bus cycles 
only if no HOLD request has been recognized. THis 
entered from the idle state TI. T11.and T12. HLDA is 
active in this state. There is a one clock delay to 
synchronize the HOLD input when the signal meets 
the respective minimum setup and hold time require- 
ments. The state machine uses the synchronized 
HOLD to move from state to state. 


4.3.1 NONPIPELINED 
READ CYCLES 


4.3 
Bus Cycles 


A read cycle begins with the clock in which ADS;1lis 
asserted. The i860 XR microprocessor begins driv- 
ing the 
address 
during this 
clock. 
It 
samples 
READY;1Ifor active state every clock after the first 
clock. A minimumof two clocks is required per cycle. 
Data is latched when READY;1Iis found active when 
sampled at the end of a clock period. Figure 4.2 il- 
lustrates nonpipelined read cycles with zero wait 
states. 


Figures 4.2 through 4.10 illustrate combinations of 
bus cycles. 


CYCLE 1 
CYCLE 2 
CYCLE 3 


NON-PIPELlNED 
NON-PIPELlNED 
NON-PIPELlNED 
READ 
READ 
READ 
(2-2) 
(2-2) 
(2-2) 


T, 
T" 
T, 
T" 
T, 
T" 


ClK 


NAil 


ADSII 


A31-A3. 
W/RII. 
BEnll. NENEII. 
PTB 


READYII 


063-00 


240296-'3 


Figure 4.2. Fastest 
Read Cycles 
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CYCLE 1 
CYCLE 2 
CYCLE 3 


NON-PIPELlNEO 
NON-PIPELlNEO 
NON-PIPELlNEO 
WRITE 
WRITE 
WRITE 
(2-2) 
(2-2) 
(2-2) 


T, 
T" 
T, 
T" 
T, 
T" 


CLK 


AOSI/ 


A31-A3. 
W/RI/. 
BEnl/. NENEI/. 
PTB 


NAI/ 


REAOYI/ 
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Figure 4.3. Fastest Write Cycles 


4.3.2 NONPIPELlNED 
WRITE 
CYCLES 
wait state, 
as illustrated 
in Figure 4.4. Because 
the 
device 
being read might still be driving the data bus 
during 
the first clock 
of the write 
cycle, 
there 
is a 
potential 
for bus contention. 
To help avoid such con- 


tention, 
the i860 XR microprocessor 
does 
not drive 
the data bus until the second 
clock 
of the write cy- 
cle. The wait state 
is required 
to provide 
the addi- 
tional time necessary 
to terminate 
the write cycle. 
In 
other 
read-write 
combinations. 
the 
i860 
XR micro- 
processor 
does not require a wait state. 


The 
ADS# 
and 
READY# 
activity 
for write 
cycles 
follows 
the same 
logic as that 
for read cycles, 
as 
Figure 4.3 illustrates 
for back-to-back. 
nonpipelined 
write cycles with zero wait-states. 


The fastest write cycle takes only two clocks to com- 
plete. However, 
when a read cycle immediately 
pre- 
cedes 
a write cycle. 
the write cycle 
must contain 
a 
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CYCLE 
1 
CYCLE 
2 
CYCLE 
3 


NON-PIPELlNED 
NON-PIPElINED 
NON-PIPELlNED 
READ 
WRITE 
READ 
(2-2) 
(3-3) 
(2-2) 


r, 
Tll 
Tl 
Tll 
Tll 
r, 
Tll 


ClK 


ADSII 


A31-A3. 
W/RII. 
BEnll. 
NENEII. 
PTB 


NAil 


READYII 
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Figure 4.4. Fastest 
Read/Write 
Cycles 
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CYCLE 4 


PIPELlNED 
WRITE 
(6-2) 


CYCLE 1 


HOH-PIPElIHED 
READ 
(5-5) 


CYCLE 2 


PIPEllHED 
READ 
(5-2) 


CYCLE 3 


PIPEllHED 
WRITE 
(6-3) 


ClK 


ADSII 


HA' 


READY' 
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Figure 4.5. Pipellned 
Read Followed 
by Plpelined 
Write 
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CYCLE 1 


NON-P 
PELlNED 
WfUTE 
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ClK 


ADS/I 


A31-A3, 
W/R/I, 
BEn/l, 
NENE/I, 
PTB 


NA/I 


READY/I 


CYCLE 4 


PIPELlNED 
READ 
(5-2) 


CYCLE 2 


PIPELlNED 
WRITE 
(5-2) 


CYCLE 3 


PIPELlNED 
READ 
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4.3.3 PIPELINED 
READ AND WRITE 
CYCLES 


Figure 4.6. P'lpellned Write Followed by Plpellned 
Read 


Figures 4.5 and 4.6 illustrate combinations of non- 
pipelined and pipelined read and write cycles. The 
following description applies to both diagrams. While 
Cycle 1 is still in progress, two new cycles are initiat- 
ed. By the time READY# first becomes active, the 
state machine has moved through states T1, T11, 
T2, T21, and T3. Cycles 3 and 4 show how activating 
READY# terminates the correspondinq outstanding 
cycle, and yet activating NA# while there is an inter- 
nal request pending adds a new outstanding cycle. 


In Figure 4.5, Cycle 3 is a write cycle following a read 
cycle; therefore, one wait state must be inserted. 
The i860 XR microprocessor does not drive the data 
bus until one clock after the read data is returned 
from the preceding read cycle. During Cycles 3 and 
4, the state machine oscillates between states T3 
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and T31 maintaining full bus capacity (two levels of 
pipelining; three outstanding cycles). Cycles 2, 3, 
and 4 in Figure 4.6 are 5-2 cycles; i.e. each requires 
a total cycle time of five clocks while the throughput 
rate is one cycle every two clocks. 


Figure 4.7 illustrates in a more general manner how 
the NA# 'signal controls pipelining. Cycle 1 is a 2-2 
cycle, the fastest possible. The next cycle cannot be 
started any earlier; therefore, there is no need to 
activate NA# to start the next cycle early. Cycle 2, a 
3-3 read, is different. Cycle 3 can be started during 
the third state (a wait state) of Cycle 2, and NA# is 
asserted to accomplish this. 
. 


NA# is not activated following the ADS# clock of 
Cycle 3, thereby allowing Cycle 3 to terminate be- 
fore the start of Cycle 4. As a result, Cycle 4 is a 
nonpipelined cycle. 
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CYCLE 3 


PIPELlNED 
READ 
(3-2) 


CYCLE 4 


NON-PIPELlNED 
READ 
(2-2) 


CYCLE 
1 


NON-PIPELlNED 
READ 
(2-2) 


CYCLE 2 


NON-PIPELlNED 
READ 
(3-3) 


IDLE 


ClK 


ADS# 


A31-A3. 
W/R#. 
BEn#. 
NENE#. 
PTB 
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READY# 
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Figure 4.7. Plpellnlng Driven by NA# 
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Figure 4.8. NA# Active with No Internal Bus Request 
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ClK 
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When there is no internal bus request, activating 
NA11 does not start a new cycle; the i860 XR micro- 
processor, however, remembers that NAil has been 
activated. Figure 4.8 illustrates the situation where 
NA11 is active but no internal bus request is pending. 
NAil is activated when two cycles are outstanding. 
Because there is no internal request pending until 
after one idle state, no new bus cycle is started dur- 
ing that period. 


4.3.4 LOCKED CYCLES 


Figure 4.9. Locked Cycles 


4.3.5 HOLD AND BREQ ARBITRATION CYCLES 


The HOLD, HLDA, and BREQ signals permit bus ar- 
bitration between the i860 XR microprocessor and 
another bus master. 


See Figure 4.10. When HOLD is asserted, the i860 
XR microprocessor does not relinquish control of 
the bus until all outstanding cycles are completed. If 
HOLD were asserted one clock earlier, the last i860 
XR microprocessor bus cycle before HLDA would 
not be started. 


The LOCK11 signal is asserted when the current bus 
cycle is to be locked with the next bus cycle. Asser- 
tion of LOCKII may be initiated by a program's set- 
ting the BL bit of the dlrbase register using the lock 
instruction (refer to section 2) or by the i860 XR mi- 
croprocessor itself during page table updates. 


HOLD is sampled at the end of the clock in which it 
is activated. Recommended setup and hold times 
must be met to guarantee sampling one clock after 
external HOLD activation. When HOLD is sampled 
active, a one clock delay for internal synchronization 
follows. Likewise when HOLD is deasserted, there is 
a one-clock delay for internal synchronization before 
HLDA is deasserted. The outputs (except HLDA and 
BREQ) float when HLDA is asserted. 


In Figure 4.9, the first read cycle is to be locked with 
the following write cycle. If there were idle states 
between the cycles, the LOCKII signal would re- 
main asserted. This is the case for a read/modify/ 
write operation. Cycle 3 is not locked because 
LOCK 11 is no longer asserted when Cycle 2 starts. 
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HOLD 
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Figure 4.10. HOLD, HLDA, and BREQ 


If, during a HOLD cycle, an internal bus request is 
generated, BREQ is activated even though HLDA is 
asserted. It remains active at least until the clock 
after ADS# is activated for the requested cycle. 


SET. If INT/CS8 is sampled activu,the i860 XR mi- 
croprocessor enters CS8 mode. No inputs (except 
for HOLD and INTICS8) are sampled during RESET. 


4.4 Bus States During RESET 


Figure 4.11 shows how INTICS8 is sampled during 
the clock period just before the falling edge of RE- 


Note that, because HOLD is recognized even while 
RESET is active, the HLDA output signal may also 
become active during RESET. R(3ferto Table 3.4 
"Output Pin Status during Reset". 


I, 
~ 16 elK. 


CLK 


RESET 


IHT/CSB 


OTHER 
IHPUTS 
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Figure 4.11. Reset Activities 
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5.0 MECHANICAL 
DATA 


Figures 5.1 and 5.2 show the locations 
of pins; Tables 
5.1 and 5.2 help to locate pin identifiers. 
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Figure 5.1. Pin Configuration-View 
from Top Side 
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Figure 5.2. Pin Configuration-View 
from Pin Side 
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Table 5.2. Pin Cross Reference by Pin Name 


Signal 
Location 
Signal 
Location 


A3 
Q5 
A4 
A5 
A5 
Q4 
A6 
Q3 
A7 
P3 
A8 
P2 
A9 
N3 
A10 
N2 
A11 
M3 
A12 
N1 
A13 
M2 
A14 
l3 
A15 
l2 
A16 
K3 
A17 
M1 
A18 
K2 
A19 
L1 
A20 
J2 
A21 
K1 
A22 
H3 
A23 
J1 
A24 
H2 
A25 
H1 
A26 
G3 
A27 
G2 
A28 
F2 
A29 
G1 
A30 
F3 
A31 
F1 
AOS# 
S7 
BEO# 
Q12 
BE1# 
A12 
BE2# 
A11 
BE3# 
S11 
BE4# 
Q11 
BE5# 
S10 
BE6# 
Q10 
BE7# 
A10 
BAEQ 
Q7 
BSCN 
Q13 
CCO 
E2 
CC1 
E3 
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J3 
00 
A14 
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Q15 
03 
P15 
04 
N16 
05 
N15 
06 
M17 
07 
M15 
08 
l16 
09 
M16 
010 
L17 
011 
L15 
012 
K17 
013 
K15 
014 
J16 
015 
K16 
016 
J17 
017 
J15 
018 
H17 
019 
,H16 
020 
G17 
021 
H15 
022 
G16 
023 
G15 
024 
F17 
025 
F16 
026 
E17 
027 
F15 
028 
E16 
029 
E15 
030 
016 
031 
015 
032 
C15 
033 
C14 
034 
B13 
035 
C13 
036 
B12 
037 
C12 
038 
A12 
039 
C11 
040 
A11 


041 
B11 
042 
B10 
043 
C10 
044 
A10 
045 
'.' .B9 
046 
A9 
047 
C9 
048 
A8 
049 
B8 
050 
A7 
051 
C8 
052 
A6 
053 
B7 
054 
B6 
055 
C7 
056 
A5 
057 
C6 
058 
B5 
059 
B4 
060 
C3 
061 
C5 
062 
03 
063 
C4 
HlOA 
A7 
HOlO 
Q9 
INT/CS8 
S9 
KEN# 
A8 
lOCK# 
S8 
NA# 
A9 
NENE# 
A6 
PTB 
Q6 
AEAOY# 
Q8 
AESET 
S13 
SCAN 
A13 
SHI 
S12 
VCC 
A1 
VCC 
A3 
VCC 
A13 
VCC 
A15 
vcc 
A17 
vex: 
B2 
vcc 
B14 


Signal 
Location 


vcc 
B16 
vcc 
C1 


vcc 
C17 
vcc 
02 
vcc 
E1 
vcc 
N17 


vcc 
P16 
vcc 
Q1 


vcc 
Q17 
vcc 
A2 
vcc 
A4 
vcc 
A16 


Vee·············S1 
• 
vcc 
S3 
vcc 
S5 
vcc 
S15 
vcc 
S17 
Vss 
A2 
Vss 
A4 
Vss 
A14 
Vss 
A16 
Vss 
B1 


Vss 
B3 
Vss 
B15 
Vss 
B17 
Vss 
C2 
Vss 
C16 
vss 
, 
01 
vss 
017 
Vss 
P1 
Vss 
P17 
Vss 
Q2 
Vss 
Q16 
vss 
A1 
Vss 
A3 
Vss 
A15 
Vss 
A17 
Vss 
S2 
Vss 
S4 
Vss 
S14 
Vss 
S16 
W/A# 
S6 
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Table 5.3. Ceramic 
PGA Package 
Dimension 
Symbols 


Letter or 
Description 
of ,?Imenslons 
Symbol 


A 
Distance 
from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


A2 
Distance from base plane to highest point of body 


A3 
Distance from seating plane to bottom of body 


B 
Diameter 
of terminal 
lead pin 


D 
Largest overall package 
dimension 
of length 


D1 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing between 
true lead position 
centerlines 
l 
Distance 
from seating plane to end of lead 


S1 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. Controlling 
dimension: 
millimeter. 
2. Dimension 
"e-" 
("e") 
is non-cumulative. 
3. Seating 
plane (standoff) 
is defined 
by P.C. board 
hole size: 0.0415-0.0430 inch. 
4. Dimensions 
"S", 
"S1" 
and "C" 
are nominal. 


5. Details 
of Pin 1 identifier 
are optional. 
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Family: Ceramic Pin Grid Array Package 


Symbol 
Mllllmeters 
Inches 


Mln 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
0.140 
0.180 


A1 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
2.79 
3.56 
SOLID LID 
0.110 
0.140 
SOLID LID 


A3 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


D 
44.07 
44.83 
1.735 
1.765 


D1 
40.51 
40.77 
1.595 
1.605 


61 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
168 
11 of Pins 
168 
11 of Pins 


S1 
1.52 
2.54 
0.060 
0.100 


ISSUE 
IWS 
REVX 
7/15/88 


Figure 
5.3.168 
Lead Ceramic 
PGA Package 
Dimensions 


6.0 PACKAGE 
THERMAL 
SPECIFICATIONS 


The i860 XR microprocessor 
is specified 
for opera- 
tion when 
TC is within 
the range 
Cif O°C-85°C. 
Tc 
may be measured 
in any environmont 
to determine 
whether 
the i860 XR microprocessor 
is within speci- 
fied operating 
range. The case temperature 
should 
be measured 
at the center 
of the top surface 
oppo- 
site the pins. 


For this section, 
let: 


P 
= maximum 
power consumption 


TC 
= case temperature 


TA 
= ambient 
air temperature 


8CA = thermal 
resistance 
from case to ambient 
air 


8JC = thermal 
resistance 
from junction 
to case 


8JA = thermal 
resistance 
from junction 
to ambient 
air 


TA can be calculated 
from 8CA (thermal 
resistance 
from case to ambient) 
with the following 
equation: 
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Typical 
values 
for 8CA and 8JC at various 
airflows 
are given in Table 
6.1 for the 1.75 sq. in., 168 pin, 
ceramic pGA. 8JC is also shown so that 8JA can be 
calculated 
by: 


IJCA = IJJA - 
IJJC 


Note that 8JC with a heatsink 
differs 
from 8JC with- 
out a heatsink 
because 
case 
temperature 
is mea- 


sured 
differently. 
Case 
temperature 
for 
8JC with 
heatsink 
is measured 
at thecenter 
of the heat fin 
base. Case temperature 
for 8JC without 
heatsink 
is 
measured 
at the center 
of package 
top surface. 


Table 6.2 shows the maximum 
TA allowable 
(without 


exceeding 
Tc> at various 
airflows 
and operating 
fre- 


quencies 
(fCLK)' 


Note that TA is greatly 
improved 
by attaching 
"fins" 


or a "heat 
sink" 
to the package. 
P (the maximum 
power consumption) 
is calculated 
by using the maxi- 
mum Icc at 5V as tabulated 
in the DC Characteris- 


tics of section 
7. 


Figure 6.1 gives typical 
Icc derating 
with case tem- 


perature. 
For more information 
on heat sinks, 
mea- 
surement 
techniques, 
or package 
characteristics, 
re- 


fer 
to 
Intel Packaging Handbook, 
order 
number 
240800. 


Typical 
part al5V 
with maximum 
load 


30 


Ice (mA) 


580 


570 


560 


550 


540 


530 


520 


510 


500 


490 


480 


·470 


460 


450 


440 


430 


420 


410 


400 
o 


t-- --- 


40.0 t.lHz---"- -- 
- 
33.3 t.lHz 


25.0 t.lHz 


10 
20 
70 
80 85 
60 
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Figure 
6.1. Icc vs Case Temperature 


Table 
6.1. Thermal 
Resistance 
("C/W) 
8JC and 8CA 


8CA at Alrflow-ft/mln 
(m/sec) 


8JC 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


With 
2 
11 
6 
4 
3.2 
2.5 
2.2 
Heat Sink' 


Without 
1.5 
17.5 
13 
11 
9.5 
8.5 
8 
Heat Sink .. 
'Nlne-fln, 
unidirectional heat sink (fin dimensions: 0.350" 
height, 0.040 
width, 0.115" center-to-center spacing, 1.530" length). 
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Table 
6.2. Maximum 
Allowable 
TA at Various 
Airflows 


InoC 


felK 
Alrflow-ft/mln 
(m/sec) 


0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


TA with 
25.0 
57.5 
70 
75 
77 
78.8 
79.5 


Heat Sink" 
33.3 
52 
67 
73 
75.5 
77.4 
78.5 


40.0 
49.3 
65.5 
72 
74;6 
76.9 
77.9 


TA without 
25.0 
41.3 
52.5 
57.5 
61.3 
63.8 
65 


Heat Sink 
33.3 
32.5 
46 
52 
56.5 
59.5 
61 


40.0 
28.1 
42.8 
49.3 
54.1 
57.4 
59 


·Nine-fin unidirectional heat sink (fin dimensions: 0.350" height, 0.040 width, 
0.115" center-te-center spacing, 1.530" length). 


7.0 ELECTRICAL DATA 
NOTICE: This data sheet contains 
preliminary infor- 


mation on new products in production. The specifica- 
tions are subject to change without notice. Verify with 
your local Intel Sales office that you have the latest 
data sheet before finalizing a design. 


Inputs 
and outputs 
are TTl 
compatible, 
except 
for 
ClK. 
All input and output timings 
are specified 
rela- 
tive to the 
1.5 volt level of the rising edge of ClK 
and refer to the point that the signals reach 
1.5V. 
"WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause 
permanent 
damage. 
These are stress ratings only. Operation 
beyond 
the 


"Operating 
Conditions" 
is not recommended 
and ex- 
tended 
exposure 
beyond 
the 
"Operating 
Conditions" 


may affect device/eliabilify. 


7.1 Absolute Maximum Ratings 


Case Temperature 
Tc under Bias 
O°C to 85°C 


Storage Temperature 
-65°C 
to + 150°C 


Voltage 
on Any Pin 
with Respect to Ground 
- 0.5 to 6.5V 


7.2 D.C. Characteristics 


Table 
7.1. DC Characteristics 
Tc = 0°Ct085°C, 
vcc = 5V ±5% 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Vll 
Input lOW 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input HIGH Voltage 
2.0 
Vcc+0.3 
V 


VllC 
ClK 
Input lOW 
Voltage 
-0.3 
+0.8 
V 


VIHC 
ClK 
Input HIGH Voltage 
3.0 
vcc + 0.3 
V 


VOl 
Output lOW 
Voltage 
0.45 
V 
(Note 1) 


VOH 
Output HIGH Voltage 
2.4 
V 
(Note 2) 
tee 
Power Supply Current 
Cl.K = 25.0 MHz 
500 
mA 
VCC@5V 
Cl.K = 33.3 MHz 
600 
mA 
VCC@5V 
elK 
= 40.0 MHz 
650 
mA 
VCC@5V 


lu 
Input leakage 
Current 
±15 
/LA 
Nopullup 
or pulldown 


IlO 
Output leakage 
Current 
±15 
/LA 
CIN 
Input Capacitance 
15 
pF 
(Note 3) 


Co 
1/0 or Output Capacitance 
15 
pF 
(Note 3) 


CClK 
Clock Capacitance 
20 
pF 
(Note 3) 


NOTES: 
1. This parameter is measured at 4.0 mA for A31-A3, 063-00, 
BE7#-BEO#; 
at 5.0 mA for all other outputs. 
2. This parameter is measured at 1.0 mA for A31-A3, 063-00, 
BE7#-BEO#; 
at 0.9 mA all other outputs. 


3. These are not tested. They are guaranteed by design char~cterization. 
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7.3 A.C. Characteristics 


Table 7.2. A.C. Characteristics 
Tc = O°Cto 85°C, vcc = 5V ±5% 
All timings measured at ClK = 1.5V unless otherwise specified. 


25MHz 
33MHz 
40MHz 
Symbol 
Parameter 
Mln 
Max 
Mln 
Max 
Mln 
Max 
Notes 


(ns) 
(ns) 
(ns) 
(ns) 
(ns) 
(ns) 


t1 
ClK Period 
40 
125 
30 
125 
25 
125 


t2 
ClK High Time 
6 
5 
3 
at3V 


t3 
ClKlowTime 
8 
7 
5 
atO.8V 


t4 
ClKFallTime 
7 
7 
7 
3V-0.8V 


t5 
ClK Rise Time 
7 
7 
7 
0.8V-3V 


t6a 
A31-A3, PTB,W/R#, NENE# 
3.5 
25 
3.5 
23 
3.5 
19 
50pF load 


Valid Delay 


t6b 
BEn#* Valid Delay 
3.5 
27 
3.5 
25 
3.5 
21 
50 pF load 


t7 
Float TIme, All 
3.5 
40 
3.5 
30 
3.5 
25 
(Note 1) 


t8 
ADS#,BREQ,lOCK#,HlDA 
3.5 
22 
3.5 
20 
3.5 
15 
50 pF load 


Valid Delay 


t9 
D63-oo Valid Delay 
3.5 
38 
3.5 
35 
3.5 
31 
50 pF load 


t10 
Setup Time, All Inputs 
13 
11 
8 
(Note 2) 


t11a 
Hold Time, All Inputs except 
4 
4 
3 
(Note 2) 


DATA 


t11b 
DATA Hold Time 
5 
4 
3 


NOTES: 
1. Float condition occurs when maximum output current becomes less than ILOin magnitude. Float delay is not tested. 
2. INT and HOLD 
are asynchronous inputs. The setup and hold specifications are given for test purposes or to assure 


recognition on a specific rising edge of ClK. 
• n = 0, 1, ..., 7 
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3.0V 


elK 


1.5V 
t2 
t3 


O.8V 


tl 


INPUT 
INPUT 
• 


SETUP 
HOLD 


tl0mln 
t"min 


INPUTS 


t6max' t8max' t9max 


VALID 


FLOAT 
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Figure 7.1. eLK, Input, and Output Timings 
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nom+15,---,----,----,----r---, 


TYPICAL· OUTPUT 
DELAY (ns) 
o 1.5V 


1----t---+--t---t7~"I'3-DD 


KH. HLDA 


nom-5~--~--~----~--~--~ 
25 
50 
75 
100 
125 
150 


LOAD CAPACITANCE. CL (pf) 
NOTES: 
Graphs 
are not linear outside 
the CL range shown. 
nom = nominal 
value given 
in the AC timing 
table. 


·Typical 
part under worst-case 
conditions. 
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Figure 7.2. Typical 
Output Delay vs Load Capacitance 
under Worst-Case 
Conditions 


A 
#. BREQ. LOCK,. 
HLDA 
TYPICAL· OUTPUT 
SLEW TIME (ns) 
9 1---!r---7I~-+----:ii""I----:;;I"'9 
(O.8-2.0V) 


/RH. 
NENEI 


O~~~~--~--~---J 
25 
50 
75 
100 
125 
150 


NOTES: 
LOAD CAPACITANCE. ~ 
(pf) 


Graphs 
are not linear outside 
the CL range 
shown. 


·Typical 
part under worst-case 
conditions. 
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Figure 7.3. Typical Slew Time vs Load Capacitance 
under Worst-Case 
Conditions 


700 
/ 


/ 
/'''' 


/' V 


/' 


600 


30D 


200 
8 
12 16 20 24 26 30 34 3840 


FREQUENCY(MHz) 
NOTES: 
Graphs 
are not linear outside 
the frequency 
range 
shown. 


·Worst-case 
supply 
current 
at 5V. 
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Figure 7.4. Typical Ice vs Frequency 
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8.0 INSTRUCTION SET 


Key to abbreviations: 


For register operands, the abbreviations that describe the operands are composed of two parts. The first part 
describes the type of register: 
c 
One of the control registers fir, psr, epsr, dlrbase, db, or fsr 
f 
One of the floating-point registers: fO through f31 
One of the integer registers: rO through r31 


The second part identifies the field of the machine instruction into which the operand is to be placed: 
src 1 
The first of the two source-register designators, which may be either a register or a 16-bit 
immediate constant or address offset. The immediate value is zero-extended for logical 
operations and is sign-extended for add and subtract operations (including addu and subu) 
and for all addressing calculations. 
Same as src1 except that no immediate constant or address offset value is permitted. 
Same as src1 except that the immediate constant is a 5-bit value that is zero-extended to 32 
bits. 
The second of the two source-register designators. 


The destination register designator. 


src1ni 
src1s 


src2 
dest 
• 


Thus, the operand specifier isrc2, for example, means that an integer register is used and that the encoding of 
that register must be placed in the src2 field of the machine instruction. 


Other (nonregister) operands are specified by a one-part abbreviation that represents both the type of operand 
required and the instruction field into which the value of the operand is placed: 
«const 
A 16-bit immediate constant or address offset that the i860 XR microprocessor sign-extends 
to 32 bits when computing the effective address. 
A signed, 26-bit, immediate, relative branch offset. 
A signed, 16-bit, immediate, relative branch offset. 
A function that computes the target address by shifting the offset (either Ibroff or sbroff) left 
by two bits, sign-extending it to 32 bits, and adding the result to the current instruction pointer 
plus four. The resulting target address may lie anywhere within the address space. 


Unless otherwise specified, floating-point operations accept single- or double-precision 
source operands and produce a result of equal or greater precision. Both input operands 
must have the same precision. The source and result precision are specified by a two-letter 
suffix to the mnemonic of the operation. 


Ibroff 
sbroff 
brx 


Other abbreviations include: 
_p 
Precision specification .ss, .sd, or .dd (.ds not permitted). Refer to Table 8.1. 


.r 
Precision specification .ss, .sd, .ds, or .dd. Refer to Table 8.1. 


•v 
.sd or .dd. Refer to Table 8.1. 


•w 
.ss or .dd. Refer to Table 8.1. 


.x 
.b (8 bits), .s (16 bits), or .I (32 bits) 


.y 
.I (32 bits), .d (64 bits), or .q (128 bits) 


.Z 
.I (32 bits), or .d (64 bits) 


Table 8.1. Precision Specification 


Suffix 
Source 
Result 
Precision 
Precision 


.SS 
single 
single 
.sd 
single 
double 
.dd 
double 
double 
.ds 
double 
single 
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mem.x(address) The contents 
of the memory 
location 
indicated 
by address with a size of x. 


PM 
The pixel mask, which is considered 
as an array of eight bits PM[7] ..PM[01, where 
PM[O] is 
the least significant 
bit. 


8.1 Instruction 
Definitions In Alphabetical Order 


adds 
isrct, isrc2, idest 
Add Signed 
idest - 
isrc 1 + isrc2 
OF 
- 
(bit 31 carry "* bit 30 carry) 
CC set if isrc2 < -isrc1 
(signed) 
CC clear if isrc2 ~ -isrc1 
(Signed) 


addu 
isrct, isrc2, idest 
Add Unsigned 
idest - 
isrc1 + isrc2 
OF - 
bit 31 carry 
CC 
- 
bit 31 carry 


and 
isrc 1,isrc2, idest 
Logical 
AND 
idest - 
isrc 1and isrc2 
CC set if result is zero, cleared 
otherwise 


andh 
# const; isrc2, idest 
Logical 
AND High 
idest - 
(#const shifted 
left 16 bits) and isrc2 
CC set if result is zero, cleared 
otherwise 


and not 
isrc1, isrc2, idest 
Logical 
AND NOT 
idest - 
not isrc 1 and isrc2 
CC set if result is zero, cleared 
otherwise 


andnoth 
# coast; isrc2, idest 
Logical 
AND NOT High 
idest - 
not (# const shifted 
left 16 bits) and isrc2 
CC set if result is zero, cleared 
otherwise 
Ibrot! 
Branch 
on CC 
CC = 1 
continue 
execution 
at brx(lbroff) 


bc 
IF 
THEN 
FI 


bc.t 
IF 
THEN 


ELSE 
FI 


bla 


Ibrot! 
Branch 
on CC, Taken 
CC = 1 
execute 
one more sequential 
instruction 
continue 
execution 
at brx(lbroff) 
skip next sequential 
instruction 


isrc1ni, isrc2, sbrot! 
Branch 
on LCC and Add 
LCC-temp 
clear if isrc2 < -isrc1ni 
(signed) 
LCG-temp 
set if isrc2 ~ -isrc1ni 
(signed) 
isrc2 - 
isrc1ni + isrc2 
Execute 
one more sequential 
instruction 
IF 
LCC 
THEN 
LCC 
- 
LCC-temp 
continue 
execution 
at brx(sbroff) 
LCC 
- 
LCC-temp 
ELSE 
FI 


bnc 
IF 
THEN 
FI 


bnc.t 
IF 
THEN 


ELSE 
FI 
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Ibrot! 
Branch 
on Not CC 
CC = 0 
continue 
execution 
at brx(lbroff) 


Ibrot! 
Branch 
on Not CC, Taken 
CC = 0 
execute 
one more sequential 
instruction 
continue 
execution 
at brx(lbroff) 
skip next sequential 
instruction 


THEN 
enter dual-instruction 
mode 
for next two instructions 
ELSE 
enter single-instruction 
mode 
for next two instructions 
FI 
• 
FI 


FI 


bte 
IF 
THEN 
FI 


btne 
IF 
THEN 
FI 
call 
Ibroff 
Subroutine Call 


r1 
- 
address 
of next sequential 
instruction 
+ 4 (+ 8 in dual mode) 
Execute 
one more sequential 
instruction 
Continue 
execution 
at brx(lbroff) 


calli 
[isrc 1nil 
Indirect Subroutine Call 
r1 
- 
address 
of next sequential 
instruction 
+ 4 (+ 8 in dual mode) 


Execute 
one more sequential 
instruction 
Continue 
execution 
at address 
in isrc1ni 


(The original contents 
of isrc1ni is used even if the next instruction 
modifies isrc1ni. Does not trap if isrc1ni is misaligned. 
The register isrc1ni must not be r1.) 


fadd.p 
tsrct, fsrc2, fdest 
Floating-Point Add 


fdest - 
fsrc1 + fsrc2 


faddp 
tsrct, fsrc2, fdest 
Add with Plxel Merge 


fdest - 
fsrc1 + fsrc2 
Shift and load MERGE 
register 
as defined 
in Table 8.2 


faddz 
tsrct, fsrc2, fdest 
Add with Z Merge 


tdest - 
fsrc 1 + fsrc2 
Shift MERGE 
ri"ht 
16 and load fields 31 ..16 and 63 ..48 


famov.r 
fsrc1, fdo'Jst 
Floating-Point Adder Move 


fdest - 
fsrc1 
Send fsrc1 through 
the floating-point 
adder. (Preserves -0 (minus zero) when fsrc1 is -0. 
fsrc2 


must be coded 
as to by the assembler.) 


FI 
Continue 
execution 
at address 
in isrc1ni 


(The original 
c:ontents of isrc1ni is used even if the next instruction 
modifies isrc1ni. Does not trap if isrc1ni is misaligned.) 
tsrcts, isrc2, sbroff 
Branch If Equal 
isrcts = isrc2 
. 


continue 
execution 
at brx(sbroff) 


isrc 1s, if,:rc2,sbroff 
Branch If Not Equal 


isrc 1s "" isrc2 
continue 
execution 
at brx(sbroff) 
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fladd.w 
fsrc1, fsrc2, fdes! 
Long-Integer Add 
tdest +- fsrc1 + fsrc2 
flsub.w 
fsrc1, fsrc2, tdest 
Long-Integer Subtr~ct 
tdest +- fsrc1 - 
fsrc2 


fix. v 
fsrc 1,tdest 
Floatlng-Polntto 
Integer Conversion 
tdest +- 64- bit value with low-order 
32 bits equal to integer 
part of fsrc1 rounded 


Floating-Point 
Load 
fld.y 
isrc1(isrc2), fdes! 
(Normal) 
fld.y 
isrc1(isrc2)+ +, tdest 
(Autolncrement) 
taest +- 
mem.y (isrc1 + isrc2) 
IF autoincrement 
THEN isrc2 +- isrc1 + isrc2 
FI 


Cache Flush 
flush 
# const(isrc2) 
(Normal) 
flush 
# const(isrc2) + + 
(Autolncrement) 
Replace 
block in data cache with address (#cons! + isrc2). 
Contents 
of block undefined. 


IF autoincrement 
THEN isrc2 +- «const + isrc2 
FI 
fmlow.dd 
tsrct, fsrc2, tdest 
Floating-Point Multiply Low 
fdes! +- low-order 
53 bits of fsrc1 mantissa x fsrc2 mantissa 
tdest bit 53 +- most significant 
bit of mantissa 
fmov.r 
fsrc1, fdes! 
Floating-Point Reg-Reg Move 
Assembler 
pseudo-operation 
fmov.ss fsrc1, tdest = fladd.ss fsrc1, to, tdest 
fmov.dd tsrct, taest = fladd.dd fsrc1, fO, fdes! 
fmov.sd tsrct, taest = famov.sd fsrc1, fdes! 
fmov.ds fsrc1, tdest = famov.ds fsrc1, fdes! 
fmul.p 
fsrc1, fsrc2, fdes! 
Floating-Point Multiply 
tdest +- fsrc 1 x fsrc2 


fnop 
Floating-Point No Operation 
Assembler 
pseudo-operation 
fnop = shrd ro, rO, ro 


form 
fsrc1, fdes! 
OR with MERGE Register 
tdest +- fsrc1 OR MERGE 
MERGE 
+- 0 


frcp.p 
fsrc2, tdest 
Floating-Point Reciprocal 
tdest +- 
1Ifsrc2 with maximum 
mantissa 
error < 2-7 


frsqr.p 
fsrc2, tdest 
Floating-Point Reciprocal Square Root 
fdes! +- 1/SQRT (fsrc2) with maximum 
mantissa 
error < 2-7 


Floating-Point Store 
fst.y 
tdest, isrc1(isrc2) 
(Normal) 
fst.y 
taest. isrc1(isrc2)+ + 
(Autolncrement) 
mem.y (isrc2 + isrc1) +- tdest 
IF autoincrement 
THEN isrc2 +- isrc1 + isrc2 
FI 


fsub.p 
tsrct, fsrc2, tdest 
Floating-Point Subtract 
tdest +- fsrc1 - 
fsrc2 


ftrunc.v 
tsrct, fdes! 
Floatlng-Polntto 
Integer Conversion 
tdest +- 64-bit value with low-order 
32 bits equal to integer 
part of fsrc1 
fxfr 
tsrct, ides! 
Transfer F-P to Integer Register 
ides! +- fsrc1 
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fzchkl 
fsrc1, fsn:2, fdest 
32-Blt 
Z-Buffer 
Check 
Consider 
fsrc1, tsrce. 
and fdest as arrays of two 32-bit 
fields fsrc1(O) ..fsrc1(1), 
fsrc2(O)..fsrc2(1), 
and fdest(O)..fdest(1) 
where 
zero denotes 
the least-significant 
field. 


PM 
- 
PM shifted 
right by 2 bits 
FOR i = 0 to 1 
DO 
PM (j + 6] - 
fsrc2(i) 
~ fsrc1(i) 
(unsigned) 
fdest(i) 
- 
smaller 
of fsrc2(i) 
and fsrc1(i) 
, 


OD 
MERGE 
- 
0 


fzchks 
tsrct, 
fsrc2, fdest 
16-Blt Z-Buffer 
Check 
Consider tsrct, fsrc2, 
and fdest as arrays of four 16-bit 
fields fsrc1(O)..fsrc1(3), 
fsrc2(O)..fsrc2(3), 
and fdest(O)..fdest(3) 
where zero denotes 
the least-significant 
field. 
PM 
- 
PM shifted 
right by 4 bits 
FORi=Ot03 
2 


DO 
PM (j + 4] - 
fsrc2(i) 
~ fsrc1(i) 
(unsigned) 
fdest(i) 
- 
smaller 
of fsrc2(i) 
and fsrc1(i) 
OD 
MERGE 
- 
0 


Intovr 
Software 
Trap on Integer 
Overflow 
If OF in epsr 
= 
1, generate 
trap with IT set in psr. 


Ixfr 
isrc1 ni, fdest 
Transfer 
Integer 
to F-P Register 
fdest 
- 
isrc1ni 


Id.c 
csrc2, idest 
" 
Load from 
Control 
Register 
idest 
- 
csrc2 


Id.x 
isrc1(isrc2j, 
idest 
Load 
Integer 
idest 
- 
mem.x (isrc 1 + isrc2j 


lock 
Begin 
Interlocked 
Sequence 
Set BL in dlrbase. 
The next load or store that misses the cache locks that location. 
Disable 
interrupts 
until the bus is unlocked. 


mov 
isrc2, idest 
Register-Register 
Move 
Assembler 
pseudo-operation 
mov 
isrc2, idest 
= 
shl ro, 
isrc2, idest 


mov 
const32, 
idest 
- 
Constant-to-Reglster 
Move 
Assembler 
pseudo-operation 
" 


adds 
I%COflst32, 
ro, idest 


... 
when const32 
< Ox8000 


orh h %com~t32, rO, idest 
or l%const32, 
taest. idest 
... 
when const32 
:?: Ox8000 


nop 
Core-Unit 
No Operation 
Assembler 
pseudo-operation 
nop 
= 
shl IrO,ro, ro 


or 
isrc 1, isrc2, idest 
Logical 
OR 
idest 
- 
isrc1 OR isrc2 
CC set if result ls zero, cleared 
otherwise 


orh 
# const, isrc2, idest 
Logical 
OR High 
idest 
- 
(# coast shifted 
left 16 bits) OR isrc2 
CC set ifresult 
is zero, cleared 
otherwise 
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pfadd.p 
tsrct, fsrc2, fdest 
Plpellned Floating-Point Add 
fdest - 
last stage Adder result 
Advance A pipeline one stage 
A pipeline first stage - 
fsrc1 + fsrc2 
pfaddp 
tsrct, fsrc2, fdest 
Plpellned Add with Plxel Merge 
fdest - 
last stage Graphics result 
last stage Graphics result - 
fsrc 1 + fsrc2 
Shift and load MERGE register from last stage Graphics result as defined in Table 8.2 
pfaddz 
fsrc1, fsrc2, fdest 
Plpellned Add with Z Merge 
fdest - 
last stage Graphics result 
last stage Graphics result - 
fsrc1 + fsrc2 
Shift MERGE right 16 and load fields 31..16 and 63..48 from last stage Graphics result 
pfam.p 
fsrc1, fsrc2, fdest 
Plpelined Floating-Point Add and Multiply 
fdest - 
last stage Adder result 
Advance A and M pipeline one stage (operands accessed before advancing pipeline) 
A pipeline first stage - 
A-op1 + A-op2 
M pipeline first stage - 
M-op1 x M-op2 
pfamov.r 
fsrc1, fdest 
Plpellned Floating-Point Adder Move 
fdest - 
last stage Adder result 
Advance A pipeline one stage 
A pipeline first stage - 
fsrc1 
pfeq.p 
tsrct, fsrc2, fdest 
Plpellned Floating-Point Equal Compare 
fdest - 
last stage Adder result 
CC set if fsrc1 = fsrc2, else cleared 
Advance A pipeline one stage 
A pipeline first stage is undefined, but no result exception occurs 
pfgt.p 
fsrc " fsrc2, fdest 
Plpellned Floating-Point Greather- Than Compare 
(Assembler clears R-bit of instruction) 
fdest - 
last stage Adder result 
CC set if fsrc1 > fsrc2, else cleared 
Advance A pipeline one stage 
A pipeline first stage is undefined, but no result exception occurs 
pfladd.w 
fsrc1, fsrc2, fdest 
Plpellned Long-Integer Add 
fdest - 
last stage Graphics result 
last stage Graphics result - 
fsrc1 + fsrc2 
pflsub.w 
tsrct, fsrc2, fdest 
Plpellned Long-Integer Subtract 
fdest - 
last stage Graphics result 
last stage Graphics result - 
fsrc1 - 
fsrc2 
pflx.v 
fsrc1, fdest 
' 
Plpellned Floating-Point to Integer Conversion 
fdest - 
last stage Adder result 
Advance A pipeline one stage 
A pipeline first stage - 
54-bit value with low-order 32 bits 
equal to integer part of fsrc1 rounded 
Pipelined Floating-Point 
Load 
pfld.z 
isrc1(isrc2), fdest 
(Normal) 


pfld.z 
isrc1(isrc2)+ + ,fdest 
(Autolncrement) 


fdest - 
mem.z (third previous pfld's (isrc1 + isrc2) 
(where .Z is precision of third previous pfld.z) 
If autoincrement 
THEN isrc2 - 
isrc1 + isrc2 
FI 
pfle.p 
fsrc1, fsrc2, fdest 
Pipelined F-P Less-Than or Equal Compare 
Assembler pseudo-operation, identical to pfgt.p except that 
assembler sets R-bit of instruction. 


fdest - 
last stage Adder result 
CC clear if fsrc1 5.: fsrc2, else set 
Advance A pipeline one stage 
A pipeline first stage is undefined, but no result exception occurs 
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pfmam.p 
tsrct, fsrc2, fdes! 
Plpellned Floating-Point Add and Multiply 
tdest - 
last stage Multiplier 
result 
Advance 
A and M pipeline 
one stage (operands 
accessed 
before advancing 
pipeline) 
A pipeline 
first stage 
- 
A-op1 
- 
A-op2 
M pipeline 
first stage 
- 
M-op1 
X M-op2 
pfmov.r 
tsrct, taest 
Plpellned Floating-Point Reg-Reg Move 
Assembler 
pseudo-operation 
pfmov.ss fsrc1, tdest = pfladd.ss tsrct, to, fdes! 
pfmov.dd tsrct, tdest = pfladd.dd fsrc1, to, tdest 
pfmov.sd tsrct, tdest = pfamov.sd fsrc1, tdest 
pfmov.ds tsrct, tdest = pfamov.ds fsrc1, tdest 
pfmsm.p 
fsrc1, fsrc2, fdes! 
Plpellned Floating-Point Subtract and Multiply 
tdest - 
last stage Multiplier 
result 
Advance 
A and M pipeline 
one stage (operands 
accessed 
before 
advancing 
pipeline) 
A pipeline 
first stage 
- 
A-op1 
- 
A-op2 
M pipeline 
first stage 
- 
M-op1 
X M-op2 
pfmul.p 
tsrct, fsrc2, tdest 
Plpellned Floating-Point Multiply 
tdest - 
last stage Multiplier 
result 
Advance 
M pipeline 
one stage 
M pipeline 
first stage 
- 
fsrc1 X fsrc2 
pfmul3.dd 
tsrct, fsrc2, fdes! 
Three-Stage Plpellned Multiply 
tdest - 
last stage Multiplier 
result 
Advance 
3-Stage 
M pipeline 
one stage 
M pipeline 
first stage 
- 
fsrc1 X fsrc2 
pform 
tsrct, tdest 
Plpellned OR to MERGE Register 
fdes! - 
last stage Graphics 
result 
last stage Graphics 
result 
- 
fsrc1 OR MERGE 
MERGE 
- 
0 
pfsm.p 
fsrc 1,fsrc2, tdest 
Plpellned Floating-Point Subtract and Multiply 
fdes! - 
last stage Adder result 
Advance 
A and M pipeline 
one stage (operands 
accessed 
before advancing 
pipeline) 
A pipeline 
first stage 
- 
A-op1 
- 
A-op2 
M pipeline 
first stage 
- 
M-op1 
X M-op2 
pfsub.p 
tsrct, fsrc2, tdest 
Plpellned Floating-Point Subtract 
fdest - 
last stage Adder result 
Advance 
A pipeline 
one stage 
A pipeline 
first stage 
- 
fsrc1 + fsrc2 
pftrunc.v 
tsrct, tdest 
Plpellned Floatlng-Pointto 
Integer Conversion 
tdest - 
last stage Adder result 
Advance 
A pipeline 
one stage 
A pipeline 
first stage 
- 
54-bit value with low-order 
32 bits 
equal to integer 
part of fsrc1 
pfzchkl 
fsrc 1,fsrc2, tdest 
Plpellned 32-Blt Z-Buffer Check 
Consider 
fsrc1, fsrc2, and tdest, as arrays of two 32-bit 
fields fsrc1(O)..fsrc1(1), fsrc2(O)..fsrc2(1),and fdest(O)..fdest(1) 
where zero denotes 
the least significant 
field. 
PM 
- 
PM shifted 
right by 2 bits 
FOR i = 0 to 1 
DO 
PM [i + 6] 
- 
fsrc2(i) s fsrc1(i) (unsigned) 
fdest(i) - 
last stage Graphics 
result 
last stage Graphics 
result 
- 
smaller 
of fsrc2(i) and fsrc1(i) 
OD 
MERGE 
- 
0 


• 
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pfzchks 
fsrc1, fsrc2, fdest 
Plpellned 16-Blt Z-Buffer Check 
Consider tsrct, 
fsrc2, and fdest, as arrays of four 16-bit 
fields fsrc1(0) ..fsrc1(3), 
fsrc2(0) ..fsrc2(3), 
and fdest(0) ..fdest(3) 
where zero denotes the least significant field. 
PM +- PM shifted right by 4 bits 
FOR i = 0 to 3 
DO 
PM [i + 4] +- fsrc2(i) 
~ fsrc1(i) (unsigned) 
fdest(i) 
+- last stage Graphics result 
last stage Graphics result +- smaller of fsrc2(i) and fsrc1(i) 
OD 
MERGE +- 0 


pst.d 
fdest, # const(isrc2) 
Plxel Store 
pst.d 
fdest, # const(isrc2) + + 
Plxel Store Autolncrement 
Pixels enabled by PM in mem.d (isrc2 + # consf) 
+- fdest 
Shift PM right by 8/pixel size (in bytes) bits 
IF autoincrement 
THEN isrc2 +- # const + isrc2 
FI 
shl 
isrct, isrc2, idest 
Shift Left 
idest +- isrc2 shifted left by isrc1 bits 
shr 
tsrct, 
isrc2, idest 
Shift Right 


SC (in psr) +- isrc1 
idest +- isrc2 shifted right by isrc1 bits 
shra 
isrc 1,isrc2, idest 
Shift Right Arithmetic 
idest +- isrc2 arithmetically shifted right by isrc1 bits 
shrd 
isrct, 
isrc2, idest 
Shift Right Double 
idest +- low-order 32 bits of isrc1:isrc2 
shifted right by SC bits 
st.c 
isrc1ni, csrc2 
Store to Control Register 
csrc2 
+- isrc1 ni 
st.x 
isrc1ni, 
#const(isrc2) 
Store Integer 
mem.x 
(isrc2 + #consf) 
+- isrc1ni 
subs 
isrc1, isrc2, idest 
Subtract Signed 
idest +- isrc1 
- 
isrc2 
OF +- (bit 31 carry "* bit 30 carry) 
CC set if isrc2 > isrc1 (signed) 
CC clear if isrc2 
~ isrc1 (signed) 
subu 
isrct, 
isrc2, idest 
Subtract Unsigned 
idest +- isrc 1 - 
isrc2 
OF +- NOT (bit 31 carry) 
CC +- bit 31 carry 
(i.e. 
CC set if isrc2 
~ isrc1 (unsigned) 
CC clear if isrc2 > isrc1 (unsigned) 
trap 
isrc1ni, isrc2, idest 
Software Trap 
Generate trap with IT set in psr 
unlock 
End Interlocked Sequence 
Clear BL in dlrbase. The next load or store unlocks the bus. 
Enable interrupts after bus is unlocked. 


xor 
isrc 1,isrc2, idest 
Logical Exclusive OR 
idest +- isrc1 XOR isrc2 
CC set if result is zero, cleared otherwise 
xorh 
# const, isrc2, idest 
Logical Exclusive OR High 
idest 
+- 
(#const 
shifted left 16 bit) XOR isrc2 
CC set if result is zero, cleared otherwise 
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Plxel 
Fleld!~Loaded From 
Right Shift 
Size 
Result Into MERGE 
Amount 
(from PS) 
(Field Size) 


8 
63..56,47 ..40,31 ..24,15 ..8 
8 
16 
63..58,47'..42,31 ..26,15 ..10 
6 
32 
63..56, 
31..24 
8 


8.2 Instruction Format and Encoding 


All instructions are 32 bits long and begin on a four- 
byte boundary. When operands are registers, the 
register encodings shown in Table 8.3 are used. 
There are two general core-instruction formats, 
REG-format and CTRL-format, as well as a separate 
format for floating-point instructions. 


8.2.1 REG-FORMAl" INSTRUCTIONS 


Within the REG-format are several variations as 
shown in Figure 8.1. Table 8.4 gives the encodings 
for these instructions. One encoding is an escape 
code that defines yet another variation: the core es- 
cape instructions. Figure 8.2 shows the format of 
this group, and Table 8.5 shows the encodings. 


In these instructions, the src2 field selects one of 
the 32 integer registers (most instructions) or five 
control registers (st.c and Id.c). Dest selects one of 
the 32 integer registers (most instructions) or float- 
ing-point registers (ild, fst, pfld, pst, Ixfr). For in- 
structions where src1 is optionally an immediate val- 
ue, bit 26 of the opcode (I-bit) indicates whether src 1 
is an immediate. If bit 26 is clear, an integer register 
is used; if bit 26 is set, src1 is contained in the low- 
order 16 bits, except for bte and btne instructions. 
For bte and btne, the five-bit immediate value is 
contained in the srct field. For st, bte, btne, and 
bla, the upper five bits of the offset or broffset are 
contained in the dest field instead of srct, and the 
lower 11 bits of offset are the lower 11 bits of the 
instruction. 
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Table 8.3. Register Encoding 


Register 
Encoding 


rO 
0 


r31 
31 


fO 
0 


f31 
31 


Fault Instruction 
0 
Processor Status 
1 
Directory Base 
2 
Data Breakpoint 
3 
Floating-Point Status 
4 
Extended Process Status 
5 • 


For Id and st, bits 28 and zero determine operand 
size as follows: 


Bit 28 
Bit 0 
Operand Size 


0 
0 
8-bits 
0 
1 
8-bits 
1 
0 
16-bits 
1 
1 
32-bits 


When src1 is an immediate and bit 28 is set, bit zero 
of the immediate value is forced to zero. 


For fld, fst, pfld, pst, and flush, bit 0 selects autoin- 
crement addressing if set. For fld, fst, pfld, 
and 
pst, bits one and two select the operand size as 
follows: 


Bit 1 
Bit 2 
Operand Size 


0 
0 
64-bits 
0 
1 
128-bits 
1 
0 
32-bits 


1 
1 
32-bits 


When src1 is an immediate value, bits zero and one 
of the immediate value are forced to zero to main- 
tain alignment. When bit one of the immediate value 
is clear, bit two is also forced to zero. 


For flush, bits one and two must be zero. 
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General 
Format 
15 
31 
25 
20 
10 


OPCODEII 
SRCl 
IMMEDIATE. OFFSET. OR NULL 
SRC2 
DEST 


16-Blt Immediate 
Variant (except 
bte and btne) 
31 
25 
20 
15 
0 


I 


OPCODE 8 


SRC2 
I 


DEST 
I 


IMMEDIATE 
I 


at, bla, bte, and btne 
31 
25 
20 
15 
10 
0 


OPCODE/I 
SRC2 
SRCl 
OFFSET LOW 
SRC1S 


bte and btne with 5-Blt Immediate 
31 
25 
20 
15 
10 
0 


OPCODE 
SRC2 
OFFSET 
IMMEDIATE 
OFFSET LOW 
HIGH 


Figure 8.1. REG-Format 
Variations 


2-230 


o 


-. 
(Escape for Core Unit) 
0 
1 
0 
0 
1 
1 
bte, btne 
Branch Equal or Not Equal 
0 
1 
0 
1 
E 
I 
pfld.y 
Pipelined 
F-P Load 
0 
1 
1 
0 
0 
I 
(GTRL-Format 
Instructions) 
0 
1 
1 
x 
x 
x 
addu, 
-s, subu, -s, 
Add/Subtract 
1 
0 
0 
SO 
AS 
I 
shl,shr 
Logical Shift 
1 
0 
1 
0 
LR 
I 
shrd 
Double Shift 
1 
0 
1 
1 
0 
0 
bla 
Branch LGG Set and Add 
1 
0 
1 
1 
0 
1 
shra 
Arithmetic 
Shift 
1 
0 
1 
1 
1 
I 
and(h) 
AND 
1 
1 
0 
0 
H 
I 
andnot(h) 
ANDNOT 
1 
1 
0 
1 
H 
I 
or(h) 
OR 
1 
1 
1 
0 
H 
I 
xor(h) 
XOR 
1 
1 
1 
1 
H 
I 
(reserved) 
1 
1 
x 
x 
1 
0 • 


L 
Integer Length 
AS 
Add/Subtract 
0 
-8 
bits 
0 
-Add 
1 
-16 
or 32 bits (selected by bit 0) 
1 
-Subtract 
LS 
Load/Store 
LR 
Left/Right 
0 
-Load 
0 
-Left 
Shift 
1 
-Store 
1 
-Right 
Shift 
SO 
Signed/Ordinal 
E 
Equal 
0 
-Ordinal 
0 
-Branch 
on Not Equal 
1 
-Signed 
1 
-Branch 
on Equal 
H 
High 
Immediate 
0 
-and. 
or. andnot. xor 
0 -src1 is register 
1 
-andh. 
orh. andnoth. xorh 
1 -src1 is immediate 


31 
26 
15 
10 
5 
0 


1 
0 
1 
0 
0 
1 
1 I 
reserved" 
I 
SRC1 
I 
reserved" 
I 
OPCODE I 
"reserved (must be set to zero by assemblers) 


Figure 
8.2. Core Escape 
Instruction 
Format 
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Table 8.5. Core Escape Opcodes 
4 
o 


(reserved) 
0 
0 
0 
0 
0 
lock 
Begin Interlocked 
Sequence 
0 
0 
0 
0 
1 
calli 
Indirect Subroutine 
Call 
0 
0 
0 
1 
0 
(reserved) 
0 
0 
0 
1 
1 
intovr 
Trap on Integer Overflow 
0 
0 
1 
0 
0 
(reserved) 
0 
0 
1 
0 
1 
(reserved) 
0 
0 
1 
1 
0 
unlock 
End Interlocked 
Sequence 
0 
0 
1 
1 
1 
(reserved) 
0 
1 
x 
x 
x 
(reserved) 
1 
0 
x 
x 
x 
(reserved) 
1 
1 
x 
x 
x 


8.2.2 CTRL-FORMAT 
INSTRUCTIONS 


The CTRL instructions 
do not refer to registers, 
so instead 
of the register 
fields, they have a 26-bit 
relative 
branch 
offset. 
Figure 8.3 shows the format 
of these 
instructions 
and Table 8.6 defines 
the encodings. 
• 


o 


BROFFSET 


BROFFSET is a signed 26-bit relative branch offset. 


Figure 8.3. CTRL Instruction 
Format 


Table 8.6. CTRL-Format 
Opcodes 
28 
26 


(reserved) 
0 
0 
0 
(reserved) 
0 
0 
1 
br 
Branch Direct 
0 
1 
0 
call 
Call 
0 
1 
1 
bc(.t) 
Branch on CC Set 
1 
0 
T 
bnc(.t) 
Branch on CC Clear 
1 
1 
T 


T 
Taken 
o 
-bc or bnc 
1 -bc.t or bnc.t 
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8.2.3 FLOATING-POIINT INSTRUCTIONS 


The floating-point instructions also constitute an escape series. All these instructions begin with the bit se- 
quence 010010. Figure 8.4 shows the format of the floating point instructions, and Table 8.7 gives the encod- 
ings. Within the dual-operation instructions is a subcode OPCwhose values are given in Table 8.8 along with 
the mnemonic that corresponds to each. 
31 
25 
20 
15 
7 
0 
1-0-1--0-0-1-0-'-1--s-R-C2--I-' --OE-S-T--I---SR-C-1--tEEj'-- 
__ 0_PCO_O_E 
__ 
---'i 


SRC1, SRC2 -SourCE'; one of 32 floating-point registers 
DEST 
-Destination 
register 
(instructions other than fxfr) one of 32 floating-point registers 
(fxfr) one of 32 integer registers 


P 
Pipelining 
1 
-Pipelined 
instruction mode 
o 
-Scalar 
instruc:tionmode 
D 
Dual-Instruction Mode 
1 
-Dual-instruction 
mode 
o 
-Single-instruction 
mode 


S 
Source Precision 
1 
-Double-precision 
source operands 
o 
-Single-precision 
source operands 
R 
Result Precision 
1 
-Double-precision 
result 
o 
-Single-precision 
result 


Figure 8.4. Floating-Point Instruction 
Encoding 


Table 8.7. Floating-Point Opcodes 
6 


pfam 
Add and Multiply" 
pfmam 
Multiply with Add" 
0 
0 
0 
OPC 
pfsm 
Subtract and Multiply" 
0 
0 
1 
OPC 
pfmsm 
Multiply with Subtract" 


(p)fmul 
Multiply 
0 
1 
0 
0 
0 
0 
0 
fmlow 
Multiply Low 
0 
1 
0 
0 
0 
0 
1 
frcp 
Reciprocal 
0 
1 
0 
0 
0 
1 
0 
frsqr 
Reciprocal Square Root 
0 
1 
0 
0 
0 
1 
1 
pfmul3.dd 
3-Stage Pipelined Multiply 
0 
1 
0 
0 
1 
0 
0 


(p)fadd 
Add 
0 
1 
1 
0 
0 
0 
0 
(p)fsub 
Subtract 
0 
1 
1 
0 
0 
0 
1 
(p)flx 
Fix 
0 
1 
1 
0 
0 
1 
0 
(p)famov 
Adder Move 
0 
1 
1 
0 
0 
1 
1 
pfgt/pfle" 
Greater Than 
0 
1 
1 
0 
1 
0 
0 
pfeq 
Equal 
0 
1 
1 
0 
1 
0 
1 
(p)ftrunc 
Truncate 
0 
1 
1 
1 
0 
1 
0 


fxfr 
Transfer to Integer Register 
1 
0 
0 
0 
0 
0 
0 
(p)fladd 
Long-Integer Add 
1 
0 
0 
1 
0 
0 
1 
(p)flsub 
Long-Integer Subtract 
1 
0 
0 
1 
1 
0 
1 


(p)fzchkl 
Z-Check Long 
1 
0 
1 
0 
1 
1 
1 
(p)fzchks 
Z-Check Short 
1 
0 
1 
1 
1 
1 
1 
(p)faddp 
Add with Pixel Merge 
1 
0 
1 
0 
0 
0 
0 
(p)faddz 
Add with Z Merge 
1 
0 
1 
0 
0 
0 
1 
(p)form 
OR with MERGE Register 
1 
0 
1 
1 
0 
1 
0 


'pfam and pfsm have P-blt set; pfmam and pfmsm have P-bIt clear. 
"pfgt 
has R bit cleared; pfle has R bit set. 


NOTE: 
All opcodes not shown are reserved. 
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The following 
table shows the opcode 
mnemonics 
that generate 
the various 
encodings 
of ope and explains 
each encoding. 


Table 8.8. ope Encoding 


ope 
PFAM 
PFSM 
M-Unit 
M-Unit 
A-Unit 
A-Unit 
T 
K 
Mnemonic 
Mnemonic 
op1 
op2 
op1 
op2 
Load 
Load" 


0000 
r2p1 
r281 
KR 
src2 
src1 
M result 
No 
No 
0001 
r2pt 
r28t 
KR 
src2 
T 
M result 
No 
Yes 
0010 
r2ap1 
r2a81 
KR 
src2 
src1 
A result 
Yes 
No 
0011 
r2apt 
r2a8t 
KR 
src2 
T 
A result 
Yes 
Yes 


0100 
12p1 
1281 
KI 
src2 
src1 
M result 
No 
No 
0101 
12pt 
128t 
KI 
src2 
T 
M result 
No 
Yes 
0110 
12ap1 
12a81 
, 


KI 
src2 
srct 
A result 
Yes 
No 
0111 
12apt 
12ast 
KI 
src2 
T 
A result 
Yes 
Yes 


1000 
rat1p2 
rat182 
KR 
A result 
src1 
src2 
Yes 
No 
1001 
m12apm 
m12asm 
src1 
src2 
A result 
M result 
No 
No 
1010 
ra1p2 
ra1s2 
KR 
A result 
src1 
src2 
No 
No 
1011 
m 12ttpa 
m12ttsa 
src1 
src2 
T 
A result 
Yes 
No 


1100 
lat1p2 
lat182 
KI 
A result 
src1 
src2 
Yes 
No 
1101 
m12tpm 
mtztsm 
src1 
src2 
T 
M result 
No 
No 
1110 
la1p2 
la1s2 
KI 
A result 
src1 
src2 
No 
No 
1111 
m12tpa 
m12tsa 
src1 
src2 
T 
A result 
No 
No 


ope 
PFMAM 
PFMSM 
M-Unit 
M-Unit 
A-Unit 
A-Unit 
T 
K 
Mnemonic 
Mnemonic 
op1 
op2 
op1 
op2 
Load 
Load" 


0000 
mr2p1 
mr281 
KR 
src2 
src1 
M result 
No 
No 
0001 
mr2pt 
mr28t 
KR 
src2 
T 
M result 
No 
Yes 
0010 
mr2mp1 
mr2ms1 
KR 
src2 
src1 
M result 
Yes 
No 
0011 
mr2mpt 
mr2mst 
KR 
src2 
T 
M result 
Yes 
Yes 


0100 
ml2p1 
ml2s1 
KI 
src2 
src1 
M result 
No 
No 
0101 
ml2pt 
ml2st 
KI 
src2 
T 
M result 
No 
Yes 
0110 
ml2mp1 
ml2m81 
KI 
src2 
src1 
M result 
Yes 
No 
0111 
ml2mpt 
ml2m8t 
KI 
src2 
T 
M result 
Yes 
Yes 


1000 
mrmt1p2 
mrmt1s2 
KR 
M result 
src1 
src2 
Yes 
No 
1001 
mm12mpm 
mm12msm 
src1 
src2 
M result 
M result 
No 
No 
1010 
mrm1p2 
mrm182 
KR 
M result 
src1 
src2 
No 
No 
1011 
mm 12ttpm 
mm 12ttsm 
src1 
src2 
T 
A result 
Yes 
No 


1100 
mlmt1p2 
mlmt1s2 
KI 
M result 
src1 
src2 
Yes 
No 
1101 
mm12tpm 
mm12tsm 
src1 
src2 
T 
M result 
No 
No 
1110 
mlm1p2 
mlm1s2 
KI 
M result 
src1 
src2 
No 
No 
1111 
Intel-Reserved 
.. 
*If K-Ioad 
IS set, ,KR IS loaded 
when 
operand-1 
of the multipher 
IS KR; KI IS loaded 
when 
operand-1 
of the multiplier 
is KI. 
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8.3 Instruction Timings 
shown 
in the table 
below. 
Freezes 
due to multiple 
simultaneous 
cache 
misses 
result in a delay that is 
the sum of the delays 
for processing 
each miss by 
itself. 
Other 
multiple 
freeze 
conditions 
usually 
add 
only the delay of the longest 
individual 
freeze. 


i860 XR microprocessor 
instructions 
take one clock 
to 
execute 
unless 
a freeze 
condition 
is invoked. 
Freeze 
conditions 
and their 
associated 
delays 
are 


Freeze 
Condition 
Delay 


lnstruction-cachs 
miss 
Number of clocks to read instruction 
(from ADS 
clock to first READY 11 clock) plus time to last 
READY 11 of block when jump or freeze occurs 
during miss processing 
plus two clocks if data- 
cache being accessed 
when instruction-cache 
miss occurs. 


Reference 
to destination 
of Id instruction 
that 
One plus number of clocks to read data (from 
misses 
ADS 11 clock to first READY 11 clock) minus number 
of instructions 
executed 
since load (not counting 
instruction 
that references 
load destination) 


fld miss 
One plus number of clocks until first READY 11 
returned 
(for 32- or 64-bit read cycles) or until 
second 
READY 11 returned 
(for 128-bit fld.q 
read 
cycles) 


call, calli, Ixfr, fxfr, 
Id.e, or st.e and data cache 
One plus number of clocks until first READY 11 


load miss processing 
in progress 
returned 
(for 64-bit read cycles) or until second 
READY 11 returned 
(for 128-bit fld.q 
read cycles) 


Id/st/pfld/fld/fst 
and data cache load miss 
One plus number of clocks until last READY # 
processing 
in progress 
returned 


Reference 
to destof Id, call, calli, fxfr, 
or Id.e in 
One clock 
the next instruction. 
(Destof call and calli is r1.) 
• 
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Freeze 
Condition 
Delay 


Reference 
to dest of fld/pfld/lxfr 
in the next two 
Two clocks in the first instruction; 
one in the 
instructions 
second instruction 


bc/bnc/bc.t/bnc.t 
following 
addu/adds/subu/ 
One clock 
subs/pfeq/pfle/pfgt 


Fsrc1 of multiplier 
operation 
refers to result of 
One clock 
previous operation 


Floating-point 
operation 
or graphics-unit 
If the scalar operation 
is fadd, 
fix, fmlow, 
fmul.ss, 
instruction 
or fst, and scalar operation 
in progress 
fmul.sd, 
ftrunc, 
or fsub, 
two minus the number of 
other than frcp 
or frsqr 
instructions 
(or dual-mode 
pairs) already executed 


- 
after the scalar operation. 
If the scalar operation 
is 
fmul.dd, 
three minus the number of instructions 
(or dual-mode 
pairs) executed 
after it. Add one if 
either or both of these two situations 
occur: 
1. There is an overlap 
between 
the result register 
of the previous scalar operation 
and the source 
of the floating-point 
operation, 
and the 
destination 
precision 
of the scalar operation 
is 
different 
than the source precision 
of the 


floating-point 
operation. 


2. The floating-point 
operation 
is pipelined 
and its 


destination 
is not fO. 


There is no delay if the result is negative. 


Multiplier 
operation 
preceded 
by a double 
One clock 
precision 
multiply 


TLB miss 
Five plus the number of clocks to finish two reads 
plus the number of clocks to set A·bits (if 
necessary) 


pfld when three pfld's 
are outstanding 
One plus the number of clocks to return data from 
first pfld 


pfld hits in.the data cache 
Two plus the number of clocks to finish all 
outstanding 
accesses 


st, pst or fst miss, Id miss, or flush 
with modified 
One plus the number of clocks until READY # 
block when store path full (two stores or one 256- 
active on next 64-bit write cycle or second 
bit write-back 
internally 
waiting for bus plus 
READY # of next 128-bit write cycle. 


external 
bus pipeline full) 


Id, fld, pfld, st, pst, or fst when address 
path full 
Number of clocks until next nonrepeated 
address 
(one address 
internally 
waiting for bus plus 
can be issued (i.e., an address that is not the 2nd- 


external 
bus pipeline full) 
4th cycle of a cache fill, the 2nd-8th 
cycle of a 
CS8 mode instruction 
fetch, nor the 2nd cycle of a 
128-bit write) 


Idlfld 
following 
st/fst 
hit 
One clock 


, 
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FretezeCondition 
Delay 


Delayed branch not taken 
One clock 


Nondelayed branch taken: 


bc,bnc 
One clock 
bte, btne 
Two clocks 


Indirect branch brl or call calli 
One clock 


st.c 
Two clocks 


Result of graphics·unit instruction (other than 
One clock 
fmov.dd) used in next instruction when the next 
instruction is an adder- or multiplier-unit instruction 


Result of graphics·unit instruction used in next 
One clock 
instruction when the next instruction is a graphics- 
unit instruction 


flush followed by flush 
Three clocks minus the number of instructions 
between the two flush instructions. There is no 
delay if the result is negative. 


fst or pst followed by pipelined floatinq-point 
One clock 
operation that overwrites the register being stored 


8.4 Instruction Characteristics 


The following table lists some of the characteristics 
of each instruction. The characteristics are: 
• What processing unit executes the instruction. 
The codes for processing units are: 
A 
Floating-point adder unit 
E 
Core execution unit 
G 
Graphics unit 
M 
Floating-point multiplier unit 
• Whether the instruction is pipelined or not. A P 
indicates that the instruction is pipelined. 


• Whether the instruction is a delayed branch in- 
struction. A D marks the delayed branches. 


• Whether the instruction changes the condition 
code CC. A CC marks those instructions that 
change CC. 


• Which faults can be caused by the instruction. 
The codes used for exceptions are: 
IT 
Instruction Fault 
SE 
Floating-Point Source Exception 
RE 
Floating-Point Result Exception, including 
overflow, underflow, inexact result 
OAT 
Data Access Fault 
Note that this is not the same as specifying at 
which instructions faults may be reported. A re- 
sult exception is reported on the subsequent 
floating-point instruction, pst, fst, or sometimes 
fld, pfld, and Ixfr. 


• 


The instruction access fault IAT and the interrupt 
trap IN are not shown in the table because they 
can occur for any instruction. 
• Performance notes. These comments regarding 
optimum 
performance 
are 
recommendations 
only. If these recommendations are not followed, 
the i860 XR microprocessor automatically waits 
the necessary number of clocks to satisfy internal 
hardware requirements. The following notes de- 
fine the numeric codes that appear in the instruc- 
tion table: 
1. The following instruction should not be a con- 
ditional branch (bc, bnc, bc.t, or bnc.t). 
2. The destination should not be a source oper- 
and of the next two instructions. 
3. A load should not directly follow a store that is 
expected to hit in the data cache. 
4. When the prior instruction is scalar, fsrc1 
should not be the same as the fdest of the 
prior operation. 
5. The fdest should not reference the destination 
of the next instruction if that instruction is a 
pipelined floating-point operation. 


6. The destination should not be a source opsr- 
and of the next instruction. (For call and calli, 
the destination is r1.) 
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7. When the prior operation is scalar and multipli- 
er opt is tsrct, Isrc2 should not be the same 
as the Idest of the prior operation. 


8. When the prior operation is scalar, tsrct and 
Isrc2 of the current operation should not be the 
same as Idest of the prior operation. 
9. A pfld should not immediately follow a pfld. 


• Programming restrictions. These indicate combi- 
nations of conditions that must be avoided by 
programmers, assemblers, and compilers. The 
following notes define the alphabetic codes that 
appear in the instruction table: 
a. The sequential instruction following a delayed 
control-transfer instruction may not be another 
control-transfer instruction (except in the case 
of external interrupts), nor a trap instruction, 
nor the target of a control-transfer instruction. 


b. When using a brl to return from a trap handler, 


programmers should take care to prevent traps 
from occurring on that or on the next sequen- 
tial instruction. IM should be zero (interrupts 
disabled) when the brl is executed. 


c. If Idest is not zero, tsrc t must not be the same 
as Idest. 


d. When tsrct goes to the multiplier opt, KR, or 
KI, tsrct must not be the same as Idest. 
e. If Ides! is not zero, tsrc t and Isrc2 must not be 
the same as tdest. 
f. isrct 
must not be the same as isrc2 for the 
autoincrementing form of this instruction. 
g. isrct must not be the same as isrc2. 
• Core and Floating-Point Instruction Interaction in 
Dual-Instruction Mode 
1. If one of the branch-on-condition instructions 
be or bne is paired with a floating-point com- 
pare, the branch tests the value of the condi- 
tion code prior to the compare. 
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2. If an Ixfr, fld, or pfld loads the same register 
as a source operand in the floating point in- 
struction, the floating-point instruction refer- 
ences the register value before the load up- 
dates it. 


3. An fst or pst that stores a register that is the 
destination register of the companion pipe- 
lined floating-point operation will store the re- 
sult of the companion operation. 


4. When the core instruction sets CC and the 
floating-point instruction is pfgt, pfle, or pfeq, 
CC is set according to the result of pfgt, pfle, 
or pfeq. 


5. When a trap instruction causes a trap in dual- 
instruction mode, the floating-point instruction 
has neither completed execution nor has up- 
dated the FT bit or any result status bits. This 
is not a problem when the trap is inserted by a 
debugger, because the trap is replaced by the 
original instruction, and the dual-mode pair is 
reexecuted. However, when the trap is pro- 
grammed, the trap handler must avoid reexe- 
cuting the trap by returning to user code at 
the address in fir + 8. In this case, the trap 
handler must emulate the floating-point in- 
struction before returning to the user code. 
Emulation of the instruction must include all 
side-effects (for example, the effect of its 
D-bit, effect on the pipelines, and effect on FT 
and result-status bits), just as if the instruction 
had been executed by the processor in the 
original context. 


6. In dual-instruction mode, when the Intovr 
in- 
struction causes a trap, the floating-point com- 
panion instruction has completely finished ex- 
ecution before the trap is taken. 


infel~ 


• Programming Restrictions 
for 
Dual-Instruction 
Mode 
1. The result of placing a core instruction in the 
low-order 32 bits or a floating-point instruction 
in the high-order 32 bits is not defined (except 
for shrd rO, rO, rO which is interpreted as 
fnop). 
- 


2. A fl,oating-point instruction that has the D-bit 
set must be aliqned on a 64-bit boundary (Le., 
the three least-significant bits of its address 
must be zero). This applies as well to the initial 
32-bit floating-point instruction that triggers 
the transition into dual-instruction mode, but 
does not apply to the following instruction. 


3. When the floating-point operation is scalar 
and the core operation is fst or pst, the store 
should not reference the result register of the 
floating-point operation. When the core opera- 
tion is pst, the floating-point instruction can- 
not be (p)fzehks or (p)fezhkl. 
4. When the core instruction of a dual-mode pair 


. is a control-transfer operation and the previ- 
ous instruction had the D-bit set, the floating- 
point instruction must also have the D-bit set. 
In other words, an exit from dual-instruction 
mode cannot be initiated (first instruction pair 
without D-bit set) when the core instruction is 
a control-transfer instruction. 


5. When the core operation is a Id.e or st.e, the 
floating-point operation must be d.fnop. 
6. When the floating-point operation is fxfr, the 
core instruction cannot be Id, Id.e, st, st.e, 
ealllxfr, or any instructiorvthat updates an in- 
teger register (including autoincrement index- 
ing). Furthermore, the core instruction cannot 
be a fld, fst, pst, or pfld that uses as isrc1 or 
isrc2 the same register as the idest 
of the 
fxfr. Additionally, in dual instruction mode, 
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fxfr may not be used in a branch delay slot if 
its destination register is referenced by the 
preceding branch instruction. 
7. A brl must not be executed in dual-instruction 
mode if any trap bits are set. 


a. When the core operation is be.t or bne.t, the 
floating point operation cannot be pfeq or 
pfgt. The floating-point operation in the se- 
quentially following instruction pair cannot be 
pfeq or pfgt, either. 


9. A transition to or from dual-instruction mode 
cannot be initiated on the instruction following 
a brl. 
10. An Ixfr, fld, or pfld cannot update the desti- 
nation of the companion floating-point in- 
struction (unless the destination is fO or f1) 2 
or.of the following pipelined floating-point in- 
struction (regardless of its destination regis- 
ter). No overlap of register destinations is 
permitted; for example, the following instruc- 
tions must not be paired: 


// Illegal 
case 
1 
d.fmul.ss 
f9, flO, 
f5 
fld.d 
address, 
f4 
; Overlaps 
f5 


// Illegal 
case 2 
d.fmul.ss 
fO, fO, f3 
fld.q 
address, 
fO 
; Overlaps 
f3 


// Illegal 
case 
3 
d.fmul.ss 
f9, flO, 
fll 
fld.l 
address, 
f5 
d.pfadd.ss 
fx, fx, f4 
Overlaps 
f5, if last 
stage 
result 
is double- 
precision 
11. During a locked sequence, a transition to or 
from dual-instruction mode is not permitted. 
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Table 8.9 Instruction Characteristics 


Instruction 
Execution 
Plpellned? 
Sets 
Faults 
Performance 
Programming 
Unit 
Delayed? 
CC? 
Notes 
Restrictions 


adds 
E 
CC 
1 
addu 
E 
CC 
1 
and 
E 
CC 
andh 
E 
CC 
andnot 
E 
CC 


andnoth 
E 
CC 
be 
E 
bc.t 
E 
0 
8 
bla 
E 
0 
8,9 
bnc 
E 


bnc.t 
E 
0 
8 
br 
E 
0 
8 
brl 
E 
0 
8,b 
bte 
E 
btne 
E 


call 
E 
0 
6 
8 
calli 
E 
0 
6 
8 
fadd.p 
A 
SE,RE 
faddp 
G 
8 
faddz 
G 
8 


famov.r 
A 
SE,RE 
fladd.z 
G 
8 
flsub.z 
G 
8 
~ 


flx.p 
A 
SE,RE 
fld.y 
E 
OAT 
2,3 
f 


flush 
E 
fmlow.p 
M 
4 
fmul.p 
M 
SE,RE 
4 
form 
G 
8 
frep.p 
M 
SE,RE 


frsqr.p 
M 
SE,RE 
fst.y 
E 
OAT 
5 
f 
fsub.p 
A 
SE, RE 
ftrunc.p 
A 
SE,RE 
fxfr 
G 
6,8 


fzchkl 
G 
8 
fzchks 
G 
8 
Intovr 
E 
IT 
Ixfr 
E 
2 
Id.c 
E 


Id.x 
E 
OAT 
6 
or 
E 
CC 
orh 
E 
CC 
pfadd.p 
A 
P 
SE,RE 
pfaddp 
G 
P 
8 
e 
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pfmam.p 
A&M 
P 
SE,RE 
7 
d 


pfmsm.p 
A&M 
P 
SE,RE 
7 
d 
pfmul.p 
M 
P 
SE,RE 
4 
c. 


pfmul3.dd 
M 
P 
SE,RE 
4 
c 
pform 
G 
P 
8 
e 
pfsm.p 
A&M 
P 
SE,RE 
7 
d 
pfsub.p 
A 
P 
SE,RE 


pHrune.p 
A 
P 
SE,RE 
pfzehkl 
G 
P 
8 
pfzehks 
G 
P 
8 
pst.d 
E 
OAT 
f 
shl 
E 


shr 
E 
shra 
E 
shrd 
E 
st.e 
E 
st.x 
E 
OAT 


subs 
E 
CC 
1 
subu 
E 
CC 
1 
trap 
E 
IT 
xor 
E 
CC 
xorh 
E 
CC 


9.0 
FUNCTIONAL 
CHARACTI:RISTICS 


The following characteristics of the 80860XR Micro· 
processor are additions to revision 2 of the i860™ 
Microprocessor 
Family 
Programmers 
Reference 
Manual, Intel order number 240875·002. This docu- 
ment will also appear in the 1993 revision of the 
Multimedia and Supercomputing Processors data 
book, Intel order number 272084-002. 


Four steppings of the 80860XR Microprocessor are 
covered in this docurnent; B2, B3, C1 and DO. Each 


• 


of the characteristics listed pertains to one or more 
of the steppings. Which steppings are affected ap- 
pears in the left hand columns before the number 
and title of each functional characteristic and are de- 
fined as follows: 
X 
indicates that the characteristic is affected the 
given stepping. 
F' indicates that the characteristic was fixed in the 
given stepping and no workaround is required. 
- 
indicates that the erratum was fixed in a previous 
stepping and no workaround is required. 
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Stepping 
Description 
B2 
B3 
C1 
DO 


X 
X 
F 
- 
1. 
Trap Handler 
User/Supervisor 
Fault 


Problem: When returning 
from the trap handler, a false data access trap may 
occur on the Id following 
the brl. This can happen in either of the two following 
situations: 


1. The target of the brl is an I-cache hit; the instruction 
after the target is an 
l-cache and TLB miss; and the page from which Id is loading is a supervisor- 
level page. 
2. HOLD is asserted 
after the instruction 
fetch for the brl target is begun and 
before the external 
bus cycle for the Id begins. 


Workaround· 
The page on which the trap handler saves the registers 
must be 
made readable 
in User mode, Le. both levels of page tables entries for that page 
must have U = 1. 


X 
X 
F 
- 
2. HOLD/HLDA 
in Multi- Transfer Stores 


Problem: 
If HOLD is asserted 
between 
the first and second transfers 
of a 128-bit 


store or between 
any transfers 
of a cache writeback 
to memory, then upon 
leaving the bus hold, the data for the next write cycle is issued on the bus one 
clock after ADS# 
is driven low for that write cycle. 


Workaround· 
Since the write data lags ADS# 
by one clock cycle, systems 
should not use HOLD with zero-wait-state 
write cycles. 


X 
X 
F - 
3. Flush 
with Paging after (f) Id 


Problem: 
In systems using paging, if an (f) Id causes a data cache writeback 
with 
a TLB miss and the next data access instruction 
is a flush 
instruction, 
then the 
data in the cache block being flushed will be corrupted 
and the processor 
may 


hang. 


NOTE: 
For more errata relating to the flush 
instruction 
see errata 
# 23 and # 43. 


Workaround· 
If the workaround 
for Erratum 
# 43 (flush 
with HOLD) is used, no 
workaround 
for this erratum is necessary, 
even if flush 
with paging is used 


If the workaround 
for Erratum 
#43 is not used, both of the two following 
workarounds 
are required: 


1. In order to prevent an (f) Id-induced 
cache write back from immediately 
preceding 
the flush 
instruction, 
systems using paging must execute 
the last 
(f) Id before the flush 
instruction 
twice. Thus, if that (f) Id were to cause a 
writeback, 
the writeback 
will be executed 
at the first of the two (f) Ids and not 


at the (f) Id immediately 
preceding 
the flush. 
2. Also, in order to prevent this problem 
from occurring 
when returning 
from the 
trap handler to the flush routine, the (f) Id in the delay slot of the brl at the 
end of the trap handler must be executed 
twice. However, 
since the brl only 
has one delay slot, the first of the two (f) Ids from that address must precede 
the brl and must be discarded 
to rO so as not to prematurely 
restore rt, which 
would corrupt the destination 
of the brl. 


The following 
is an example 
of the last few instructions 
of an appropriately 
modified trap handler: 


••• 
ld.l 
stack_area, 
rO 
bri 
rl 
ld:l 
stack_area, 
rl 
••• 
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Stepping 
Description 
B2 
B3 
C1 
DO 


X 
X 
F 
- 
4. NE NE 11 Incorrectly 
Asserted 
After Incomplete 
HOLD 


Problem: 
If HOLD is asserted 
and then deasserted 
before HLDA is asserted, 


then HLDA can be asserted 
for one clock cycle; on the following 
bus cycle, 
NENE 11 will be incorrectly 
asserted 
if that cycle would have been next-near 
given no hold state interruption. 
Although 
NENE 11 will usually still be valid at this 
point (depending 
on memory system design), its assertion 
does not follow the 
correct 
protocol, 
which specifies 
that NENE 11 should not be asserted 
on the 
next bus cycle after HLDA is deasserted. 


Workaround: 
Do not assert and then deassert 
HOLD before HLDA is asserted, 


and do not reassert 
HOLD until HLDA is deasserted 
for the previous 
HOLD. 


X 
X 
F 
- 
5. Multiprocessor 
A-bit Settings 
Cause Address 
Corruption 


Problem: 
In systems 
using paging, the correct 
protocol 
for setting the A-bit 
(Accessed 
bit) during TLB miss processing 
is as follows: 
1. Fetch PTE from memory with LOCK 11 deasserted 
and check the A-bit status. 
2. If the A-bit is clear, then refetch 
PTE with LOCKII 
asserted. 
3. Write PTE back to memory with the A-bit set and LOCK 11 deasserted. 


However, 
if between 
steps 1 and 2 the A-bit is set by another 
processor, 
then the 
TLB address transformation 
will be corrupted 
and the processor 
may hang. 


Workaround: 
In multiprocessor 
systems, the A-bits in shared page table entries 
must be set to 1 when the pages are allocated 
in order to avoid locked read! 
write A-bit set cycles for these pages. 


X 
X 
F 
- 
6. Incorrect 
Floating 
Point Result 
Trap in Multiplier 
Unit 


Problem: When a sequence 
of pipelined 
single-pre'cision 
multiplier 
operations 
is 
followed 
by a pipelined 
double-precision 
multiplier 
operation, 
the next-to-Iast 
single-precision 
operation 
may cause a result exception 
trap even though the 
result will be correctly 
discarded. 
This erroneous 
trap will only occur if the 
instruction 
executed 
immediately 
after the first double-precision 
multiply 
operation 
is 1) fst; 2) pst; 3) fld, pfld, or Ixfr into a register or register set 
overlapping 
the rdest 
of the first double-precision 
multiplier 
instruction; 
or 4) any 
floating 
point instruction 
other than a multiplier 
operation. 


Workaround' 
In the situation 
described 
above, the instruction 
immediately 
following 
the first double-precision 
multiply operation 
must not be 1) fst; 2) pat; 
3) fld, pfld, or Ixfr into a register or register set overlapping 
the rdeat 
of 
the first double-precision 
multiplier 
instruction; 
or 4) any floating 
point instruction 
other than a multiplier 
operation. 
In addition, 
if the instruction 
cache is enabled, 


then programs 
must not use a delayed branch whose delay slot instruction 
is the 
first pipelined 
double-precision 
multiplier 
operation 
when the first instruction 
at 
the target address 
is an fat, pat, f1d, pfld, 
Ixfr, or floating 
point instruction 
which 
could cause the spurious floating 
point trap. 
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Stepping 
Description 
B2 
B3 
C1 
DO 


X 
X 
F 
- 
7. fxfr 
Result Exception 
Trap Corrupts Destination 
Register 


Problem: An fxfr 
that reports a result exception 
may corrupt its destination 
register before control is transferred 
to the trap handler. 
In DIM, if either source 
of the core operation 
paired with the fxfr 
is the same as the fxfr's 
destination 
register, then when the program 
returns from the trap handler, the core operation 
paired with the fxfr 
will use the incorrect 
value in that register. Also, in both SIM 


and DIM, if the fxfr 
is used in the delay slot of a brl, calli, or bla which 
references 
the same register as the fxfr's 
destination, 
then upon returning 
from 
the trap handler, the branch will use the corrupted 
value in that register. 


Note that when this erratum is fixed, it will be permissible 
to use fxfr 
in the delay 
slot of a branch which references 
the fxfr's 
destination 
in SIM, but it will still be 
illegal to use it in such a delay slot in DIM. 


Workaround: 
In DIM neither of the sources of a core operation 
paired with an 
fxfr 
may be the same as the fxfr's 
destination 
register. 
In both SIM and DIM the 


fxfr 
instruction 
must not be used in the delay slot of a brl, calli, or bla which 
references 
the fxfr's 
destination 
register. For example, 


brl 
r5 
fxfr 
f2,r5 
must not be used. 


X 
X 
F 
- 
8. pf/add.ss/pf/sub.ss/pfmov.ss 
Results 
Corrupted 
by other Pipelined 
Graphics Instructions 


Problem: When the next graphics 
unit instruction 
after a pfladd.ss, 
pflsub.ss, 
or 
pfmov.ss 
is a pfzchkl, 
pfzchks, 
pfaddp, 
pfaddz, 
or pform, 
the result of 
the pfladd.ss, 
pflsub.ss, 
or pfmov.ss 
may be incorrect. 
Note that this problem 
only occurs with single precision 
pfladd, 
pflsub, 
and pfmov 
instructions. 


Workaround: 
Flush the graphics 
unit pipeline between 
a pfladd.ss, 
pflsub.ss, 


or pfmov.ss 
and a following 
pfzchkl, 
pfzchks, 
pfaddp, 
pfaddz, 
or pform 
with a 
pfladd.ss 
fO, fO, fn. 


X 
X 
F - 
9. Multiplier 
Pipeline Result Not Discarded 
at Precision 
Transition 


Problem: 
In correct execution, 
when a sequence 
of pipelined 
single-precision 
multiplier 
operations 
is followed 
by a pipelined 
double-precision 
multiplier 
operation, 
the result of the next-to-last 
single-precision 
multiply is discarded. 


However, 
if a floating 
point trap is reported 
during the two clocks after the first 
pipelined 
double-precision 
multiply, then the result of the next-to-last 
pipelined 
single-precision 
multiply may not be discarded, 
and the multiplier 
pipeline 
may 
not advance 
correctly. 
As a result, the next two double-precision 
pipelined 
multiplies 
or three single-precision 
pipelined 
multiplies 
after the first pipelined 
double-precision 
multiply may receive incorrect 
data. 


Workaround: 
Anyone 
of the following 
workarounds 
is sufficient: 


1. Flush the pipeline with two pfmul.ss 
fO,fO,fn instructions 
between 
the single- 
and double-precision 
multiplier 
operations. 
2. Disable floating 
point traps after the last pipe lined single-precision 
multiply; 


enable them again after the first pipelined 
double-precision 
multiply. 


3. Ensure that the two instructions 
(or two pairs of instructions, 
in DIM) following 
the first pipelined 
double-precision 
multiply are instructions 
which cannot 
report a floating 
point trap. Those which cannot report floating 
point traps are 


all core instructions 
except for fst, pst, and an fld, pfld, or Ixfr whose 
destination 
register (or register set) overlaps 
the destination 
of the first 
pipelined 
double-precision 
multiply. 
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Stepping 
Description 
82 
83 
C1 
DO 


X 
X 
F 
- 
10. flush 
with Paging May Corrupt Memory Data 


Problem: 
In systems using paging, if a flush routine writeback 
or instruction 
fetch 
causes a TLB miss, then modified 
data at the target address of one flush 
instruction 
may also be written to the target address of the next flush 
instruction, 


and the chip may hang. 


Workaround: 
If the workaround 
for Erratum 
# 43 (flush 
with HOLD) is used, no 
workaround 
for this erratum is necessary, 
even if flush 
with paging is used. If the 
workaround 
for Erratum 
#43 
is not used, all three of the following 
workarounds 
must be implemented: 


1. The old flush 
inner loop: 
D_FLUSH_LOOP: 
bla 
Rx,Ry,D_FLUSH_LOOP 
flush 
32 
(Rw) 
++ 
must be replaced 
with a new flush 
inner loop: 
D_FLUSH_LOOP: 
ixfr 
rO,fO 
bla 
Rx,Ry,D_FLUSH_LOOP 
flush 
32(Rw)++ 
ixfr 
rO,fO 
In addition, 
floating point traps must be disabled 
during execution 
of the Ixfrs. 


2. The D_FLUSH_LOOP 
label must be aligned on a 32-byte boundary 
by 
preceding 
the label with the .allgn 32 assembler 
directive. 
3. External interrupts 
must be disabled 
before entering the flush routine and 
reenabled 
after exiting the routine by clearing and setting the IM bit of the 
PSA. 


X 
X 
F - 
11. KNF, DIM, OS, BW, and BR Bits Not Write Protected 


Problem: The KNF, DIM, OS, BW, and BA bits of the PSA are not write protected 
in user mode. 


Workaround· 
Software 
should not assume write protection 
of these bits in user 
mode. 


X 
X 
F 
- 
12. 
TLB Miss Processing 
Address 
Corrupted After Id.c from IIpsr 


Problem: 
If a TLB miss resulting from an instruction 
fetch or cache writeback 
cycle follows 
a Id.c from the epsr, the address for the TLB miss processing 
may 
be corrupted. 


Workaround· 
Every Id.c epsr, 
rn should be immediately 
preceded 
by an Ixfr rO, 


10, and floating point traps must be disabled 
during execution 
of the Ixfr. 
In 
addition, the Ixfr must be aligned on a 32 byte boundary 
by preceding 
it with the 
assembler 
directive 
.allgn 32. 
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Stepping 
Description 
82 
83 
C1 
DO 


X 
X 
F - 
13. fmlow.dd 
Incorrectly 
Causes Floating 
Point Traps 


Problem: 
Although 
fmlow.dd 
is not supposed 
to cause floating 
point traps or update 


the result-status 
bits of the FSR, it can trigger a floating 
point trap in the following 
case: 
fmlow.dd 
rx,tv ,fz 
IICauses 
result underflow 
or overflow 
•• 
IIAny 
sequence of non-pfmul 
instructions 
• 
pfmul.xx 
fa,fb,fc 
IIAny precision pfmul 
•• 
IIAny 
sequence of non-pfmul instructions 
• 
pfmul.dd 
fl,fm,fn 
IIDouble 
precision pfmul only 
pfadd.ss 
fd,fe,ff 
IIReports erroneous floating point trap 
II(Any 
FP trap-reporting 
instruction) 


The instruction 
immediately 
following 
the pfmul.dd 
f1, fm, fn which reports the trap' 


can be any floating 
point trap-reporting 
instruction 
other than another 
multiplier 
unit 


instruction. 


Although 
a trap will be reported 
and the FT bit will be set (FT = 1) in the PSR, no 
floating 
point result status bits will be set in the FSR. 


Workaround'lf 
the trap handler finds FT set in the PSR but no result-status 
bits set in 
the FSR, then it should just return back to user code. 


X 
X 
F - 
14. Byte Enable Code Wrong on First Fetch in CS8 Mode 


Problem: 
During the first CS8 mode instruction 
byte fetch of a sequence 
of fetches 
within a 32 byte block boundary, 
the byte enable pattern BE7#:BEO# 
is hex FF rather 
than hex AF as it should be. 


Workaround'lnterpret 
the byte enable code hex FF to indicate the first instruction 
byte 


fetch in a CS8 mode instruction 
fetch sequence. 


X 
X 
F - 
15. frcp, 
frsqr 
After pfmul 
May Corrupt Data 


Problem: 
If a pfmul.ss 
or pfmul.sd 
immediately 
precedes 
an frcp 
or frsqr 
which in 
turn is immediately 
followed 
by a scalar floating 
point instruction 
other than a graphics 
unit instruction, 
then the result of the scalar floating 
point instruction 
may be corrupted. 


Note that the pfmul.ss/pfmul.sd 
preceding 
the frcp/frsqr 
could reside in the delay 


slot of a branch. 


Workaround' 
Either one of the following 
two workarounds 
will suffice: 


1. Separate 
the frcp/frsqr 
from the preceding 
pfmul.ss/pfmul.sd 
with a nop or 
other instruction. 


2. Replace the pfmul.ss/pfmul.sd 
immediately 
preceding 
the frcp/frsqr 
with a 
pfmul.dd. 
The replacement 
will not affect any result, since the pfmul 
data entering 
the multiplier 
pipeline should be discarded 
because the frcp/frsqr 
is scalar. 
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Stepping 


B2 
B3 
C1 
DO 
Description 


x 
X 
F 


X 
X 
F 


16. KNF (Kill Next Floating-point) 
Ineffective 
With brl/cs/ll 
in DIM 


Problem: When an instruction 
pair containing 
a brl or calli is executed 
in dual 
mode with the KNF bit set in the PSR, the floating 
point instruction 
paired with 
the brl or calli is executed, 
although 
it should not be. 


Workaround· 
Either one of the following 
two workarounds 
will suffice. The first is 
a trap-handler 
workaround, 
whereas 
the second is implemented 
in the compiler. 


1. The trap handler should not return to a dual pair containing 
a brl or calli with 


KNF set in the PSA. Instead, if the floating 
point operation 
paired with the brll 
calli should not be reexecuted, 
then the trap handler should execute 
the brll 
calli in single instruction 
mode before transitioning 
to dual mode on the next 
pair if appropriate. 
The following 
two rules implement 
this workaround: 


a. If the trap handler finds the DIM bit set and the DS bit clear, then it should 
clear DIM, set DS, and return to the instruction 
at the address 
in fir +4 
• 
with KNF clear. 


b. If the trap handler finds both the DIM and DS bits set, then it should clear 


both bits and return to the instruction 
at the address 
in fir + 4 with KNF 
clear. 


2. In dual instruction 
mode, pair every brl or calli with an fnop 
or a graphics 
unit 
instruction. 
Since the brl or calli cannot cause a data-access 
fault and the 
fnop 
or graphics 
unit instructions 
cannot cause a source exception, 
these 
dual pairs will never require the use of KNF. 


- 
17. f/d/pf/d//xfr 
May Not Report Floating Point Result Exception 


Problem: 
In correct operation 
an fld, pfld, 
or Ixfr whose destination 
register (or 
register set) overlaps 
the destination 
of a preceding 
scalar instruction 
which 
caused a floating 
point result exception 
(RE) should always report that RE, 


unless a floating-point 
operation, 
pst, or fst has already done so. However, 
if a 
trap other than a floating-point 
trap occurs between 
the RE-causing 
instruction 
and the fld/pfld/lxfr, 
the fld/pfldllxfr 
may not report 
the RE. Instead, the trap will be reported 
by the next floating 
point operation, 
fst, 
or pst after the fld/pfld/lxfr. 
The problem 
only occurs if the trap handler uses 
any floating 
point or graphics 
operations 
to handle the trap that occurred 
between 
the RE-causing 
instruction 
and the fld/pfld/lxfr. 
If the trap handler 
does not use floating 
point or graphics 
operations 
to handle the non-floating 
point trap, then execution 
will be correct. 
For example: 


fmul.dd 
fX,fy,fs 
II 
Causes 
an 
FP 
result 
exception 
••• 
<any 
non-floating 
point 
trap 
to 
trap 
handler 
requiring 
floating 
point 
pipeline 
manipulations> 


•• 
fld.l 
• 
° (rO),fz 
II 
Should 
report 
RE 
but 
may 
not 


• 
fmul.ss 
fO,fO,fO 
II Will 
report 
the 
result 
exception 
if 
II fld.l 
does 
not, 
even 
though 
fz has 
II been 
overwritten 
by 
the 
fld.l 
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Stepping 
Description 
B2 
B3 
C1 
DO 


X 
X 
F 
- 
17. 
(Continued) 


Workaround: 
If floating 
point traps are enabled 
(FTE = 1) and the trap handler 


uses any floating 
point or graphics 
operations 
to handle traps, then when 
handling any trap, the trap handler should check the result status bits of the FSR 
register. 
(The Id.c fsr which saves the result status bits of the FSR register must 


occur on the third or later instruction 
of the trap handler.) 
If any of the FSR result 
status bits is set, indicating 
a floating 
point RE, then the trap handler should 
handle the floating 
point RE as well as the other trap that caused the branch to 
the trap handler routine. 


X 
X 
F 
- 
18. pfld With HOLD May Corrupt D(1ta 


Problem: 
In systems 
using HOLD, if a pfld miss is followed 
by a pfld hit and the 


bus cycle for the pfld miss is delayed due to a HOLD request and acknowledge, 
then the pfld hit data fetch is completed 
before the pfld miss data fetch. As a 
result the data FIFa receives 
data out of order, and the data is corrupted. 
Note 
that this problem can only occur if pfld data has already been loaded into the 
data cache with fld. 


Workaround: 
Either one of the following 
two workarounds 
will suffice: 


1. Do not use pfld with systems which use HOLD. 
2. Only use pfld on data which cannot reside in the data cache. This 
workaround 
may be implemented 
by making pages containing 
pfld data 
noncacheable. 
- 
X 
X 
F - 
19. Core Operation 
May Overwrite 
CC Bit Set by pfgt/pfle/pfeq in DIM 


Problem: 
In correct operation, 
if a pfgt, pfle, orpfeq is paired with an ALU or 
logical core instruction 
in dual instruction 
mode, then the CC bit should be set 
according 
to the result of the pfgtlpfle/pfeq, 
and not according 
to the result of 
the ALU or logical core instruction. 
However, 
if a floating 
point source exception 


is reported 
on the pfgt/pfle/pfeq, 
then the trap handler will update the CC bit 


for the pfgt/pfle/pfeq, 
return to the user code, and reexecute 
the dual pair with 


KNF set. The ALU/logical 
core instruction 
may then modify the CC bit from its 
correct value. 


Workaround: 
Either one of the following 
two workarounds 
will suffice. The first is 
a trap-handler 
workaround, 
whereas 
the second is implemented 
in the compiler. 


1. After handling a pfgtlpfle/pfeq 
floating 
point source exception, 
the trap 
handler should not return with KNF set to a dual pair containing 
an ALU or 
logical core operation. 
Instead, it should emulate the core operation, 
except 
for the update of the CC bit, and then return to the next instruction 
pair with 
KNF clear and the DIM and OS bits modified according 
to the rules below. If 


the dual pair containing 
the pfgtlpfle/pfeq 
is in the delay slot of a delayed 
branch, then the trap handler must resume at the branch target. 


Rules for modifying 
DIM and OS upon returning 
to the subsequent 
instruction 


pair: 
a. If the trap handler finds the DIM bit set, the OS bit clear, and the D-bit of 


the floating 
point instruction 
set, then it should leave both the DIM and OS 


bits as they are and return to the next instruction 
pair. 


b. If the trap handler finds the DIM bit set, the OS bit clear, and the D-bit of 


the floating 
point instruction 
clear, then it should leave the DIM bit set, set 


the OS bit, and return to the next instruction 
pair. 


c. If the trap handler finds both the DIM and OS bits set, then it should clear 


both bits and return to the next instruction 
pair. 


2. In dual instruction 
mode do not pair a pfgt/pfle/pfeq 
with a core instruction 
that can modify the CC bit, Le. an ALU or logical operation. 
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Stepping 


B2 
B3 
C1 
DO 


Example 2 
pfmul.ss 
fa,fb,fx 
pfmul.ss 
fe,ff,fx 
pfmul.ss 
fe,ff,fx 
pfmul.ss 
fe,fl,fl 
II op_2 will 
be fa*fb, 
rather 
than 
0 
pfmul.ss 
fe,ff,fx 
pfmul.ss 
fe,ff,fx 
pfmul.ss 
fe,ff,fk 
II fk = fa*fb*fe, 
rather 
than 
0 


Workaround: 
Do not use f1 as the destination 
register when discarding 
a result. 


Discard the result to register fO instead. 


x 
X 
F 


Description 


20. 
Register 
Bypass of f1 Does Not Work 


Problem: 
If a single precision 
floating 
point instruction 
discards 
data into destination 
register f1 and the same instruction 
(in pipelined 
mode) or next instruction 
(in scalar 
mode) references 
f1 as a source, then the data being discarded 
into f1 will be 
mistakenly 
bypassed 
to the instruction 
using f1 as a source. Since f1 is supposed 
to 
be a read-only 
register whose value is zero, incorrect 
program 
execution 
may result. 


Example 
1 
fmul.ss 
fadd.ss 
fa,fb,fl 
fl,fe,fd 
II op_l will 
be fa"fb, 
rather 
than 
0 
II fd = 
(fa*fb)+fe, 
rather 
than 
fe 


X 
X 
X 
F 
21. Multiplier 
Pipeline Not Cleared of Result Exception 
by frcp/frsqr 


Problem: 
If a result exception 
is in the first stage of the multiplier 
pipeline and a frsqrl 
frcp is executed, 
the result exception 
should be cleared from the pipe but is not. As a 
consequence, 
a result exception 
with no result-status 
bits set will occur. 


Example: 
pfmul.dd 
f2,f2,f8 
II Causes 
result 
exception 
frsqr.dd 
f4,f6 
pfmul.dd 
fX,fx,fy 
pfmul.dd 
fa,fb,fe 
pfiadd.dd 
fq,fr,fs 
II Triggers 
RE reporting 


Workaround: 
If the trap handler finds FT set in the PSR but no result-status 
bits set in 
the FSR, then it should just return back to user code. 


X 
X 
F 
- 
22. 
Short HOLDIHLDA 
Sequences 
With Paging Can Cause Data Corruption 


Problem: 
In systems 
using HOLD and paging, internal data corruption 
may occur if 
HLDA is asserted 
for 3 clocks or fewer. This problem can only occur if all of the 
following 
conditions 
are met: 
1. HOLD is asserted 
during the 1st, 2nd, or 3rd fetch of an instruction 
fetch sequence. 
2. The last bus cycle before HLDA is asserted 
is an instruction 
fetch of a store 
instruction 
which causes a TLB hit and a data-access 
fault. 


3. HOLD is deasserted 
on the 1st, 2nd, or 3rd clock during which HLDA is asserted. 


Workaround: 
Either HOLD must never be asserted 
during an instruction 
fetch 
sequence, 
or HOLD must remain asserted 
for at least four clocks after HLDA is 
asserted. 
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B2 
B3 
C1 
DO 


X 
X 
F 
- 
23. Sticky Inexact Bit of FSR May Be Incorrectly 
Set 


Problem: 
Scalar multiply operations 
may incorrectly 
set the SI bit of the FSR 
when the multiply unit has a data-dependent 
rounding 
freeze. If the SI bit is set, 


then the, processor 
mayor 
may not have encountered 
an inexact result. 


Workaround: 
Do not use the SI bit. 


X 
X 
F 
- 
24. Floating 
Point Underflow 
Trap May Occur When FZ Set 


Problem: 
In correct 
operation, 
when the FZ bit in the FSR is set, no floating 
point 


traps are reported 
on underflow 
results, although 
the MU (or AU) bit is set. 


However, 
if during floating 
point pipeline resumption 
MU (or AU) is restored 
to 
the multiplier 
(or adder) pipeline by setting the U (Update) 
bit of the FSR, then a 


false underflow 
trap may occur even though 
FZ is set. This false underflow 
trap 


may occur upon execution 
of the first floating 
point trap-reporting 
instruction 
after the MU (or AU) bit has propagated 
to the third stage of the pipeline. 


Workaround: 
When restoring 
a floating 
point pipeline 
by setting the U bit in the 
FSR, the trap handler must clear the MU (or AU) bit whenever 
the FZ bit is set. 


X 
X 
X 
F 
25. AA Bit Not Set Correctly 


Problem: The (p) fix and (p) ftrune 
instructions 
may not set the AA bit of the 
FSR correctly. 
Specifically, 
the AA bit may report that the conversion 
of an exact 
non-positive 
number to an integer value involved 
an upward rounding, 
when in 


fact it did not. Note that the AA bit is not IEEE defined. 


Workaround: 
Disregard 
the AA bit when using (p) fix or (p) ftrune. 
The 
conversion 
result will still be correct. 


X 
X 
F 
- 
26. famov 
and pfamov 
Erroneously 
Normalize 
Negative 
Denormals 


Problem: famov 
and pfamov 
(which is used in the trap handler to restore the 
adder pipeline) 
erroneously 
normalize 
negative 
denormals. 


woosround: 
If fsrc1 of a (p) famov 
is a negative denormal 
as defined 
in PRM 
Table 2-2, then the (p) famov 
must be replaced 
by a (p) fadd whose fsrc2 is 
negative zero or by a (p) fsub whose fsrc2 is positive zero. 


x 
X 
X 
F 
27. Pipeline Precision 
Transition May Cause False Result Exception 


Problem: 
If both of the following 
conditions 
are met, a false result exception 
can 


be reported. 
However, 
the result exception 
bits in the FSR due to this result 
exception 
will be cleared by the time the processor 
branches 
to the trap handler. 


1. When the multiplier 
pipeline transitions 
from single- to double-precision 
pipelined 
mode, the second-to-last 
single precision 
pipelined 
instruction 
causes a result exception 
trap and a multiplier 
rounding freeze. 


2. A floating 
point trap-reporting 
instruction 
is decoded 
during the multiplier 
freeze. 


Workaround: 
Either one of the following 
two workarounds 
will suffice: 


, 
, 
1. Flush the multiplier 
pipeline when switching 
source precision. 
(This 
workaround 
is preferred.) 
2. If the trap handler finds FT set in the PSR but no result-status 
bits set in the 
FSR, then it should just return back to user code. 
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B2 
B3 
C1 
DO 


X 
X 
X 
F 
28. 
Flush Instruction 
on 16-Byte Boundary 
Corrupts Data 


Problem: 
if the address referenced 
by a flush instruction 
is aligned' on a 16-byte 
boundary 
rather than a 32-byte boundary, 
data corruption 
may result. The data in 
the upper half of the cache line will be written back both to its corresponding 
address in memory and also to the memory address corresponding 
to the lower 
16 bytes of that cache line. 


Workaround· 
Either one of the following 
two workarounds 
will suffice: 


1. Align all addresses 
referenced 
by the flush instruction 
on 32-byte boundaries. 
Flushing on 32-byte boundaries 
causes both cache line halves to be written 
back to the correct 
memory location 
as needed, and clears the modified 
bit of 
the lower half of the cache line. 


2. If flushing 
on a 16-byte boundary 
is necessary 
(for example, 
to clear the 
modified 
bit of the upper half of the cache line), a flush to the 32-byte 
boundary 
immediately 
below that 16-byte boundary 
must immediately 
precede 
the flush to the upper half of the cache line. This first flush changes 
the tag address for that line to indicate the reserved 
flush area; the 
subsequent 
flush on the 16-byte boundary 
will then cause an erroneous 
writeback 
to the flush area. However, 
the erroneous 
writeback 
will not corrupt 
system data because 
it does not affect the valid copy of the cache line which 
has already been written back to the correct 
memory address for that data. 


X 
X 
F 
- 
29. 
External Interrupts 
Ignored 
While Data = Lock Opcode During Read 


Problem: 
During a read cycle, external 
interrupts 
are ignored while the opcode 
for the lock instruction 
appears on the data bus before READY # is asserted. 
External interrupt 
acknowledgment 
resumes normally when this data pattern is 
changed. 
This situation 
is only a problem for sites using multiple processors 
in 
lock-step; 
even then, there is no functional 
difference 
between 
the two 
processors 
as far as code execution 
is concerned. 


Workaround· 
When running multiple processors 
in lock-step, 
ensure that both 
processors 
always see the same data patterns. 


X 
X 
F 
- 
30. 
flush with HOLD May Corrupt Data or Hang Processor 


Problem: 
In systems 
using HOLD/HLDA 
with flush, the bus cycle for a writeback 
caused by a flush instruction 
can be "lost", 
causing the writeback 
data to be 
stuck in the internal write buffers. This condition 
will cause data corruption 
and 
may cause the processor 
to hang. The problem 
occurs only when both of the 
following 
conditions 
are met: 


1. The data cache has at least two cache line entries which have been half 
modified 
(EITHER the upper or lower half is modified, 
but not both). 
2. HOLD is asserted 
during the flush routine. 


Workaround: 
For systems using HOLD/HLDA 
with flush, the following 
procedures 
are recommended 
for flushing the data cache and should be used in 
place of the one specified 
in the PRM. Upon RESET the data cache must be 
initialized 
using the flush instruction. 
To flush the data cache subsequently, 
the 
fld instruction 
must be used in the flush loop. The data cache initialization 
and 
flush routines require an 8 KB reserved 
cacheable 
flush memory area. The 
reserved 
area must be hardware 
(KEN #) and page table (CD/WT) 
cacheable. 


NOTE: 
If this workaround 
is used, no workarounds 
for Errata # 10 and # 23 are 
necessary, 
even if flush with paging is used. 
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Stepping 


82 
83 
C1 
DO 


init2: 
bla 
flush 
bri 
st.c 


Description 


x 
X 
F 
30. (Continued) 
Data cache Initialization at RESET 
The following 
procedure 
should be used to initialize the data cache at RESET: 


.data 
flush_area: : 
.byte [8192] 0 


•text 


// Using method of your choice. reserve 
// 8 KB of writable. cacheable memory • 


reset initialization:: 
Id.c 
dirbase. Rv 
adds 
-1. rO. Rx 
or 
127. rO. Ry 
or 
1%flush_area-32. rOt Rt 
orh 
h%flush_area-32. Rt. Rt 


// Save dirbase 
// Loop decrement 
// Loop counter 
// Beginning virtual addr 
// minus 32B (to allow for 
// autoincrement) 


11 Initialize the first half of the cache 
andnot OxFOO. 
Rv , Rw 
11 Clear RC. RB; put result in R" 
or 
x0800. 
Rw. R" 
11 Set MSB of RC 
bla 
Rx. Ry. 
initl 
11 One time to initialize LCC 
st.c 
R". dirbase 
11 Store dirbase; RC = 2. RB = 0 


initl: 
bla 
Rx. Ry. 
initl 
flush 
32(Rt)++ 


// Now 
or 
or 
bla 
st.c 


initialize the 
Ox900. Rw. Rw 
127. rO. Ry 
Rx, Ry. init2 
Rw. dirbase 


Rx , Ry. init2 
32(Rt)++ 
rl 
Rv. dirbase 


// Loop for 128 iterations to 
// initialize 1st 4 KB block 


other half by changing RB 
// Set RC = 2 and RB = 1 
// Reset loop counter 
// One time to initialize 
// RC = 2. RB = 1 
LCC 
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// Loop for 128 iterations to 
// initialize 2nd 4 KB block 
// Return to calling procedure 
// Restore original dirbase 


orh 
h%flush_area-32, Rt, Rt II minus 32B (to allow for 
II autoincrement) 


II Flush 1st half of 
andnot OxFOO, Rv, Rw 
or 
Ox800, Rw, Rw 
bla 
Rx, Ry, flushl 
st.c 
Rw, dirbase 


flushl: 
bla 
Rx, Ry, flushl 
fld.d 
32(Rt)++, fO 


cache (with writeback if modified) 
II Clear RC, RB; put result in Rw 
II Set MSB of RC 
II One time to initialize LCC 
II Store dirbase; RC = 2, RB = 0 


II Loop for 128 iterations to 
II load from each addr, causing 
II writebacks from modified lines 
II Now flush second half of the cache 
or 
Ox900, Rw, Rw 
II Set RC = 2 and RB = 1 
or 
127, rO, Ry 
II Reset loop counter 
bla 
Rx, Ry, flush2 
II One time to initialize LCC 
st.c 
Rw, dirbase 
II RC = 2, RB = 1 


flush2: 
bla 
Rx, Ry, flush2 
fld.d 
32(Rt)++, fO 


bri 
st.c 
rl 
Rv, dirbase 


II Loop for 128 iterations to 
II load from each addr, causing 
II writebacks from modified lines 


II Return to calling procedure 
II Restore original dirbase 
x 
X 
X 
F 
31. pfmu/3.ddCanModifyCCBit 
Problem: The pfmul3.dd 
instruction, 
which is intended 
primarily for use in the trap 
handler, can erroneously 
modify the CC bit in the PSR. This modification 
may result in 
code execution 
errors if an instruction 
which follows the pfmul3.dd 
tests the CC bit. 


Workaround· If an instruction 
following 
a pfmul3.dd 
tests the CC bit, save the value of 
the CC bit before the pfmul3.dd 
and restore it after that instruction. 
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Stepping 
Description 
B2 
B3 
C1 
DO 


X 
X 
X 
F 
32. 
IL Bit Is Not Set on DA T after UNLOCK 


Problem: 
When a data access trap occurs on the first load or store after an unlock 
instruction, 
the IL bit of the EPSR is not set. Thus, the trap handler returns to that load 
or store instruction 
instead of returning to the lock instruction 
as it should. This 
problem 
also occurs if the address of the first load or store after the unlock instruction 
matches 
the data breakpoint 
(DB) register. 


Workaround: 
Both of the following 
workarounds 
must be implemented: 


1. Move the load or store that follows the unlock instruction 
prior to the unlock 
instruction, 
and replicate 
that load or store as a dummy load or store to the same 
address after the unlock instruction. 
This guarantees 
that the unlocking 
dummy 
load or store will not trap. 
2. Do not permit a data breakpoint 
register trap to occur on the unlocking 
load or 
store. 


X 
X 
X 
F 
33. pfld after Multicycle 
Write Cycle with HOLD May Corrupt Data 


Problem: 
If all the following 
conditions 
are met, pfld data may be corrupted. 


1. A Id- or fld-induced 
cache line writeback 
or a fst.q generates 
data in the write-back 
buffers to be written out to memory in a multi cycle write cycle. A multicycle 
write 
cycle is any write cycle which requires more than one address issued to complete 
the bus cycle. (The flush 
instruction 
may also cause the problem, 
but only if it is 
used as a general purpose instruction, 
rather than in the flush loop. As stated in the 
Programmer's 
Reference 
Manual, such a usage has undefined 
results.) 
2. The multicycle 
write cycle is interrupted 
by HOLD and HLOA, allowing another 
bus 
master on the bus between 
the writes of the multicycle 
write cycle. 
3. When HOLD is asserted, 
there are no internal requests 
for bus cycles pending 
(other than for the cycles of the multicycle 
write described 
in section 
1 above). 
4. A pfld hits the data cache before all cycles of the multicycle 
write cycle have 
completed. 
The data at the address of this pfld may be corrupted. 
This data will be 
written to the fdestof 
the third pfld later. 


Workaround' 
Anyone 
of the following 
three workarounds 
will suffice: 
1. Only use pfld on data which cannot 
reside in the data cache. This workaround 
may 
be implemented 
by making pages containing 
pfld data noncacheable. 
2. Do not use pfld with systems that use HOLD. 
3. For systems that use HOLD and allow pfld to access cacheable 
data, in'sert a 
dummy store that is guaranteed 
to be a data cache miss between 
each Id or fld 
and a following 
pfld. 


X 
X 
X 
F 
34. 
IL Bit Mistakenly 
Set on Trap Before Lock 
Problem: 
If the instruction 
executed 
immediately 
prior to a lock instruction 
traps on a 
OAT, IT, or FT, the trap handler will find the IL (interlock) 
bit of the EPSR set, 
mistakenly 
indicating 
that a locked sequence 
is in progress. 
Note that this problem 
can 
occur either in sequential 
execution 
or when the trapping 
instruction 
is in the delay slot 
of a branch or call whose target is a lock instruction. 


Workaround' 
Either one of the following 
two workarounds 
will suffice: 


1. Put a nop before every lock instruction. 
In addition, 
if the lock is a branch target, 


1- 
change the branch target address to the nop before the lock. 
2. Ensure that the instruction 
preceding 
a lock instruction 
can never trap. 
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Stepping 
Description 
82 
83 
C1 
DO 


X 
X 
X 
F 
35. 
Incorrect 
Address 
Translation 
on st.c Setting ITI after D-cache 
Miss/TLB 
Hit 


Problem: 
If a data access instruction 
Uf) Id, (f) st, pst, or pfld]which 
is a data cache 
miss/TLB 
hit is followed 
by a st.c 
which sets the ITI bit of dlrbase, 
then an incorrect 
or extra address translation 
may occur. The st.e does not need to immediately 
follow 
the data access instruction 
in order to cause the problem; 
any number of instructions 
may intervene. 


Workaround' 
Execute an Ixfr rO, fO immediately 
before a st.c which sets the ITI bit, 


and follow the st.c immediately 
with six nops, as specified 
in the Programmer's 
Reference 
Manual. Floating point traps must be disabled 
during execution 
of the Ixfr. 


X 
X 
X 
F 
36. 
OAT in DIM with Paging May Cause FP Pipeline Corruption 


Problem: 
If all of the following 
conditions 
are met, a floating 
point pipeline 
may be 
corrupted 
as described. 


IF: 
1. Paging is enabled. 
2. Instruction 
caching 
is enabled 
both in hardware 
and in software. 
3. A Id instruction 
that does not reside in the delay slot of a branch or call causes a 
data access trap (DAD. 
4. The OAT -causing Id, which may occur either in single or dual instruction 
mode 
(DIM), is followed 
by a DIM pair. 


5. The DIM pair which follows the Id instruction 
has: 
a) a scalar floating 
point half (Le. not fnop, 
fxfr, 
or a pipelined 
instruction). 
b) an integer half which causes a freeze because 
one of its source operands 
overlaps 
with the Id's destination. 
THEN: 
After trapping 
on the OAT, the first pipelined 
FP operation 
in the trap handler may 
corrupt its corresponding 
pipeline. 
For example, 
if the first pipe lined FP operation 
in 
the trap handler is a pfmul, 
the multiplier 
pipeline 
may be corrupted. 


Workaround' 
Either one of the following 
two workarounds 
will suffice: 


1. In DIM or a DIM transition, 
if the destination 
register of a Id instruction 
is referenced 
as a source in the integer half of the next dual pair, then the floating 
point 
instruction 
of that pair must be non-scalar 
(Le. fnop, 
fxfr, 
or a pipe lined instruction). 


, 2. On any trap, the trap handler should first save the register state (but not the 
pipeline state) of the CPU and then determine 
whether 
a Id has caused a OAT 
while DIM and/or 
OS is set. If not, the trap should be handled 
normally, 
but if so, it 
should check whether 
the Id is in the delay slot of a branch or call. If it is, the trap 
should be handled 
normally, 
but if not, the trap handler should examine the next 
instruction 
or instruction 
pair in user code after the trapping 
instruction. 


If the Id is followed 
by a DIM pair in which the FP half is a scalar adder, multiplier, 
or 
graphics 
operation, 
then the first FP operation 
performed 
in the trap handler after 
saving register state should be a pipe lined adder, multiplier, 
or graphics 
unit instruction 
respectively. 
Otherwise, 
the trap should be handled 
normally. 


X 
X 
X 
F 
37. 
BL Bit Not Set Immediately 
After lock 
Instruction 


Problem: A Id.c dlrbase, 
rx immediately 
after a lock instruction 
may show the BL bit 
not set, although 
it should be. However, 
if any other instruction 
intervenes 
between 
the 
lock and the Id.e dlrbas8, 
rx, the BL bit will appear correctly 
set. 


Workaround' 
Ensure that there is at least one instruction 
between 
a lock and a 
following 
Id.e dlrbas8, 
rx. 
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82 
83 
C1 
DO 


40. 
Id.c tsr in DIM May Return the Wrong Value 


Problem: 
In dual instruction 
mode, if Id.c fsr is followed 
by a pipe lined floating 
point instruction, 
then this pipelined floating 
point instruction 
may update the fsr 
early. The Id.c may return the updated fsr value from the execution 
of the 
pipelined floating 
point instruction. 


NOTE: 
This problem 
does not occur in single instruction 
mode. 


Workaround: 
In dual instruction 
mode, do not follow the Id.c fsr instruction 
with a 


pipelined floating 
point instruction. 


x 
x 
X 


Description 


X 
38. First 32-byte Block of Trap Handler 
Code May Execute 
Incorrectly 


Problem: 
Under certain rare conditions, 
requiring paging, caching 
and a trap, the 
CPU may skip or execute 
twice any of the instructions 
on the first 32-byte block 
of the trap handler code. 


Workaround: 
For systems 
using paging and caching, 
place eight nops as the first 
eight instructions 
of the trap handler code (at address OxFFFFFFOO). 


X 
x 
X 
39. INTOVR 
May Trap Incorrectly 


Problem: The OF bit is unreliable 
in the trap handler during a OAT, SE or RE that 
is followed 
by an adds/addu/subs/subu 
instruction. 
In dual instruction 
mode, if 
the Intovr 
instruction 
is followed 
by an adds/addu/subs/subu 
instruction, 
and the floating 
point instruction 
paired with the Intovr 
traps, Intovr 


may trap incorrectly 
upon returning from the trap handler. The adds/addu/ 
subs/subu 
instruction 
can set the OF bit by the time the floating 
point trap is 
taken. When the trap handler returns to the Intovr 
pair, the OF bit is set in the 
EPSR, and the Intovr 
instruction 
causes an instruction 
trap (IT). Consider 
the 
following 
example: 
II 
OF=O 


, 
A: 
d.fop 
II FP operation reports an SE or RE 
intovr 
II Should not r~port an overflow trap 
B: 
d.fop 
II Another FP operation 
addu 
II Sets OF 


The Intovr 
instruction 
should not trap since OF is clear. However, 
the OF bit is 
set by the addu instruction 
before an SE or RE is taken. When the trap handler 
returns to pair A, the Intovr 
instruction 
now traps because 
OF is set, even 
though it should not trap. 


Workaround: 
In single instruction 
mode, the trap handler should ignore the OF 
bit. 


In dual instruction 
mode, if a floating 
point trap occurs, and the instruction 
at 
FIR + 4 (core half of the pair) is the Intovr 
instruction, 
the trap handler must set 
the OF bit (EPSR) to the value of the IT bit (PSR). This will guarantee 
that the OF bit is set correctly 
upon return to the dual pair. If the Intovr 
instruction 
trapped 
(IT = 1 and FIR contains 
Intovr), 
then OF should be set to 1. 
If the Intovr 
instruction 
didn't trap (IT = 0 and FIR contains 
Intovr), 
then OF 
should be set to O. 


X 


NOTE: 
This workaround 
must be placed in the section 
before the normallntovr 
handling 
in the trap handler. 


X 
X 
X 
x 
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B2 
B3 
C1 
DO 
Description 


x 
X 
X 
X 
41. 
PFLO Pipeline May Return Cotrupted 
Data 


Problem: 
Under the following 
conditions 
the PFLD pipeline can lose synchronization, 


resulting 
in corrupted 
data. Once the pipeline is out of synchronization, 
it will remain 
out of synchronization 
until the chip is reset. All of the conditions 
listed below must be 


present 
in order for the error to occur. 


1. A pfld instruction 
is near the end of an instruction 
cache line. 
and; 
2. The next instruction 
to be fetched 
is both an instruction 
cache miss and a TLB 
miss. 


and; 
3. Data for the pfld resides on the same page as the instruction 
cache miss. 


Workaround: 
Ensure that pfld data resides on non-instruction 
pages. 


NOTE: 
The following 
procedure 
may be used during a context 
switch to test for pfld pipeline 


corruption. 
This allows the problem to be detected 
and localized 
to a single user. If the 
test fails then the pipeline is corrupted 
and the 80860XR 
must be RESET in order to 
use pfld instructions 
again. 


" 
pfld PIPELINE TEST ROUTINE. 
" 
Mem_addr is the address of three consecutive double word 
" 
memory locations that have been initialized with some 
" 
pfld load test data. 
" 
Save_Stagel, Save_Stage2 and Save_Stage3 are memory 
" 
locations used as temporary storage for pipeline data. 
" 
Rx is an integer register and Fx, Fy, Fz are floating 
" 
point registers used in the test. 
" 
The test data is '1', '2', and '3' in this example. 


test-pipes: 
or 
l%Mem_addr, rO, Rx 
orh 
h%Mem_addr, Rx, Rx 
" 
Set Rx pointer to memory 
" 
addr containing test data. 


" 
Load values from memory into 
" 
pipeline and save current 
" 
pipeline data to Fx, Fy, Fz. 


" 
Save old Stagel data 
" 
Save old Stage2 data 
" 
Save old Stage3 data 


pfld.d O(Rx) , Fx 
pfld.d 4(Rx) , Fy 
pfld.d B(Rx) , Fz 


fst.d Fx, Save_Stagel(rO) 
fst.d Fy, Save_Stage2(rO) 
fst.d Fz, Save_Stage3(rO) 


pfld.d 
Save_Stagel(rO), Fxl/ Fx <- '1' and restore Stagel 
pfld.d 
Save_Stage2(rO), Fyl/ Fy <- '2' and restore Stage2 
pfld.d 
Save_Stage3(rO), Fz" 
Fz <- 
'3' and restore Stage3 


fst.d 
Fx, Save_Stagel(rO) " 
Memory 
(Save_Stagel) <- Fx 
fst.d 
Fy, Save_Stage2(rO) " 
Memory 
(Save_Stage2) <- Fy 
fst.d 
Fz, Save_Stage3(rO) " 
Memory 
(Save_Stage3) <- Fz 


ld.l 
Save_Stagel(rO), Ry 
btne 
1, Ry, test_fail 
ld.l 
Save_Stage2(rO), Ry 
btne 
2, Ry, test fail 
ld.l 
Save_Stage3(rO), Ry 
bte 
3, Ry, test_pass 
test-fail : 
" 
Insert a branch to HALT the system or RESET the BOB60XR. 
test_pass: 
" 
pfld pipeline is not corrupted - Continue 


/I 
Ry <- Memory(Save_Stagel) 
/I 
Test for (Ry = 
'1') 
/I 
Ry <- Memory(Save_Stage2) 
/I 
Test for (Ry = 
'2') 
/I 
Ry <- Memory(Save_Stage3) 
/I 
Test for (Ry = 
'3') 
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Stepping 
Description 
B2 
B3 
C1 
DO 


X 
X 
F 
- 
42. 
Multiple Sequential 
Transfers between 
the Integer and Floating Point Units 
can Result in Corrupted 
Data 


Problem: When executing 
a very tight loop resulting 
in large numbers 
of transfers 
between 
the integer and floating 
point units, data can become 
corrupted. 
All of 


the following 
conditions 
listed below must be present 
in order for the error to 
occur. 


1. High Temperature 
(> 70·C Tease) 
and; 
2. Low Voltage 
« 
50V) 
and; 


3. A very long (> 109), tight loop involving 
transfers 
between 
the integer and 
floating 
point units. 


The loop length required to cause failure differs with temperature 
and voltage. 


Workaround: 
No feasible 
workaround 
in B2/B3 
steppings. 


X 
X 
X 
X 
43. 
LOCK Protocol 
Failure at Page Boundary 


Problem: An Instruction 
Access Trap (IAT) may be incorrectly 
reported, 
and the 
access (A) bit of a Page Table or Page Directory 
Entry may be updated without 
assertion 
of the LOCK if pin, when all of the following 
conditions 
occur. 


1. A LOCK sequence 
finishes 
near the end of a page (the st instruction 
following 
the unlock instruction 
is one of the last four instructions 
on the 
page) 
and; 
2. Paging is enabled 
and; 
3. The access (A) bit is not set in the following 
page. 


Workaround: 
Anyone 
of the following 
four workarounds 
will suffice: 


1. Disable Paging 
2. Set the A bit in all Page Table (and Page Directory) 
Entries to 1. 
3. Finish lock sequences 
before the last four instructions 
of the current page. 
4. 
Ignore IATs signaled 
by the processor 
when none of the following 
valid IAT 
conditions 
is present: 
a) Present bit not set in PTE or PDE (PTE.P = 0) 
b) Supervisor 
Page Protection 
Violation 
(PSR.PU = 1 AND PTE.U = 0) 
c) Access 
Bit not set in PTE during a lock sequence 
(PTE.A = 0 AND 
PSR.IL= 
1) 


NOTE: 
If workaround 
if 4 is used, the A bit of the following 
page may be set by the 
processor 
without the LOCK if pin asserted. 
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SHEET REVISION 
REVIEW 


The following list represents the key differences be- 
tween version 002 and version 001 of the i860 XR 
Microprocessor Data Sheet. 
1. 
Big-endian description in section 2.3 has been 
expanded. 
2. 
Bit 17 of the Extended Processor Status Regis- 
ter (EPSR) is the INT bit which reflects the value 
on the interrupt pin (INT), as described in sec- 
tion 2.2.4 entitled "EXTENDED PROCESSOR 
STATUS REGISTER". This is a documentation 
update only. 
3. 
The cacheability of a page is controlled by 
NOR'ing the value of the CD, WT bits and the 
KEN# input pin, as described in section 2.5 en- 
titled "Caching and Cache Flushing" and sec- 
tion 3.1.14 entitled "Cache Enable (KEN#)". 
This is a documentation update only. 
4. 
The NOTE section in section 2.5 entitled "Cach- 
ing and Cache Flushing" has been updated to 
clarify the paging requirement on changing the 
DTB field in the dlrbase register. 
5. 
Information on register encoding is added in 
section 8.2 entitled "Instruction Format and En- 
coding". This is a documentation update only. 


The following list represents the key differences be- 
tween version 003 and version 002 of the i860 XR 
Microprocessor Data Sheet. 


Specification Changes: 
1. 
Specification changes for improved AC perform- 
ance are in section 7.3. 
2. 
HOLD is acknowledged during locked bus cy- 
cles. See section 3.1.8. 
3. 
Additional paths have been added to the bus 
state diagram to allow direct transitions from 
states T12 and T11 to state TH. See Figures 4.1 
and 4.10. 
4. 
Two new instructions, (p)famov.r, 
have been 
added. 
These 
replace 
(p)fadd.ds 
and 
(p)fadd.sd 
in 
the 
assembler 
pseudo-ops 
(p)fmov.r. 
These changes are in section 8.1 
and tables 2.7, 8.7, and 8.9. 


Documentation Changes: 
1. 
Big and little endian description has been ex- 
panded in sections 2.2.2, 2.3, and Figure 2.8. 
2. 
The actions and explanations of the lock, un- 
lock, and st.c dirbase changing the BL bit have 
been updated in sections 2.2.4, 3.1.5, 3.1.8, 
4.3.4, 4.3.5, and 8.1. 
3. 
The explanation of the AA and MA bits of the 
fpsr have been expanded in section 2.2.8. 


i860™ 
XR MICROPROCESSOR 


4. 
The explanation of the WT bit of the Page Table 
Entries has been expanded in sections 2.4.4.4 
and 2.5. 
5. 
A change concerning the locking of the bus dur- 
ing address translation is explained in sections 
2.4.5 and 2.8.5. 
6. 
A further explanation on when to flush the data 
cache is given in section 2.5. 
7. 
The explanation of the floating point multiplier 
pipeline has been expanded in section 2.6.1. 
8. 
The explanation of BREQ has been expanded 
in section 3.1.4 and Figure 4.1. 
9. 
The explanation of result exceptions has been 
expanded in sections 2.8 and 3.2. 
10. Instruction fetch identification has been clarified 
in section 3.1.6 and table 3.2. 
11. Bus cycle diagrams in Figures 4.7, 4.8, and 4.10 
have been clarified/corrected. 
I 


12. Precision specification .r has been added to 
section 8.0 and table 8.1. 
13. In section 8.4, performance note 9 has been 
added, programming restriction d has been 
changed, and programming restriction f has 
been added. Table 8.9 has been updated to re- 
flect these changes. 
14. The description of testability has changed in 
sections 3.3. and 3.3.2. RESET and HOLD must 
be asserted by the tester to force the chip out- 
puts to float (tri-state). 


The following list represents the major differences 
between version 004 and version 003 of the i860 XR 
Microprocessor Data Sheet: 
Section 2.2.4 The explanation of the WP bit of the 
espr has been expanded. 
Section 2.8.2 More information on the instruction 
trap has been added. 
Section 2.8.4 The instruction access trap has been 
clarified. 
Section 2.8.7 The values of registers after a reset 
trap have been specified. 
Section 3.1.4 BREQ timing has been clarified. 
Section 3.1.5 The calculation of interrupt latency 
has bee corrected. 
Section 3.1.6 The description of the byte-enable 
signals has been expanded. 
Section 3.1.8 The relation between the lock instruc- 
tion and the LOCK# signal has been 
clarified. The BL bit should no longer 
be changed by writing to the dlrbase 
register. 


Section 6.0 
The thermal specifications have been 
updated. 


• 
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Section 7.3 
The A.C. Characteristics for ClK have 
changed. 
Section 7.3 
Advance timing information for the 50 
MHz 
clock 
rate has been added. 


These timings are subject to change 
without notice. 


Section 8.0 
The 
operand 
naming 
conventions 
have improved. 
Section 8.2.1 The encoding of the flush instruction 
has been corrected. 
Section 8.3 
The data-dependent multiplier freeze 
has been eliminated. Other freeze 
conditions have been corrected or 
clarified. 


The following list represents the major differences 
between version 005 and version 004 of the i860 XR 
Microprocessor Data Sheet. 


Section 2.2.4 OF bit is writable only in supervisor 
mode using ST.C. 
ClK rate has been updated. 
Figure 5.3 has been corrected. 
More information on measuring case 
temperature has been added. 
Section 6.0 
Figure 6.1 has been updated to in- 
clude 25 MHz. 


Section 3.1.1 
Section 5.0 
Section 6.0 


2-260 


Section 6.0 
Section 6.0 


Section 7.2 


Section 7.3 


Section 7.3 
Section 7.3 


Section 7.3 


Section 8.3 


Section 8.4 


intel~ 


Table 6.1 has been corrected. 
Table 6.2 has been updated to in- 
clude 25 MHz. 
The D.C. Characteristics have been 
updated to include 25 MHz 
power 


supply current. 


The A.C. Characteristics for ClK have 
been changed. 
50 MHz clock rate has been deleted. 
25 
MHz 
A.C. Specifications 
have 
been added. 
Figure 7.1 has been corrected. 


The data-dependent multiplier round- 
ing freeze has been eliminated. 
Programming restrictions for dual-in- 
struction mode are added. 


The following list represents the differences be- 
tween version 005 and version 006 of the 80860XR 
Microprocessor data sheet. 


Section 9.0 Functional Characteristics section add- 
ed. 


intel~ 
82495XP CACHE CONTROLLERI 
82490XP CACHE RAM 
• Two-Way, Set Associative, 
Secondary 
• MESI Cache Consistency 
Protocol 
Cache for 1860™ XP Microprocessor 
• Hardware Cache Snooping 
• 50 MHz "No Glue" Interface with CPU 
• Maintains Consistency 
with Primary 
• Configurable 
Cache via Inclusion Principle 
-Cache 
Size 256 or 512 Kbytes 
• Flexible User-Implemented 
Memory 
- 
Line Width 32, 64 or 128 Bytes 
Interface Enables Wide Range of 
- 
Memory Bus Width 64 or 128 Bits 
Product Differentiation 
• Dual-Ported Structure Permits 
- 
Clocked or Strobed 
Simultaneous Operations on CPU and 
- 
SynChronous or Asynchronous 
Memory Buses 
-Plpelining 
• Efficient MRU Way Prediction 
- 
Memory Bus Protocol • 


- 
Zero Wait States on MRU Hit 
• 82495XP Cache Controller 
Available in 
- 
One Wait State on MRU Miss 
208-Lead Ceramic Pin Grid Array 
• Dynamically Selectable Update PoliCies 
Package 


- 
Write-Through 
• 82490XP Cache RAM Available in 84- 
- 
Write-Once 
Lead Plastic Quad Flatpack Package 
- 
Write-Back 
(See Packaging 
Handbook, 
Order #240800) 


The Intel 82495XP 
cache controller 
and 82490XP 
cache RAM, when coupled 
with a user-implemented 
memo- 


ry bus controller, 
provide 
a second-level 
cache subsystem 
that eliminates 
the memory 
latency and bandwidth 
bottleneck 
for a wide 
range 
of multiprocessor 
systems 
based 
on the 
i860 XP microprocessor. 
The 
CPU 
interface 
is optimized 
to serve the i860 XP microprocessor 
with zero wait states at up to 50 MHz. A secondary 
cache 
built from the 82495XP 
and 82490XP 
isolates 
the CPU from the memory 
subsystem; 
the memory 
can 
run slower 
and follow 
a different 
protocol 
than the i860 XP microprocessor 
. 
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Figure 0-1. Secondary 
Cache Configuration 
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Figure 1·1. 82495XP Plnout (Bottom View) 
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Pin Cross Reference 
Tables 


Table 1-1. 82495XP 
Pin Cross Reference 
by Name 


Signal 
Location 
Signal 
Location 
Signal 
Location 


AOS# 
B15 
AHOLO 
A17 
BGT# 
M03 


BLAST # 
C15 
BLE# 
C16 
BOFF # [CLENO] 
G15 


BROY# 
P01 
BROYC1# 
015 
BROYC2# 
F14 


BUS# 
P16 
CACHE# 
G14 
CAOS# 
E03 


CAHOLO 
G04 
COC# 
003 
COTS # 
F04 


CFAO 
E15 
CFA1 
B14 
CFA2 
006 


CFA3 
B02 
CFA4 
A16 
CFA5 
E14 


CFA6 
014 
CLK 
011 
CMIO# 
004 


CNA#[CFGO] 
L04 
CROY # [SLFTST #] 
M02 
CWAY 
J03 


CWR# 
E04 
OC# 
H14 
ORCTM# 
M01 


EAOS# 
J15 
EWBE# 
S02 
FLUSH # [NCPFLO #] 
N04 


FPFLO # [FPFLOEN] 
J04 
FSIOUT# 
001 
HITM# 
017 


INV[CLEN1] 
K15 
KEN# 
016 
KLOCK# 
C03 


KWENO# 
[CFG2] 
M04 
LEN 
F15 
LOCK # 
B16 


MALE[WWOR#] 
002 
MAOE# 
S04 
MAWEA# 
017 


MBALE[HIGHU] 
P04 
MBAOE# 
P06 
MCACHE# 
CO2 


MCFAO 
016 
MCFA1 
N14 
MCFA2 
R04 


MCFA3 
006 
MCFA4 
P15 
MCFA5 
P14 


MCFA6 
P13 
MCYC# 
P17 
MHITM# 
H04 


MIO# 
F16 
MKEN# 
R01 
MRO# 
J01 


MSETO 
015 
MSET1 
P12 
MSET10 
011 


MSET2 
P11 
MSET3 
014 
MSET4 
R16 


MSET5 
013 
MSET6 
R17 
MSET7 
S17 


MSET8 
P10 
MSET9 
012 
MTAGO 
010 


MTAG1 
P09 
MTAG10 
007 
MTAG11 
P07 


MTAG2 
009 
MTAG3 
R14 
MTAG4 
008 


MTAG5 
R15 
MTAG6 
S14 
MTAG7 
S15 


MTAG8 
P08 
MTAG9 
S16 
MTHIT# 
G03 


MWBWT# 
K03 
NA# 
J17 
NENE# 
005 


PALLC# 
002 
PCO 
H15 
PCYC 
J14 


PWT 
C17 
ROYSRC 
C01 
RESET 
005 


SETO 
013 
SET1 
C13 
SET10 
A09 


SET2 
C14 
SET3 
B12 
SET4 
C12 


SET5 
C11 
SET6 
012 
SET7 
009 


SET8 
010 
SET9 
B09 
SMLN# 
C06 


SNPAOS# 
F03 
SNPBSY# 
. 
F01 
SNPCLK[SNPMO] 
S03 


SNPCYC# 
H03 
SNPINV 
P05 
SNPNCA 
003 
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Signal 
Location 
Signal 
Location 
Signal 
Location 


SNPSTB# 
R03 
SWENO#[CFG1] 
001 
SYNC# 
[MEMLORV] 
004 


TAGO 
C08 
TAG1 
A04 
TAG10 
B01 


TAG11 
C05 
TAG2 
008 
TAG3 
A03 


TAG4 
B04 
TAG5 
B03 
TAG6 
C07 


TAG7 
A02 
TAG8 
007 
TAG9 
A01 


TCK 
P03 
TOI 
N03 
TOO 
C04 


TMS 
P02 
WAY 
L15 
WBA 
M14 


WBTYP 
N15 
WBWE# 
M15 
WBWT # [WRMRST] 
K14 


WR# 
B17 
WRARR# 
L14 


NC A14, A15, S01, S02 
Vcc 
A05-A08, 
A 1O-A 13, E01, 
Vss 
B05-B08, 
B10-B11, 
B13, 
E17, H01, H17, K01, K17, 
E02, E16, F02, H02, H16, 
L01, L17, C09, N17, F17, 
J02,J16,K02,K04,K16, 
G01, G17, M17, N01, S05- 
L02-L03,L16,C10,N16, 
S13 
G02, G16, R02, R05-R10, 
M16, N02, R11-:R13 


Table 1·1. 82495XP 
Pin Cross Ref,erence 
by Name (Continued) 


Signal 
Location 
Signal 
Location 
Signal 
Location 


AO 
65 
A1 
66 
A10 
77 


A11 
78 
A12 
79 
A13 
I 
80 


A14 
81 
A15 
82 
A2 
67 


A3 
68 
A4 
69 
A5 
70 


A6 
71 
A7 
73 
A8 
75 


A9 
76 
AOS# 
63 
BE# 
64 


BLAST # 
59 
BOFF# 
36 
BROY# 
60 


BROYC# 
61 
BUS# 
40 
COATAO 
48 


COATA1 
54 
COATA2 
49 
COATA3 
55 


COATA5 
51 
COATA6 
52 
COATA7 
57 


COATA4 
46 
CLK 
30 
CROY# 
43 


HITM# 
62 
MAWEA# 
41 
MBROY # [MISTB] 
22 


MCLK[MSTBM] 
26 
MCYC# 
42 
MOATAO 
18 


MOATA1 
14 
MOATA2 
10 
MOATA3 
6 


MOATA4 
16 
MOATA5 
12 
MOATA6 
8 


MDATA7 
4 
MOOE# 
20 
MEOC# 
23 


MFRZ# 
[MEMLDRV] 
24 
MOCLK[MOSTB] 
27 
MSEL # [MTR4/MTR8 
#] 
25 


MZBT # [MX4/MX8 
#] 
21 
PAR# 
32 
RESET 
28 


TCK 
3 
TOI 
2 
TOO 
84 


TMS 
1 
WAY 
45 
WBA 
38 


WBTYP 
37 
WBWE# 
39 
WR# 
' 
58 


WRARR# 
44 
Vcc 
5,9,13,17,29,35,50,56, 
Vss 
7,11,15,19,31,33,34,47, 


NC 
83 
74 
53, 72 


Table 1·2.82490XP 
Pin Cross Reference 
by Name 
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1.2 Quick Pin Reference 


BGT 11 [C490LDRV] 
I 
Bus Guaranteed 
Transfer, 
[82490XP 
low 
Drive] 
This signal is generated 
by the MBC to the 82495XP. 
It indicates 
to the 
82495XP a commitment 
by the MBC to complete 
the cycle on the memory 
bus. Until BGT 11 activation 
the 82495XP 
owns the cycle and will abort it if 
intervening 
snoops happen. After BGT 11 the cycle is owned by the MBC until 
its completion. 
From BGT 11 until SWENDII 
snoops will be accepted, 
but none 
will be processed 
until SWENDII 
activation. 


During RESET's 
falling edge, this signal controls 
the driver's 
strength 
of the 
82495XP to 82490XP 
interface 
signals. This strength 
is a function 
of the 
cache size, and therefore 
the number of 82490XP's. 
Refer to the layout 
specifications 
section for more details. 


BLEII 
0 
BE latch 
Enable 
The BlEII 
signal is used to control the enable line of an external 
'377-type 
latch. The latch captures 
the i860 XP CPU's BE (Byte Enable) signals and 
other CPU provided 
cycle attributes 
which do not go through the 82495XP. 


- 
BRDYII 
I 
82495XP 
Burst Ready 
This is the burst ready indication 
from the memory bus controller. 
The MBC 
should connect 
its burst ready indication 
to the CPU BRDY 11, the 82495XP 
BRDY 11 and the 82490XP 
BRDY 11. In the CPU, it provides 
the same function 
as that described 
in the CPU data sheet. The 82495XP will only use this 
indication 
for burst tracking 
purposes. 
In the 82490XP, 
it increments 
the CPU 
latch burst counter. 


CADS 11 
0 
Cache Address 
Strobe 
This signal is generated 
by the 82495XP and used by the memory bus 
controller. 
Its assertion 
requests 
execution 
of a memory bus cycle by the 
memory bus controller. 
This signal when active indicates 
that the cache cycle 
control 
and attribute 
signals are valid. 


CAHOLD 
0 
82495XP AHOlD 
This signal is generated 
by the 82495XP to track the CPU AHOlD 
signal 
when used for warm-reset 
and lOCKed 
sequences. 
It also provides 
information 
about CPU and cache BIST. 


CD/CII 
0 
Cache Data/Control 
This is a cycle definition 
signal driven by the 82495XP. 
It indicates 
the type of 
memory bus cycle requested. 
This signal is valid with CADS 11 and can be 
pipelined 
by the memory bus controller. 


COTS 11 
0 
Cache Data Strobe 
This signal is driven by the 82495XP to the memory bus controller. 
CDTSII 
for 
read cycles indicates 
that in the next ClK 
the memory bus controller 
can 
generate 
the first BRDY 11 for the read cycle. For write cycles it indicates 
when data is available 
on the memory bus. Usage of this signal allows 
complete 
independency 
between 
address strobes 
(CADS 11, SN PADS 11) and 
data strobe. 


CFGO-2 
I 
Cache Configuration 
bits 0-2 
These signals are inputs to the 82495XP. 
CFGO-2 allow the 82495XP 
to be 
configured 
to 5 different 
modes. Different 
modes indicate 82495XP/CPU 
line 
ratio, tag size (4K/8K), 
lines per sector. 


• 
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ClK 
I 
Clock 
This signal provides the fundamental 
timing for the 82495XP, 
82490XP 
and 
CPU. It must be provided to the 82495XP, 
82490XPs, 
CPU and memory bus 
controller 
components 
with minimal skew. 


CM/IO# 
0 
Cache Memory/IO 
This signal is driven by the 82495XP 
and is a cycle definition 
signal. It 
indicates 
the type of memory bus cycle requested. 
This signal is valid with 
CADS# 
and can be pipelined 
by the memory bus controller. 


CNA#[CFGO] 
I 
82495XP 
Next Address 
Enable, [Configuration 
Pin 0] 
This signal is driven by the memory bus controller 
and supplied to the 
82495XP. 
It is used by the memory bus controller 
to dynamically 
pipeline 
CADS# 
cycles. 


During RESET falling edge it functions 
as the 82495XP 
CFGO input. 


CRDY# [SlFTST #] 
I 
Cache Memory 
Bus Ready, [82495XP 
Self Test] 
This signal is generated 
by the memory bus controller 
and informs the 
82495XP and 82490XP that a memory bus cycle has been completed. 
CRDY # activation 
ends the memory bus cycle. 
During RESET's 
falling edge, if this signal is sampled 
low(active) 
and 
MBALE is sampled 
high(active), 
82495XP 
self test will be invoked. 


CWAY 
0 
Cache Way 
CWA Y is driven by the 82495XP 
and is a cycle definition 
signal that 
indicates 
to the memory bus controller 
the WAY to be used by the 
requested 
cycle. On line-fills it indicates 
the way the line will be loaded. For 


write-backs 
it indicates 
the WAY that was written-back. 
This signal is valid 
with CADS#. 


CW/R# 
0 
Cache Write/Read 
This signal is driven by the 82495XP 
and is a 82495XP 
cycle definition 
signal. It indicates 
the type of memory bus cycle requested. 
This signal is 
valid with CADS# 
and can be pipelined 
by the memory bus controller. 


DRCTM# 
I 
Memory Bus Direct to [M] State 
This signal is an input to the 82495XP. 
It is the mechanism 
by which the 
memory bus can dynamically 
inform the 82495XP of a request to skip the 
[E) state and move the line directly to the [M] state. This signal is sampled 
by the 82495XP when SWEND # is asserted. 


FLUSH# [NCPFLD#] 
I 
Flush the 82495XP cache, 
[Enable 
Non-Cacheable 
PFLD] 
This signal is an input to the 82495XP. 
Flush when active will cause the 
82495XP to write-back 
all of its modified 
lines into main memory then 
invalidate 
all tag locations. 
At the end of a flush operation 
the 82495XP 
tag 
array will be completely 
invalidated. 
During RESET activation, 
this pin functions 
as the NCPFLD# 
configuration 
signal which, with FPFLDEN, 
selects one of three modes for handling 
i860 XP CPU floating 
point load cycles. 
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1.2 Quick Pin Reference (Continued) 


FPFLD 11 [FPFLDEN) 
I/O 
FIFO PFLD Enable [PFLD Mode Select) 
During RESET, FPFLDEN 
and NCPFLDEN 
11 inputs select one of three 
modes to handle i860 XP CPU pipe lined floating 
point load cycles. In the 


mode which supports 
an external 
FIFO, the FPFLDII 
output indicates 
a 
PFLD cycle to be loaded into the FIFO. 


FSIOUTII 
0 
Flush/Sync/Initialization 
Output 
This signal is an output of the 82495XP and indicates 
the start and end of 
three operations: 
Flush, Sync, and Initialization. 
The output is activated 
when the operation 
internally 
begins and is de-activated 
when the 
operation 
ends. 


KLOCKII 
0 
82495XP 
LOCK 11 
This signal is driven by the 82495XP 
and indicates 
to the memory 
bus 
controller 
a request to execute 
atomic read-modify-write 
sequences. 


KLOCKII 
is active with the CADSII 
of the first LOCKed operation 
and 
remains active until at least the clock following 
CADSII 
of the last cycle of 
LOCKed operation. 


KWENDII [CFG2) 
I 
Cacheability 
Window 
End, [Configuration 
Pin 2) 
This signal is generated 
by the MBC and indicates 
to the 82495XP 
that the 
Cacheability 
Window has expired. At this point the 82495XP 
will latch the 


memory cacheability 
signal (MKENII) 
and make decisions 
based on the 
cacheability 
attribute. 
MROII 
which indicates 
the Read-Only 
cycle attribute 
is also sampled at this point. 
During RESET's 
falling edge this line functions 
as the CFG2 configuration 
signal which is used to configure 
the 82495XP/82490XP 
with cache 
parameters. 


MALE [WWOR11) 
I 
Memory Bus, Address 
Latch Enable [Weak Write Ordering) 
This signal is generated 
by the memory bus controller, 
and controls 
a 
82495XP 
internal transparent 
address latch (373 like). CADSII 
will 
generate 
a new address at the input of the internal address 
latch. MALE 
activation(high) 
will allow the flowing of this address to the memory 
bus 
provided 
MAOEII 
is active. When MALE inactive(low), 
the address 
at the 
latch input is latched. 
WWOR 11 configures 
the 82495XP 
into strong or weak write-ordering 
mode. 


MAOEII 
. I 
Memory 
Bus Address 
Output Enable 
This signal is generated 
by the memory bus controller 
and controls 
the 
82495XP's 
output buffer of the memory bus address 
latches. The 82495XP 
drives the memory bus address lines if MAOE 11 is active (Iow). Otherwise, 
it is tristated. 
MAOE 11 also serves as a qualifier for snooping 
cycles: when 
inactive snoops will be enabled. 


MBALE[HIGHU) 
I 
Memory Bus, 82495XP 
sub-line-address 
Latch Enable[High 
Impedance 
Output) 
This signal has an exact function 
as MALE but controls 
only the 82495XP 
sub-line addresses. 
This signal is generated 
by the memory bus controller, 


and controls 
a 82495XP 
internal transparent 
address latch (373 like). 
CADS 11 will generate 
a new address at the input of the internal address 
latch. MBALE activation(high) 
will allow the flowing of the sub-line address 
to the memory bus provided 
MBAOE 11 is active. When MALE inactive(low), 


the sub-line address at the latch input is latched. 
HIGHZ 11, if active along with SLFTST 
11, causes the 82495XP to float all of 


its outputs. 
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MBAOE# 
I 
Memory Bus, 82495XP 
sub-line Address Output Enable 
This signal has a similar function 
than MAOE #, but controls 
only the 
82495XP 
sub-line addresses. 
If MBAOE # is active(low), 
the 82495XP will drive the sub-line portion of the 
address onto the memory bus. Otherwise, 
it is tristated. 
MBAOE# 
is also 
sampled 
during snoop cycles. If MBAOE # is sampled 
inactive with 
SNPSTB #, the. snoop write back cycle(if any) will begin at the sub-line 
address provided. 
If MBAOE # is active with SNPSTB #, the snoop write 
back will begin at sub-line address 
O. 


MBRDY #(MISTB) 
I 
Memory Bus Ready, (Memory 
Input Strobe) 
This pin is an input to the 82490XP. 
It is used in clocked 
bus mode to 
indicate the end of a transfer. 
When active(low) 
it indicates 
that the 
82490XP 
should increment 
the burst counter 
and either output the next 
data or get ready to accept the next data. 
In strobed 
memory bus mode this pin is the input data strobe to the 
82490XP. 
On each MISTB edge, the 82490XP 
latches the data and 
increments 
the burst counter. 


MCACHE# 
0 
82495XP 
Internal Cacheability 
This signal is driven by the 82495XP. 
On read cycles, this signal indicates 
the cycle's 
internal cacheability 
attribute. 
In write cycles MCACHE # is only 
active for write-back 
cycles. MCACHE# 
is not activated 
for 1/0, special 
cycles and Locked Cycles. 


MCFA6-MCFAO 
1/0 
Memory Bus Configurable 
address lines 
MSET1o-MSETO 
1/0 
Memory bus SET number 
MTAG11-MTAGO 
1/0 
Memory bus TAG bits 
These are the memory bus address 
lines of the 82495XP 
and should be 
connected 
to the A31-A2 
(A31-A3 
for 64 bit bus) signals of the Memory 
Bus. These signals, along with the byte enables, 
define the physical 
area of 
memory or 1/0 accessed. 
The 82495XP drive these signals in normal memory bus cycles and have 
them as inputs during snooping. 


MCLK[MSTBM) 
I 
Memory 
Bus Clock, [Memory 
Input Strobe) 
In clocked 
memory bus mode this pin provides 
the memory bus clock to the 
82490XP. 
In clocked 
mode, memory bus signals and memory bus data are 
sampled 
on the rising edge of the MCLK. In a clocked 
memory bus write, 


data is driven off of MCLK or MOCLK depending 
upon the configuration. 
This pin is an input to the 82490XP. 
It is sampled 
during reset and 
determines 
the memory bus type. If this input is strapped 
high, the memory 
bus will be strobed. 
If a clock is detected 
at this input, this pin becomes 
the memory 
bus clock, 


and clocked 
memory bus mode is selected. 


MDATAO-MDATA7 
1/0 
Memory Bus Data 
These pins are the 8 memory data pins of the 82490XP. 
All or part of these 
pins will be used depending 
on the cache configuration. 
In clocked 
memory 
bus mode, these pins are sampled with the rising edge of MCLK. New data 
is driven out on these pins with MEOC # or the rising edge of MCLK or 
MOCLK while MBRDY # or MEOC # is active. In strobed 
memory bus 
mode, MDATA is sampled 
on each MISTB edge. New data is driven out on 
these pins with each MOSTB edge or MEOC#-falling 
edge. 
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MDOE# 
I 
Memory Data Output Enable 
This signal is an input to the 82490XP. The memory bus output enable is 
used to control 
the 82490XP's 
driving of data onto the memory bus. When 


this pin is inactive(high), 
the MDATA[O:7] 
pins are tristated. 
When this pin is 
active(low), 
the MDATA[O:7] 
pins are actively driving data. The function 
of 


this pin is the same for strobed 
or clocked 
memory bus operation 
as 
MDOE# 
has no relation to ClK 
or MClK. 


MEOC# 
I 
Memory End of Cycle 
This signal is an input to the 82490XP. 
Since it is synchronous 
to the 
memory bus, it may be used to end a cycle on the memory bus and begin a 
pending cycle without waiting for synchronization 
to the CPU ClK. 
MEOC# 
also causes the latching or driving of data and resetting 
of the memory burst 
counter. 


MFRZ# 
[MEMLDRV] 
I 
Memory Freeze, [Memory 
Bus low 
Drive] 
This signal is an input to the 82490XP. 
It is used for write cycles that could 
cause allocation 
cycles. When this pin is active(low), 
write data is latched 
in 
the 82490XP. The subsequent 
allocation 
will not overwrite 
data latched by 
the write. This prevents 
the actual write to memory from having to be 
performed 
on the memory bus. The allocated 
line will be placed in the [M] 
state in the cache since memory has not been updated. 
During RESET's 
falling edge, this signal is sampled to indicate the 
82490XP's 
memory bus driving strength. 
The 82490XP provides 
normal and 
high drive capability 
buffers. 


MHITM# 
0 
Memory Bus Hit to Modified 
Line 
This signal is driven by the 82495XP 
during snoop cycles and indicates 
whether 
the snooping 
address hit a Modified 
line in the 82495XP 
cache. The 
82495XP automatically 
schedules 
the writing-back 
of modified 
lines when 
snoop hits occur. MHITM# 
is activated 
the ClK 
after SNPCYC# 
and will 
remain active until the next SNPSTB #. 


MKEN# 
I 
Memory Bus Cacheability 
This signal is an input to the 82495XP.lt 
is the memory bus cache enable 
pin. It is used to indicate to the 82495XP 
if the current memory bus cycle is 
cacheable 
or not. This pin is sampled 
by the 82495XP with KWEND# 
assertion. 


MOCLK(MOSTB) 
I 
Memory Output Clock, (Memory Output Strobe) 
MOClK 
controls 
a transparent 
latch at the 82490XP data outputs. 
By 
providing 
a clock input, skewed from MClK, 
MDATA hold time may be 
increased. 
In strobed 
bus mode this pin is the data output strobe. On each MOSTB 
edge, new data will be output onto the memory bus. 


MRO# 
I 
Memory Bus Read-Only 
This pin is an input to the 82495XP. 
It is the READ-ONLY 
attribute 
pin. It is 
used to indicate to the 82495XP that the accessed 
line should get a READ- 
ONLY attribute. 
READ-ONLY 
lines will be non-cacheable 
in the first level 
cache. READ-ONLY 
lines will be cached in the 82495XP 
if MKEN # is 
sampled 
active during KWEND# 
and will be cached in the [S] state. This pin 
is sampled 
by the 82495XP with KWEND# 
assertion. 
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MSEL# [MTR4/TR8#] 
I 
Memory Select, 
[Memory Transfer] 
This signal is a chip select input to the 82490XP. 
MSEl# 
activation 
qualifies the MBRDY# 
input of the 82490XP. 
MSEl# 
going active causes 


the sampling 
of MZBT # for the next cycle. MSEl # going inactive resets 


the 82490XP's 
internal memory burst counter. 


This pin is used to determine 
the number of transfers 
necessary 
on the 
memory bus for each cache line. If high, there are 4 transfers 
on the 
memory bus for each cache line. If low, there are 8 transfers 
on the 
memory bus for each cache line. 


MTHIT# 
0 
Memory Bus Tag Hit 
This signal is driven by the 82495XP during snoop cycles. It indicates 
whether the snooping 
address hit any line (exclusive, 
shared, or modified) 


in the 82495XP 
cache. MTHIT # is activated 
the ClK 
after SNPCYC# 
and 
will remain active until the next SNPSTB #. 


MWB/WT# 
I 
Memory Bus Write Policy 
This signal is an input to the 82495XP. 
It is the mechanism 
by which the 


memory bus can dynamically 
inform the 82495XP 
of the cycle write policy 


(Write-Through/Write-Back). 
This signal is sampled 
by the 82495XP 
with 


SWEND# 
activation. 


MZBT# [MX4/MX8#] 
I 
Memory Zero Based Transfer, 
[Memory 
1/0 Bits] 
This signal is an input to the 82490XP. 
When this pin is sampled 
active 
(with MSEl# 
or MEOC#) 
it indicates 
that the memory bus cycle should 
start with burst location 
zero independent 
of the sub-line address 
requested 
by the CPU. 


This pin is used to determine 
the number of 10 pins used for the memory 


bus. When HIGH it indicates 
that 4 10 pins are used per 82490XP. 
When 


lOW 
it indicates 
that 8 10 pins are used. 


NENE# 
0 
Next Near 
This signal is generated 
by the 82495XP and indicates 
to the memory bus 


controller 
if the address of the requested 
memory cycle is "near" 
the 
address of the previously 
generated 
one (in the same 2K DRAM page). 


This information 
can be used by the memory bus controller 
to optimize 
access to paged or static column 
DRAMs. This signal is valid together 
with 


CADS#. 


PALLC# 
0 
Potential Allocate 
This signal is generated 
by the 82495XP and indicates 
to the memory bus 


controller 
that the current write cycle can potentially 
allocate 
a cache line. 


Potential 
allocate 
cycles are cycles which are 82495XP 
misses with PCD, 


PWT inactive. 


RDYSRC 
0 
Ready Source 
This signal is an output of the 82495XP. 
It indicates 
the source of the 
BRDY generation 
for the CPU. When high it indicates 
that the memory bus 
controller 
should generate 
BRDYs to the CPU, when low it indicates 
that 


the 82495XP will be the one providing 
BRDYs. 
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RESET 
I 
Reset 
This signal forces the 82495XP 
and 82490XP to begin execution 
at a known state. It's 
falling edge will sample the state of the configuration 
pins. RESET is an asynchronous 
input to the 82495XP and 82490XP. 
The following 
82495XP 
pins are sampled 
during reset failing edge: 
CNA# [CFGO): CFGO line of 82495XP configuration 
inputs. 
SWEND# 
[CFG1): CFG1 line of 82495XP 
configuration 
inputs. 
KWEND# 
[CFG2): CFG2 line of 82495XP 
configuration 
inputs. 
FLUSH # [NCPFLD#): 
Enables decoding 
of the non-cacheable 
PFLD mode. Active if low. 


FPFLD# 
[FPFLDEN): 
Enables the external 
FIFa pfld mode. Active high. 
BGT# 
[C490LDRV): 
Indicates the driving strength 
of the 82495XP/82490XP 
interface. 
If 


high, the 82495XP 
can drive up to 10 82490XP's 
without derating. 
If low, the 82495XP 
can drive up to 18 82490XP's 
without derating. 


SYNC# 
[MEMLDRV): 
Indicates the 82495XP's 
memory bus driving strength . 
SNPCLK[SNPMD): 
Indicates the snoop mode, synchronous 
or asynchronous. 
CFGO-CFG2 
signals are used to configure 
the 82495XP/82490XP 
with cache 
parameters. 
They define the lines/sector, 
line ratio, and number of tags. 
MALE [WWOR # l: Enforces 
strong or weak write-ordering 
consistency. 
MBALE[HIGHZ#): 
If active along with SLFTST# 
will tristate 
all 82495XP 
outputs. 


The following 
82490XP 
pins are sampled 
during reset failing edge: 
PAR#: 
If active(low), 
this pin configures 
the 82490XP as a parity storage device. The 
parity configuration 
stores the paritybits 
belonging 
to data stored in other 82490XP's. 


MZBT#[MX4/MX8#): 
Determines 
the number of 10 pins used for the memory bus 
interface. 
If high, four 10 pins are chosen. 
If low, eight 10 pins are chosen. 


MSEL # [MT 4/MT8 
#): Determines 
the number of transfers 
necessary 
on the memory bus 
for each cache line. If high, four memory bus transfers 
are needed to fill a cache line. If 
lOW,eight memory bus tranfers 
are needed to fill a cache line. 


MCLK[MSTBM 
#): If active(low), 
this pin indicates 
a strobed 
memory bus configuration. 
If 
inactive(high), 
a clocked 
memory bus is chosen. 


MFRZ# 
[MEMLDRV): 
Indicates the 82490XP's 
memory bus driving strength. 


SMLN# 
0 
Same Cache Line 
This signal is an output of the 82495XP. 
It is used to indicate to the memory bus controller 
that the current cycle is to the same 82495XP 
line as the previous one. This indication 
can be used by the memory bus controller 
to selectively 
activate 
its SNPSTB # signal to 
other caches. 
For example, 
back to back snoop hits to the same line may be snooped 
only once. This signal is valid together 
with CADS#. 
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SNPADS# 
0 
Cache Snoop Address 
Strobe 
This signal is an output of the 82495XP. 
It has an identical 
functionality 
as 
CADS #, but is generated 
only on snooping-write-back 
cycles. Considering 
that 


snoop write-back 
cycles are the only ones which are generated 
independent 
of 


CPU bus activity, this separate 
address strobe should ease implementation 
of 


the memory bus controller. 
Whenever 
active, the memory bus controller 
should 


abort all pending cycles (cycles for which BGT # was not issued yet. After 
BGT # the memory bus controller 
is responsible 
for the cycle completion). 
The 


82495XP 
assumes that non-committed 
cycles are aborted 
upon SNPADS# 


and may re-issue them again after the completion 
of the snoop. 


SNPBSY# 
0 
Snoop Busy 
This signal is driven by the 82495XP. 
When inactive(high), 
it indicates 
that the 


82495XP is ready to accept another snoop cycle. SNPBSY # will be activated 
for one of two reasons: A snoop hit to a modified 
line, a back-invalidation 
is 
needed when there is one already in progress. 
In either of these cases, the 


82495XP will not perform the look-up for a pending snoop until SNPBSY # is 
de-activated. 


SNPCLK[SNPMD] 
I 
Snoop Clock [Snoop Mode] 
This pin provides 
the 82495XP with the snoop clock to be used in clocked 
memory interfaces. 
During clocked 
mode SNPSTB#, 
SNPINV, SNPNCA, 


MBAOE #, MAOE #, and the Address 
lines will be sampled 
by SNPClK. 


During RESET activation, 
this pin functions 
as the SNPMD (snoop mode) 
signal. If high it indicates 
strobed snooping 
mode. If low it indicates 
synchronous 
snooping 
mode. For clocked 
snooping 
mode, SNPClK 
is 
connected 
to the snoop clock source. 


SNPCYC# 
0 
Snoop Cycle 
This signal is an output of the 82495XP. 
It indicates 
when the snooping 
look-up 


is actually taking place in the 82495XP tag RAM. 


SNPINV 
I 
Snoop Invalidation 
This signal is an input to the 82495XP 
and indicates 
the resulting 
line state in 
case of a snoop hit cycle. If active, it forces the line to go to an invalid state. 
This signal is sampled with SNPSTB # . 


SNPNCA 
I 
Snoop Non Caching Device Access 
This signal is an input to the 82495XP and provides the 82495XP 
information 


on whether the current memory bus master is a non caching 
device (DMA, 


etc). This indication 
allows the 82495XP to avoid changing 
line states from 
exclusive 
to shared unnecessarily. 


SNPSTB# 
I 
Snoop Strobe 
This signal is an input to the 82495XP which is used to initiate a snoop. 
SNPSTB# 
causes the latching of the snoop address and parameters. 
The 
82495XP 
supports 
three latching 
modes: Clocked, 
Strobed, 
Synchronous. 
In 


the clocked 
mode, address and attribute 
signals will be latched with the 
activation 
of SNPSTB # .SNPClK. 
In the strobed 
mode, address and attributes 


will be latched by the SNPSTB # falling edge. In synchronous 
mode, address 
and attribute 
signals will be latched with the activation 
of SNPSTB # .ClK. 
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SWENO# 
[CFG1] 
I 
Snoop Window 
End, [Configuration 
Pin 1] 
This signal is generated 
by the MBC and indicates 
to the 82495XP 
that the 
Snoop Window has expired. At this point the 82495XP will latch the memory 
bus attributes: 
write policy (MWB/WT 
#), and direct to [M] transfer 
(DRCTM #). At the end of the snooping 
window, 
all other devices have 
snooped 
the bus master's 
address and have generated 
address caching 
attributes 
on the bus. Once a cycle begins, the 82495XP 
prevents 
snooping 
until it has received 
SWEND#. 
The 82495XP will act based on those 
attributes 
and will update its tag RAM. 


During RESET's 
falling edge this line functions 
as the CFG1 configuration 
signal which is used to configure 
the 82495XP/82490XP 
with cache 
parameters. 


SYNC# 
[MEMLORV] 
I 
Synchronize 
82495XP 
cache, 
[Memory 
Bus low 
Drive] 
This signal is an input to the 82495XP. 
Activation 
of this line will cause the 
synchronization 
of the 82495XP tag array with main memory. All 82495XP 
modified 
lines will be written back to main memory. The difference 
between 
FLUSH and SYNC is that on SYNC the 82495XP 
and CPU tag array will NOT 
be invalidated. 
All the valid entries will be kept, with all modified 
lines 
(M state) becoming 
non-modified 
(E state). 
During RESET's 
falling edge, this signal is sampled 
to indicate 
the memory 
bus driving strength. 
If it is sampled 
low, the maximum 
capacitive 
load 
without derating 
is 100pf. If it is sampled 
high, the maximum 
capacitive 
load 
without derating 
is 50pf. 


TCK 
I 
Testability 
Clock 
This signal is an input to both the 82495XP 
and 82490XP. 
This is the 
boundary 
scan clock. This signal has to be connected 
to a clock 
synchronous 
to ClK 
to insure initialization 
of the test logic. 


TOI 
I 
Testability 
serial input 
This signal is an input to both the 82495XP and 82490XP. 


TOO 
0 
Testability 
serial output 
This signal is an output of both the 82495XP 
and 82490XP. 


- 
TMS 
I 
Testability 
Control 
This signal is an input to both the 82495XP and 82490XP. 


The following 
pins have internal 
pull-ups: 
During tri-state 
output testing 
sequence, 
all pull-ups 
will be disabled. 
ADS#, 
NA#, 
FPFlD#, 
TDI, 
TMS, 
BGT#, 
KWEND#, 
SWEND#, 
CNA#, 
BRDY#, 
SYNC#, 
FLUSH #, 
SNPSTB #, 
MRO #, 
DRCTM #, 
TCK, 
SNPClK, 
MFRZ#, 
MZBT#, 
MClK, 
MOClK, 
BOFF#, 
PAR#. 


The following 
signals 
are glitch free. These 
signals 
are always at a valid logic level following 
RESET: 


CADS#,CDTS#,SNPADS#,SNPCYC#, 
KlOCK#. 


2-281 


82495XP Cache Controller/82490XP 
Cache RAM 
intel~ 


1.3 Output Pins 


Table 
1-3 lists all output 
pins, from which 
part(s) they are driven, 
and their active levels. 


Table 
1-3. Output 
Pins 


Name 
Part 
Active 
Level 
Name 
Part 
Active 
Level 


BlElIf 
82495XP 
lOW 
MTHITlIf 
82495XP 
lOW 


CADSlIf 
82495XP 
lOW 
NENElIf 
82495XP 
lOW 


CAHOlD 
82495XP 
HIGH 
PAllClIf 
82495XP 
lOW 


CDTSlIf 
82495XP 
lOW 
RDYSRC 
82495XP 
HIGH 


CWAY 
82495XP 
- 
SMlNlIf 
82495XP 
lOW 


CW IR lIf,CD/C lIf,CM/IOllf 
82495XP 
- 
SNPADSlIf 
82495XP 
LOW 


FSIOUTlIf 
82495XP 
lOW 
SNPBSYlIf 
82495XP 
lOW 


KlOCKlIf 
82495XP 
lOW 
SNPCYClIf 
82495XP 
lOW 


MCACHElIf 
82495XP 
lOW 
TOO 
82495XP 182490XP 
- 


MHITMlIf 
82495XP 
lOW 


1.4 Input Pins 


Table 
1-4 lists all input pins, which 
part(s) they are input to, their active 
level, and whether 
they are synchro- 
nous or asynchronous 
inputs. 


Table 
1-4. Input 
Pins 


Name 
Part 
Active 
Level 
Synchronous/Asynchronous 


BGT lIf [C490lDRV] 
82495XP 
lOW 
Synchronous 
to ClK 


BRDYlIf 
82495XP 182490XP 
lOW 
Synchronous 
to ClK 


ClK 
82495XP 182490XP 
- 
- 


CFG3 
82495XP 
- 
Synchronous 
to ClK 


CNAlIf 
(CFGO) 
82495XP 
lOW 
Synchronous 
to ClK 


CRDY lIf [SlFTST 
lIf] 
82495XP 182490XP 
lOW 
Synchronous 
to ClK 


DRCTMlIf 
82495XP 
lOW 
Note 2 


FLUSHlIf 
[NCPFlDlIf] 
82495XP 
lOW 
Asynchronous 


CPUTYP 
82495XP 
lOW 
Synchronous 
to ClK 


KWENDlIf 
(CFG2) 
82495XP 
lOW 
Synchronous 
to ClK 


MALE, MBAlE 
82495XP 
HIGH 
Asynchronous 


MAOElIf,MBAOElIf 
82495XP 
lOW 
Asynchronous 


MClK[MSTBMlIf] 
82490XP 
lOW 
Synchronouos 
to MClK 


MBRDY lIf (MISTB) 
82490XP 
- 


MDOElIf 
82490XP 
lOW 
Asynchronous 


MEOCllf 
82490XP 
lOW 
Synchronousl 
Asynchronous, 
Note 1 
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Table 
1-4. Input 
Pins (Continued) 


Name 
Part 
Active 
Level 
Synchronous/Asynchronous 


MFRZ# 
82490XP 
Low 
Synchronous/ 
Asynchronous, 
Note 1 


MOCLK(MOSTB) 
82490XP 


MSEL[MTR4/TR8#] 
82490XP 
Low 
Synchronousl 
Asynchronous, 
Note 1 


MZBT # [MX4/MX8 
#] 
82490XP 
Low 
Synchronous/ 
Asynchronous, 
Note 1 


MKEN# 
82495XP 
LOW 
Note 2 


MRO# 
82495XP 
LOW 
Note 2 


MWB/WT# 
82495XP 
- 
Note 2 


PAR# 
82490XP 
Low 
Synchronous 
to CLK 


RESET 
82495XP /82490XP 
HiGH 
Asynchronous 


SNPCLK[SNPMD] 
82495XP 
- 
- 


SNPINV 
82495XP 
HIGH 
Note 3 


SNPNCA 
82495XP 
HIGH 
Note 3 


SNPSTB# 
82495XP 
LOW 
Note 3 


SWEND# 
(CFG1) 
82495XP 
LOW 
Synchronous 
to CL~ 


SYNC# 
[MEMLDRV] 
82495XP 
LOW 
Asynchronous 


TCK 
82495XP 182490XP 
- 


Tm 
82495XP 182490XP 
- 
Synchronous 
to TCK 


TMS 
82495XP 182490XP 
- 
Synchronous 
to TCK 


NOTES: 
(1) In Clocked memory bus mode these pins are synchronous with MCLK. In Strobed memory bus mode these pins are 
asynchronous. 
(2) MWB/WT#, 
DRCTM# must be synchronous to ClK during SWEND#. MKEN#, MRO# must be synchronous to ClK 
during KWEND#. 
(3) In clocked memory bus mode these pins are synchronous with SNPClK. In strobed memory mode these pins are 
asynchronous. 


1.5 Input/Output Pins 


Table 
1-5 lists all input/output 
pins, which 
part they interface 
with, and when they are floated. 


Table 
1-5. Input/Output 
Pins 


Name 
Part 
Synch/Asynch 
When 
Floated 


FPFLD# 
[FPFLDEN] 
82495XP 
Synchronous 
to CLK 
- 


MCFAO-MCFA6 
82495XP 
Note 1 
MAOE# 
= High 


MDATAO-MDATA7 
82490XP 
Note 2 
MDOE# 
= Hight and during Reset 


MSETO-MSET10 
82495XP 
Note 1 
MAOE# 
= High 


MTAGO-MTAG11 
82495XP 
Note 1 
MAOE# 
= High 


NOTES: 
(1) With MALE high and MAOE# low, these pins are synchronous to ClK. 
(2) In Clocked memory bus mode these pins are synchronous with MCLK. In Strobed memory bus mode these pins are 
asynchronous. 
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1.6 
Pin State During Reset 


Table 1-6. Pin State During Reset 


Pin Name 
Pin State during Reset 


CADS#,CDTS#,SNPADS# 
High 


CW/R#, 
CD/C#, 
CMIIO#, 
MCACHE# 
Undefined 


RDYSRC, 
PALLC#, 
CWAY 
Undefined 


NENE #, SMLN # 
Undefined 


KLOCK# 
High 


FPFLD# 
High 


MSETO-MSET10, 
MTAGO-MTAG11, 
MCFAO-MCFA6 
Note 1 


CAHOLD 
Note 2 


MHITM #, MTHIT # 
High 


SNPCYC#, 
SNPBSY# 
High 


TOO 
Note 3 


NOTES: 
(1)MSET, MTAG, and MCFA signals are high impedance during reset if MAOE# and MBAOE# are deasserted. 
(2) The state of CAHOLO depends on whether self-test is selected (see testability chapter for details). 
(3) The State of TOO is controlled by the boundary scan which is independent of other signals including RESET (see 
testability chapter for details). 


1.7 
Quick Pin Reference 
(Optimized 
Interface) 


refined and optimized 
to allow zero wait state opera- 
tion at 50 MHz. 


The 
table 
below 
lists 
and 
describes 
the 
pins that 
comprise 
the optimized 
interface. 
This is the inter- 
face 
between 
the 
i860 
XP CPU, 
82495XP 
Cache 
Controller, 
and the 82490XP 
SRAM 
that 
has been 


The information 
provided 
in the table below 
is help- 
ful in identifying 
the pins that make up the optimized 
interface 
to facilitate 
interface 
layout and debugging. 
Timing 
information 
appears 
in chapter 
ten. 


Symbol 
Type 
Name and Function 


A3-A31 
0 
CPU 
The i860 XP CPU Address 
Outputs A3-A31 
are connected 
to the 82490XP 
AO-A15 
I 
490 
address inputs AO-A 15 and t<;>the 82495XP 
address inputs of SET, TAG, and 
CFA. 


ADS# 
0 
CPU 
The i860 XP CPU Address 
Strobe output is connected 
to the 82495XP 
and 
I 
495 
82490XP ADS # inputs and indicates 
the start of a CPU cycle. Please refer to 
I 
490 
the i860 XP CPU data sheet for details. 


AHOLD 
I 
CPU 
Address 
Hold is driven by the 82495XP to the i860 XP CPU AHOLD 
input 
0 
495 
during back-invalidation 
cycles. Please refer to the i860 XP CPU data sheet for 
details. 


BEO#-BE7# 
0 
CPU 
The i860 XP CPU Byte Enable outputs are driven to an external 
latch controlled 
BE# 
I 
490 
by the 82495XP. 
Each byte enable also goes to an 82490XP to control 
partial 
write cycles. Please refer to the i860 XP CPU data sheet for details. 


BLAST# 
0 
495 
The 82495XP 
Burst Last output is driven to indicate the end of a cycle. It is 
I 
490 
connected 
to the 82490XP 
BLAST # input. 


BOFF# 
I 
CPU 
The Backoff 
input of the i860 XP CPU is driven by the 82495XP to resolve 
0 
495 
contention 
on the CPU bus. See the i860 XP CPU data sheet for details. 
I 
490 
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1.7 
Quick Pin Reference 
(Optimized 
Interface) 
(Continued) 


Symbol 
Type 
Name and Function 


BRDYC# 
I 
490 
Burst Ready Cache is an input to the 82490XP that is connected 
to the 
82495XP 
BRDYC2 # output and is used for tracking 
these hit cycles. 


BRDYC1# 
0 
495 
Burst Ready Cache 
1 is output by the 82495XP to the i860 XP CPU RSRVD 
RSRVD 
input during cache hit and posted cycles. 


BRDYC2# 
0 
495 
Burst Ready Cache 2 is output by the 82495XP 
to the 82490XP 
BRDYC# 
input during cache hit and posted cycles. 


BUS# 
0 
495 
The Bus/ Array Select output of the 82495XP 
multiplexes 
either the memory 
I 
490 
bus path or array path to the CPU bus of the 82490XP. 


CDATAO-7 
I/O 
490 
Cache Data I/O pins are the 8 bits comprising 
the I/O data bus interface 
between 
each 82490XP and the i860 XP CPU data bus. 


CFAO-CFA6 
I/O 
495 
Cache Function 
and Address 
pins of the 82495XP 
are multiplexed 
to the i860 
XP CPU address according 
to the 82495XP configuration 
. 


D/C# 
0 
CPU 
The Data/Code 
output of the i860 XP CPU is used by the 82495XP 
to decode 
I 
495 
special cycles. Please refer to the i860 XP CPU data sheet for details. 


DO-D63 
I/O 
CPU 
Data Bus The i860 XP CPU pins are distributed 
to each 82490XP. 
Please refer 
to the i860 XP CPU data sheet for details. 


DPO-DP7 
I/O 
CPU 
The i860 XP CPU Data Parity Bus pins are distributed 
to each 82490XP. 
Please refer to the i860 XP CPU data sheet for details. 


EADS# 
I 
CPU 
The i860 XP CPU External 
Invalidation 
Address 
Strobe 
is driven by the 
0 
495 
82495XP during back-invalidation 
and inquire cycles to maintain 
inclusion. 


Please refer to the i860 XP CPU data sheet for details. 


EWBE# 
I 
CPU 
The i860 XP CPU External 
Wrlteback 
Buffer Empty input is driven by the 
0 
495 
82495XP for use in Strong Ordering 
mode. See the i860 XP CPU data sheet for 
details. 


HITM# 
0 
CPU 
The i860 XP CPU Hit Modified 
output indicates 
a snoop hit to a modified 
line 
I 
495 
when the 82495XP 
performs 
a snoop to the CPU. 


I 
490 


INV 
I 
CPU 
The Invalidate 
input to the i860 XP CPU is driven by the 82495XP 
on back 
0 
495 
invalidation 
cycles. 
. 


KEN# 
I 
CPU 
The i860 XP CPU Cache Enable input is driven by the 82495XP 
during 
0 
495 
cacheable 
cycles. Please refer to the i860 XP CPU data sheet for details. 


LEN 
0 
CPU 
The i860 XP CPU drives the Length pin active to indicate a cycle of 2 transfers. 
I 
495 
With LEN inactive, the cycle length is 1 transfer. 
If a cycle is cacheable, 
LEN is 
undefined 
and the cycle length is 4 transfers. 


LOCK# 
0 
CPU 
The Cycle Lock 
pin of the i860 XP CPU is driven to the 82495XP 
which in turn 
I 
495 
drives the memory bus KLOCK # output. Please refer to the i860 XP CPU data 
sheet for details. 


MIIO# 
0 
CPU 
The i860 XP CPU Memory/IO 
pin is used by the 82495XP to decode 
memory 
I 
495 
and I/O cycles. Please refer to the i860 XP CPU data sheet for details. 


MAWEA# 
0 
495 
The 82495XP asserts Memory 
Bus Array Write Enable or Allocation 
signal 
I 
490 
to the 82490XP to indicate that the data in the memory buffers should be 
written to the array, or that an allocation 
should occur. 


MCYC# 
0 
495 
The 82495XP 
asserts Memory 
Bus Cycle to the 82490XP 
to indicate that the 
I 
490 
current cycle will use the memory buffers. 


NA# 
I 
CPU 
The 82495XP drives Next Address 
to the i860 XP CPU when pipelineing 
CPU- 
0 
495 
to-cache 
cycles. See the i860 XP CPU data sheet for details. 


PCD 
0 
CPU 
The i860 XP CPU Page Cacheablllty 
Disable attribute 
bit is driven on PCD to 
I 
495 
indicate cacheability. 
PCD active causes the 82495XP 
to make the current 
cycle non-cacheable. 
Please refer to the i860 XP CPU data sheet for details. 
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1.7 Quick Pin Reference (Optimized Interface) 
(Continued) 


Symbol 
Type 
Name and Function 


PCYC 
0 
CPU 
The i860 XP CPU drives the Page Cycle 
output during page-table 
special cycles. 


I 
495 
See the i860 XP CPU data sheet for details. 


PWT 
0 
CPU 
The i860 XP CPU Page Write-Through 
attribute 
is driven on PWT to indicate 
I 
495 
write-through. 
PWT active causes the 82495XP to make the current cycle write- 
through. 
Please refer to the i860 XP CPU data sheet for details. 


SETO-10 
I/O 
495 
The 82495XP 
Set Address 
pins are connected 
to 11 bits of the CPU address. 


TAGO-11 
I/O 
495 
The 82495XP Tag Address 
pins are connected 
to 12 bits of the CPU address. 


W/RII 
0 
CPU 
The i860 XP CPU Write/Read 
pin is driven to the 82495XP 
and 82490XP to 
I 
495 
indicate a read or write cycle. Please refer to the i860 XP CPU data sheet for 
I 
490 
details. 


WAY 
0 
495 
The 82495XP Way indication 
is used by the 82490XP to properly 
load and store 
I 
490 
buffers as well as update the MRU bit. 


WB/WTII 
I 
CPU 
The Write 
Back/Write 
Through 
input to the i860 XP CPU is driven by the 
0 
495 
82495XP to direct MESI protocol 
changes 
in the CPU's on-chip caches. 
See the 
i860 XP CPU data sheet for details. 


WBA 
0 
495 
The Write 
Back Buffer 
Address 
(2 lines per sector) 
pin is driven by the 
(SECU) 
I 
490 
82495XP to indicate which line is loaded by the 82490XP. 


The 82495XP 
drives SEC211 during RESET to the 82490XP 
to pass along lines/ 


sector information. 
If active, SEC211 indicates 
2 lines per sector. 


WBTYP 
0 
495 
The 82495XP Write 
Back Cycle 
Type (Line Ratio 
0) pin is driven to the 
(LRO) 
I 
490 
82490XP to indicate a write-back 
or snoop write-back 
cycle. 


LRO is driven by the 82495XP to the 82490XP 
at RESET to pass along line ratio 
information. 


WBWEII 
0 
495 
The 82495XP Write-Back 
Buffer 
Write 
Enable 
(Line Ratio 
1) pin is used in 
(LR1) 
I 
490 
conjunction 
with the WBA pin to load the write-back 
buffers. 


LR1 is driven by the 82495XP to the 82490XP 
at RESET to pass along line ratio 
information. 


WRARRII 
0 
495 
The 82495XP 
Write 
to 82490XP 
Array 
signal controls 
the writing of data into the 
I 
490 
82490XP 
array and updating of the MRU bit. 


2.0 
CHIPSET INTRODUCTION 


The 
82495XP/82490XP 
is 
a 
second-level 
cache 
controller 
chipset 
for the i860 XP CPU. The chipset 
provides 
a unified 
code 
and 
data 
cache 
which 
is 
software 
transparent. 
The 
82495XP/82490XP 
has 
been designed 
to support 
a high-speed 
CPU/cache 
core interface, 
and a same or lower speed 
memory 
bus interface. 


The 82495XP 
is the cache controller. 
It contains 
8K 
tags and control 
logic to control 
up to a 512K size 
cache. 
The 82490XP 
is a custom 
cache 
data RAM 
designed 
to be used with the 82495XP. 
Between 
8 
and 18 82490XPs 
are required 
to create 
a 256K to 
512K cache, 
respectively. 
The memory 
bus control- 
ler (M BC) is the set of logic required to interface 
the 
82495XP 
and 
82490XP 
to 
the 
memory 
bus. 
The 
MBC provides 
product 
differentiation, 
and its imple- 
mentation 
ultimately 
determines 
system 
perform- 
ance. 
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2.1 
Main Features 


The 
82495XP/82490XP 
have 
the 
following 
main 
features: 


- 
Tracks 
the speed of the i860 XP CPU 


- 
Large Cache Size support: 


4K or 8K Tags 


1 or 2 lines per sector 


4 or 8 transactions 
per line 


64 memory 
bus or 128-bit wide memory 
bus 


256K or 512K cache 


- 
Write-Back 
cache 
with full multiprocessing 
con- 
sistency 
support: 


supports 
the MESI protocol 


watches 
memory 
bus to guarantee 
1st level, 2nd 
level cache consistency 


maintains 
inclusion 


- 
Two-way 
set-associative 
with MRU hit prediction 
algorithm 


intel~ 


- 
Zero wait state hit cycles on MRU hit. One wait 
state on MRU misses 
- 
Concurrent CPU and Memory Bus transactions 
Supports 
synchronous, 
asynchronous, 
and 
strobed memory bus architectures 


2.2 
CPU/Cache Core Description 


Figure 2-1 depicts a block diagram of the basic 
cache subsystem. The cache subsystem provides a 
gateway between the CPU and the memory bus. All 
CPU accesses which can be serviced locally by the 
cache subsystem will be filtered out from the memo- 
ry bus traffic. Therefore local cycles (CPU cycles 
which hit the cache and do not require a memory 
bus cycle) will be completely invisible to the memory 
bus providing the reduction in memory bus band- 
width necessary for multiprocessing systems. Anoth- 
er very important function of the B2495XP cache 
subsystem is to provide speed decoupling between 
the CPU and memory busses. Processors are quick- 
ly achieving operating frequencies which can be 
very difficult for the memory subsystem to meet. The 
B2495XPcache subsystem is optimized to serve the 
CPU with zero wait-states up to very high frequen- 
cies (50 Mhz), at the same time providing the decou- 
pling necessary to run slower memory bus cycles. 


The Basic Functions of the cache subsystem ele- 
ments are: 


82495XP: Main control element, includes the tags 
and line states and provides hit or miss decisions. It 
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Figure 2-1. 82495XP Cache Subsystem 


handles the CPU bus requests completely and coor- 
dinates with the memory bus controller when an ac- 
cess 
needs 
the 
memory 
bus. 
It 
controls 
the 
B2490XP data paths for both hits/misses to provide 
the CPU with the correct data. It dynamically adds 
wait states based on the MRU prediction mecha- 
nism. The B2495XP is also responsible for perform- 
ing memory bus snoop operations while other devic- 
es are using the memory bus. The B2495XP drives 
the cycle address and other attributes during a 
memory bus access. 
A 
block 
diagram of 
the 
B2495XP is shown in Figure 2-2. 


To/from 
CPU 


Cache 
Directory 
Array 


To/from 
Memory _--_ 
Bus 
Controller 
Memory 
Bus 
Controller 
Interface 


Memory 
Bus 
Queue 
(Tag 
RAM) 


240956-6 


Figure 2-2. 82495XP Block Diagram 
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82490XP: Implements the cache SRAM storage and 
data path. It includes latches, muxes, logic which 
allow it to work in lock-step with the 82495XP to 
efficiently serve both hit and miss accesses. It takes 
full advantage of internal silicon flexibility to provide 
a degree of performance otherwise unachievable 
with discrete implementations. It supports zero wait 
state hit accesses, concurrent CPU and memory bus 
'accesses, and includes a replication of the MRU bits 
for autonomous way prediction. During memory bus 
cycles it acts as a gateway between CPU and mem- 
ory buses. A block diagram of the 82490XP is shown 
in Figure 2-3. 


Memory Bus Controller: Server for memory bus cy- 
cles. It adapts the CPU/Cache core to a specific 
memory 
bus 
protocol. 
It 
coordinates 
with 
the 
82495XP line fills, flushes, write-backs, etc. The 
memory bus controller's flexibility allows customers 
to easily adapt the 82495XP cache subsystem to 
their specific architectures, and to provide their own 
differentiation. Figure 2-4 shows an example memo- 
ry bus controller. The MBC handles all cycle control, 
data transferring, snooping, and any synchroniza- 
tion. 
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Figure 2-3. 82490XP Block Diagram 
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Figure 2-4. MBC Example Block Diagram 
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3.0 
CACHE OVERVIEW 


This chapter gives a brief description of 82495XP/ 
82490XP configurations, interface, snooping mecha- 
nism, cycle control mechanism, and memory bus 
control mechanism. Each section of this overview is 
described in more detail in later chapters. 
' 


3.1 Configuration 


The 82495XP/82490XP cache chipset offers a num- 
ber of configuration options. The system designer 
can choose from a number of different operating 
characteristics, 
including 
memory 
bus 
modes, 
snooping modes, and internal physical attributes 
(line size, lines per sector, etc.). The flexibility of 
these configuration options allow the 82495XP/ 
82490XP cache to be used in a wide range of appli- 
cations. 


Configurations 
are 
selected 
by 
altering 
the 
82495XP/82490XP inputs during RESET. They are 
not dynamically changeable, and to conserve pins 
some configuration 
inputs become 82495XP or 
82490XP inputs/outputs after RESET. 


3.1.1 PHYSICAL CACHE 


Physically, the 82495XP/82490XP can be config- 
ured to support many different cache configurations. 
By selecting one cache configuration, other configu- 
rations may be excluded. The 82495XP/82490XP 
can be configured to support: 
- 
256K or 512K cache 
- 
64 or 128 bit wide memory bus 
- 
One or two lines per sector 


82495XP Cache Controller/82490XP 
Cache RAM 


- 
1:1, 1:2, or 1:4 CPU to 82495XP line size ratio 


- 
4 or 8 memory bus transactions per line 


- 
4K or 8K tag size 
- 
Strong or weak write ordering 


Figure 3-1 summarizes the basic configurations 
available when using the 82495XP/82490XP. 


3.1.2 SNOOP MODES 


When another master snoops the 82495XP, the 
MBC must initiate the snoop request and pass on 
the response. The 82495XP allows the MBC to initi- 
ate this snoop request in one of three modes: syn- 
chronous, clocked, and strobed. The snoop re- 
sponse of the 82495XP is always synchronous. 


When initiating the snoop in synchronous snoop 
mode, all snoop 
information is latched by' the 
82495XP synchronous to the CPU ClK. The snoop 
is then performed on the next ClK edge and the 
response given on the ClK edge after that. This is 
the fastest possible method of snooping. 


In clocked snooping mode, information is latched by 
the 82495XP with respect to an external snoop 
clock (slower than ClK) source. The 82495XP must 
internally synchronize this information to ClK and 
provide a response. 


In strobed snooping mode, information is latched 
into the 82495XP with respect to the falling edge of 
another signal. Thus, the snoop initiation is clock in- 
dependent. The 82495XP again synchronizes this in- 
formation with ClK. 


MEM BUS ~ 64 Bits 
MEM BUS ~ 128 Bits 
Number of 


4 Trans. 
8 Trans. 
4 Trans. 
8 Trans. 
82490XP Devices 


1 
2 
LR ~ 1 
LR ~ 2 
Tags ~ 8k 
Tags ~ 4k 
8 
LIS ~ 1 
LIS ~ 1 


3 
4 
4 
5 
LR ~ 1 
LR ~ 2 
LR ~ 2 
LR ~ 4 
Tags ~ 8k 
Tags ~ 8k 
Tags ~ 8k 
Tags ~ 4k 
16 
LIS ~ 2 
LIS ~ 1 
LIS ~ 1 
LIS ~ 1 


[.~ 
--,l Nol Supported 
LR ~ 82495XP/CPU line Ratio 
LIS ~ 82495XP lineslSector 


Cache Device 
2. 4. 8 Bits Wide 


Figure 3-1. 82495XP/82490XP Configurations 
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source synchronous or half-clock (divided) synchro- 
nous to the CPU ClK. 


In strobed mode, data is driven from the rising edge 
of one signal, and read with the rising edge of anoth- 
er. Like the strobed snooping mode, this carries no 
clock skew problems, or memory bus speed limita- 
tions. 
' 


3.2 
CPU Bus Interface 


The CPU bus interface is the connection of the 
82495XP and 82490XP to the i860 XP CPU. Be- 
cause this interface is optimized to achieve the high 
speed performance, it is not a flexible interface. The 
majority of the signals in the CPU bus interface must 
be 
connected 
strictly 
between 
the 
82495XPI 
82490XP cache and the i860 XP CPU. Chapter 10 
addresses the use of such signals. 


Some CPU signals are, however, accessible by the 
MBC. These are the following pins: RESET, ClK, 
BROY2i11,INT, BERR, PCHKiII, PENill, TCK, TOI, 
TMS, TRSTiII, and TOO, CPU pins KBO,KB1, HITiII, 
and BREQ are also available to the MBC, but are of 
limited use in an 82495XP/82490XP system. 


Other CPU pins flow through a '377 type latch to the 
MBC. The latch enable is controlled by the 82495XP 
through the BlE ill pin. The following CPU signals 
flow through this latch: PCO, PWT, BEOill-BE7i11, 
CACHEill, lEN, PCYC, and CTYP. 


3.3 
82495XP/82490XP 
Interface 


The 82495XP/82490XP interface is the connection 
between 82495XP and 82490XP. Like the CPU bus 
interface, this isolated interface is not flexible and 
may not be altered beyond what Intel has provided. 
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logic block, a cycle control block (with synchronizers 
if necessary), and data path control block. The 
snoop block must be able to communicate with the 
other caches when snooping is necessary. At the 
same time, the cycle control block must interface to 
some arbitration logic for bus arbitration. 


3.4.1 
SNOOPING 
LOGIC 


The MBC snooping logic is responsible for initiating 
a snoop in the 82495XP and providing the response 
to the rest of the system. Snoop logic must recog- 
nize what other caches are doing, and snoop if nec- 
essary. Snoop logic must also recognize when its 
82495XP is not capable of snooping and delay its 
snoop initiation. 


When a cycle begins on the bus, all other caches 
snoop. Once all the snoop results are returned to 
the master 82495XP, its snoop logic must recognize 
the result and alter the cycle appropriately. This 
could mean aborting the current cycle in memory, 
delaying the cycle until a write-back is performed, or 
changing the master's tag state according to the 
snoop information. 


3.4.2 
CYCLE 
CONTROL 
LOGIC 


Cycle control logic is responsible for initiating a 
memory bus cycle, providing proper 82495XP cycle 
attributes during the cycle, and terminating the cy- 
cle. Cycle control logic determines the cacheability 
of the cycle, whether cycles are allocatable, pipelin- 
ing, and all aspects of the progress of the current 
cycle. 


Since cycle control logic interfaces memory bus sig- 
nals to the 82495XP, and since the memory bus is 
not necessarily synchronous to the 82495XP ClK, it 
may also provide proper synchronization. Careful 
design of this synchronization logic can minimize or 
eliminate synchronization penalties. 
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3.4.3 
DATA 
PATH CONTROL 


Data path control logic controls how data is written 
from the 82490XP or read into the 82490XP and 
CPU. It handles the actual transferring of data tol 
from the memory data bus. Data path control logic 
also handles the CPU burst order, and the holding of 
data during allocation cycles. In systems with memo- 
ry busses that are wider than the CPU bus, the data . 
path control logic appropriately steers data to the 
correct 82490XP's. 


3.5 Test 


The 
82495XP/82490XP 
provide 
two 
means 
of 
cache testing. These are a built-in self-test, and 
boundary scan test. The built-in self-test (BIST) is 
initiated during RESET. The boundary scan test 
uses separate and dedicated pins on the 82495XP. 
These are described in a later chapter. 


4.0 
CACHE CONSISTENCY 
PROTOCOL 


One of the 82495XP objectives is to implement a 
high performance second level cache for multipro- 
cessor systems. To fulfill this objective the 82495XP 
implements a "write-back" cache with full support 
for multiprocessing data consistency. Being a write- 
back cache means that the 82495XP may contain 
data which is not updated in the main memory. 
Therefore a mechanism is implemented to insure 
that data read by any system bus master, at any 
time, is correct. 


. A key feature for multiprocessing systems is reduc- 
tion of the memory bus utilization. The memory bus 
quickly becomes a resource bottleneck with the ad- 
dition of multiple processors. The 82495XP cache 
consistency mechanism insures minimal usage of 
memory bus bandwidth. 


The 82495XP allows portions of memory to be de- 
fined as non-cacheable. For the cacheable areas, 
the 82495XP allows selected portions to be defined 
as write-through locations. 


The 82495XP protocol is implemented by assigning 
state bits for each cached line. Those states are de- 
pendent on both 82495XP data transfer activities 
performed as the bus master, and snooping activi- 
ties performed in response to snoop requests gener- 
ated by other memory bus masters. 
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4.1 
Cache Consistency Protocol 
Model 


The 82495XP consistency protocol is the set of rules 
which allows the 82495XP to contain data that is not 
updated in main memory while ensuring that memo- 
ry accesses by other devices do not receive stale 
data. This consistency is accomplished by assigning 
a special consistency state to every cached entry 
(line) in the 82495XP. 


NOTE: 
The following rules apply to memory read and write 
cycles. All 
1/0 
and special cycles 
bypass the 
cache. 


The 82495XP protocol consists of 4 states. They de- 
fine whether a line is valid (hit or miss), if it is avail- 
able in other caches (shared or exclusive), and if it is 
modified (has been modified). 


The 4 States are: 
[I] -INVALID 
Indicates that the line is not avail- 
able in the cache. A read to this 
line will be a miss and cause the 
82495XP to execute a line fill 
(fetch the whole line and deposit 
it into the cache SRAM). A write 
to 
this 
line 
will 
cause 
the 
82495XP 
to 
execute 
a 
write- 
through cycle to the memory bus 
and in some circumstances initi- 
ate an ALLOCATION. 
[Sj - SHARED 
This state indicates that this line 
is potentially shared with other 
caches (The same line may exist 
in 
more 
than 
one 
cache). 
A 
Shared line can be read out of the 
cache 
SRAM 
without 
a 
main 
memory 
access. 
Writing 
to 
a 
Shared 
line 
updates 
the 
82495XP/82490XP 
cache, 
but 
also 
requires the 
82495XP to 
generate a write-through cycle to 
the memory bus. In addition to 
updating main memory, the write- 
through cycle will invalidate this 
line in other caches. Since writing 
to a Shared line causes a write- 
through cycle, the system can en- 
force a "write-through policy" to 
selected 
addresses 
by 
forcing 
those 
addresses 
into 
the 
[Sj 
state. This can be done by setting 
the PWT attribute in the CPU 
page 
table 
or 
asserting 
the 
MWB/WT # pin each time the ad- 
dress is referenced. 
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[E] - EXCLUSIVE This state indicates a line which is 
exclusively available in ONLY this 
cache, and that this line is NOT 
MODIFIED (main memory also 
has a valid copy). Writing to an 
Exlusive line causes it to change 
to the Modified state and can be 
done 
without 
informing 
other 
caches, so no memory bus activi- 
ty is generated. 


[M] - MODIFIED 
This state indicates a line which is 
exclusively available in ONLY this 
cache, and is MODIFIED (main 
memory's copy is stale). A Modi- 
fied line can be updated locally in 
the cache without acquiring the 
memory bus. Because a Modified 
line is the only up-to-date copy of 
data, it is the 82495XP's respon- 
sibility to flush this data to memo- 
ry on accesses to it. Flushing of 
this data to memory will be exe- 
cuted immediately after comple- 
tion of the current CPU bus cycle. 


4.2 Basic State Transitions 


This section covers the most common, basic memo- 
ry accesses. The special functions which force a cy- 
cle to be noncacheable, locked, read only, or direct- 
to-Modified are not in use. These might be used, for 
example, in read for ownership and cache to cache 
transfers, and are covered in section 4.3. This basic 
transitions section is divided into two parts: the first 
covers MESI state changes which occur in a CPU/ 
cache core due to its own actions; the second de- 
scribes MESI state transitions in a CPU/cache core 
caused by the actions of other, external devices. 
Figure 4-1 shows a partial state diagram of the MESI 
coherency protocol which includes these basic tran- 
sitions. 


The 82495XP accepts line attributes from the CPU 
and memory buses. The 82495XP assumes that all 
caches on the memory bus have the SAME number 
of bytes per line. 


4.2.1 TRANSITIONS IN CACHE STATES 
CAUSED BY OWN CPU TRANSACTIONS 


The MESI state of each 82495XP/82490XP cache 
line changes as the 82495XP/82490XP services the 
read and write requests generated by its CPU. 
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4.2.1.1 Read Hit 


A read hit occurs when the CPU generates a read 
cycle on its bus, and the data is present in and re- 
turned by the 82495XP/82490XP. The state of the 
cache line (M, E, or S) remains unchanged by a read 
operation which hits the cache. 


4.2.1.2 Read Miss 


A read miss arises when the CPU generates a read, 
and 
the 
data 
is 
not 
present 
in 
the 
82495XP/82490XP cache-either 
the tag lookup 
does not produce a match or a match occurs but the 
data is Invalid. The 82495XP generates a memory 
access to fetch the data (which is assumed cache- 
able for this discussion) and the surrounding data 
needed to fill the cache line. This data is placed in 
the 82495XP/82490XP cache in an invalid line or (if 
both valid) replaces the least recently used line, 
which is written back to memory if Modified. 


The new line is placed in the Exclusive state, unless 
either the CPU or memory indicates that it should be 
a write-through on its next write access using PWT 
or MWB/WT #, respectively. If either of these is as- 
serted, the new line is placed in Shared state. A new 
line could also be read in and placed directly into 
Modified state: see section 4.3.4 for details and use. 


4.2.1.3 Write Hit 


When the CPU generates a write cycle, if the data is 
present in the 82495XP/82490XP cache, it is updat- 
ed and may undergo a MESI state change. 


If the hit line is originally in the Exclusive state, it 
changes to Modified state upon a write. If the hit line 
is originally in the Modified state, it remains in that 
state. Neither of these cases generates any bus ac- 
tivity. 


A write to a line which is in the Shared state causes 
the 82495XP to write the data out to memory as well 
as update the 82495XP/82490XP cache. The write 
to main memory also serves to invalidate any copy 
of the data which resides in another cache. The 
cache line state changes according to activity on the 
PWT and MWB/WT # pins. If neither of these pins is 
asserted, the write hit line becomes Exclusive. If ei- 
ther of these pins is asserted, the line is forced to 
remain write-through, so the state remains Shared. 
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An existing line can also be written and forced di- 
4.2.2.1 Snooping 
rectly into Modified state: see section 4.3.4 for de- 
• 


tails and use. 


4.2.1.4 Write Miss 


The CPU generates a write cycle, and the data is not 
present in the 82495XP/82490XP cache. In a simple 
write miss, the 82495XP/82490XP assists CPU in 
delivering data to memory, but the data is not placed 
in the cache. No cache lines are affected, so no 
state changes take place, 


4.2.1.5 Write Miss with Allocate 


This is a special case of a write miss where the 
memory location written by the CPU is not currently 
in the 82495XP/82490XP cache, but is brought into 
the cache and updated. Like a regular write miss, 
the 82495XP/82490XP assists the CPU in writing 
the data out to main memory. After the data is writ- 
ten to memory, the 82495XP/82490XP reads back 
the same data following the rules of a read miss, 
above. 


The ability to perform an allocation depends on all of 
the following conditions: 


the write is cacheable 
PWT is not asserted, forcing write-through 
the write is not LOCKed 
the write is to memory (not to 1/0) 


4.2.2 TRANSITIONS CAUSED BY OTHER 
DEVICES ON BUS 


MESI state transitions in the 82495XP/82490XP 
cache of one core (CPU/82495XP/82490XP) can 
be induced by actions initiated by other cores or de- 
vices on the shared memory bus. In the following, 
the 82495XP which is responding to actions of other 
devices does not currently own the bus, and may be 
referred to as a "slave" or, in the case of snooping, 
a "snooper". The device which currently owns the 
bus is the "master". 


The master which is accessing data from memory 
on the bus sends a request to all caching devices on 
the bus (snoopers) that they check or snoop their 
caches for a more recently updated version of the 
data being accessed. If one of the snoopers has a 
copy of the requested data, it is termed a "snoop 
hit". 


If a snooper has a modified version of the data 
("snoop hit to a Modified line"), it proceeds to gener- 
ate an "inquire cycle" to the i860 XP CPU, asking 
the i860 XP CPU if it also has a Modified copy of the 
line (which would be more recently modified than the 
82495XP/82490XP's version). The most up-to-date 
line 
is 
written 
out 
by 
the 
snooping 
82495XP/82490XP to the bus (to main memory or 
directly to the requesting master) so that the re- 
questing master can utilize it. 


The changes in MESI protocol state in a snooping 
cache which has a snoop hit depend on attribute 
inputs SNPINV and SNPNCA, which are driven by 
the master. 


The 
SNPINV 
input 
tells 
a 
snooping 


82495XP/82490XP 
to 
invalidate the 
line 
being 
snooped if hit: the master requesting the snoop is 
about to write to its copy of this line and will there- 
fore have the most up-to-date copy. When SNPINV 
is asserted on the snoop request, any snoop hit is 
placed in Invalid state, and a "back invalidation" is 
generated which instructs the CPU to check its 
cache and likewise invalidate a copy of the line. 
When the snooping 82495XP has a snoop hit to a 
Modified line and SNPINV was asserted by the bus 
master, the back invalidate is combined with the in- 
quire cycle. 


The 
SNPNCA 
input 
tells 
a 
snooping 
82495XP/82490XP whether the requesting master 
is performing a Non-Caching Access. If the request- 
ing master is not caching the data, a snoop hit to a 
Modified or Exclusive line can be placed in the Ex- 
clusive state: since the requester isn'~ caching the 


• 
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line, if the snooper has a future write hit to the line, 
an invalidation does not have to be broadcast. If the 
requesting master is caching the data, then a snoop 
hit to a Modified or Exclusive line must be placed in 
the Shared state, which insures that a future write hit 
causes an invalidation to other caches. Note that a 
snoop hit to a Shared line must remain in the Shared 
state regardless of SNPNCA. Also note that an as- 
serted SNPINV always overrides SNPNCA. 


4_2_2.2 
Cache Synchronization 


Cache synchronization is performed to bring the 
main 
memory 
up-to-date 
with 
respect 
to 
the 
82495XP/82490XP. 
Two 
devices 
exist 
in 
the 
82495XP/82490XP to accomplish this: FLUSH and 
SYNC. 


A cache flush is initiated by asserting the 82495XP 
FLUSH# pin. Once initiated, the 82495XP writes all 
Modified lines out to main memory, performing back 
invalidations and inquire cycles on the CPU. When 
completed, all 82495XP/82490XP and CPU cache 
entries will be in the Invalid state. 


Activation of the SYNC# pin also causes all of the 
82495XP's Modified lines to be written to memory. 
Unlike the FLUSH# pin, the cache lines remain valid 
after the SYNCH# 
process has completed, with 
Modified lines changing to the Exclusive state. 
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4.3 
The Effects of Special Cycles on 
MESI States 


4.3.1 NON-CACHEABLE ACCESS 


The 82495XP allows cacheability to be determined 
on both a per page and per line basis. The page 
cacheability function is determined by software, 
while cacheability on a line-by-line basis is driven by 
hardware. 


The PCD (Page Caching Disabled) pin is a 82495XP 
input driven by the CPU's PCD output, which corre- 
sponds to a cacheability bit in the page table entry of 
a memory location's virtual address. If the PCD bit is 
asserted when the CPU presents a memory ad- 
dress, that location will not be cached in either the 
82495XP or the CPU. 


MKEN# is a 82495XP input which connects to the 
memory bus controller or the memory bus. MKEN# 
inactive prevents the caching of the memory loca- 
tion in both the 82495XP and the CPU, affecting only 
the current access. 


If a read miss is indicated non-cacheable by either of 
these, 
the 
line 
is 
not 
placed 
in 
the 
82495XP/82490XP or CPU cache, and no cache 
states are modified. On a write miss, a noncachea- 
ble indication from either input forces a write miss 


READ HIT 
SNooP.INV 
+ FLUSH 
-------- 
,"';EAD 
IIISS + WRI~'''''''' 
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READ 
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'.• 
-- •.•...•.•_-----_.;' 
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Figure 4-1. Major State Transitions 
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without 
allocation. 
Note 
that 
if 
the 
82495XP/ 
82490XP already has a valid copy of the line, the 
PCD attribute from the CPU is ignored. 


4.3.2 READ ONLY ACCESSES: MRO# 


The MRO# (Memory Read Only) input is driven by 
the memory bus to indicate that a memory location 
is read only. 


When asserted during a read miss line fill, MRO# 
causes 
the 
line 
to 
be 
placed 
in 
the 
82495XP/82490XP cache in the Shared state and 
also sets a read-only bit in the cache tag. MRO# 
accesses are not cached in the CPU. On subse- 
quent write hits to a read-only line, the write is actu- 
ally written through to memory without updating the 
82495XP/82490XP 
line, 
which 
remains 
in 
the 
Shared state with the read-only bit set. 


4.3.3 LOCKED ACCESSES: LOCK# 


The LOCK# signal driven by the CPU indicates that 
the requested cycle should lock the memory loca- 
tion for an atomic memory access. Because locked 
cycles are used for interprocessor and intertask syn- 
Chronization, all locked cycles will appear on the 
memory bus. 


On a locked write, the 82495XP treats the access as 
a write-through cycle, sending the data to the memo- 
ry bus-updating 
memory and invalidating other 
cached copies. If the data is also present in the 
82495XP/82490XP cache, it is updated but its M, E, 
or S state remains unchanged. 


For locked reads, the 82495XP assumes a cache 
miss and starts a memory read cycle. If the data 
resides in the 82495XP/82490XP, the M-E-S state 
of the data remains unchanged. If the requested 
data is in the 82495XP/82490XP and is in the Modi- 
fied state when the memory bus returns data, the 
82495XP will use the 82490XP data and ignore the 
memory bus data. 


LOCKed read and write cycles which miss the 
82495XP/82490XP cache are noncacheable in both 
the 82495XP/82490XP and CPU. 
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4.3.4 FORCING LINES DIRECT·TO·MODIFIED: 
DRCTM# 


The DRCTM# (Direct To Modified) pin is an input 
which informs the 82495XP to skip the Exclusive 
state and place a line directly in the Modified state. 
The 
signal 
can 
be 
asserted 
during 
82495XP/82490XP reads of the memory for special 
82495XP/82490XP 
data accesses 
like read-for- 
ownership and cache-to-cache-transfer. The signal 
can also be asserted during writes, for purposes of 
cache tracking. 


4.4 State Tables 


Lines cached by the 82495XP can change states as 2 
a result of either the CPU bus activity (that some- 
times require the 82495XP to become a memory bus 
master) or as a result of memory bus activity gener- 
ated by other system masters (snooping). 


State transitions are affected by the type of CPU/ 
memory bus transactions (reads, writes) and by a 
set of external input signals and internally generated 
variables. In addition, the 82495XP will drive certain 
CPU/memory bus signals as a result of the consist- 
ency protocol. 


4.4.1 CPU BUS 


- 
PWT (Page Write Through, PWT Input pin) Indi- 
cates a CPU bus write-through request. Activat- 
ed by the i860 XP CPU PWT pin. This signal af- 
fects line fills and will cause a line to be put in the 
[SI state if active. The 82495XP will NOT exe- 
cute ALLOCATIONS (line fills triggered by a 
write) for write-through lines. If PWT is asserted, 
it 
overrides 
a 
write-back 
indication 
on 
the 
MWB/WT # pin. 


- 
PCD (Page Cacheability Disable, PCD input pin): 
indicates that the accessed line is noncachea- 
ble. If PCD is asserted, it overrides a cacheable 
indication from an asserted MKEN#. 


- 
NWT 
(i860 XP CPU Write-Through Indication, 


82495XP's WB/WT # 
Output Pin): When low 
forces the i860 XP CPU to keep the accessed 
line into the SHARED state. 
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Write back mode (WB=1) will be indicated by 
the !NWT notation. In those cases the i860 XP 
CPU is allowed to go into exclusive states [E], 
[M]. NWTis 
normally active unless explicitly stat- 
ed. 


- 
KEN (CPU caching enable, KEN# output pin): 
When active indicates that the requested line 
can be cached by the CPU 1st level cache. KEN. 
is normally active unless explicitly stated. 


4.4.2 
MEMORY BUS 


- 
MWT 
(Memory Bus Write-Through Indication, 
MWB/WT# 
Input Pin): When active forces the 
82495XP to keep the accessed line into the 
SHARED state. Write back mode (MWB= 1) will 
be indicated by the !MWTnotation. In those cas- 
es the 82495XP is allowed to go into exclusive 
states [E], [M]. 


- 
DRCTM 
(Memory Bus Direct To [M] indication, 


DRCTM# Input Pin): When active forces skip- 
ping of the [E] state and direct transfer to [M]. 


- 
MKEN 
(Memory 
Bus 
Cacheability 
Enable, 
MKEN# Input pin): When Active Indicates that 
the memory bus cycle is cacheable. 


- 
MRO 
(Memory 
Bus 
Read-Only 
Indication, 


MRO# Input Pin): When Active forces line to be 
READ-ONLY. 


- 
MTHIT(Tag 
Hit, MTHIT# Output pin): Activated 
by the 82495XP during snoop cycles and indi- 
cates that the current snooped address hits the 
82495XP cache. 


- 
MHITM 
(Hit to a line in the [M] State, MHITM# 
Output pin): Activated by the 82495XP during 
snoop cycles and indicates that the current 
snooped address hits a modified line in the 
82495XP cache. 


- 
SNPNCA 
(Non Caching device access): When 
active indicates to the 82495XP that the current 
bus master is a non-caching device. 


- 
SNPINV 
(Invalidation): When active indicates to 
the 82495XP that the current snoop cycle will 
invalidate that address. 


4.4.3 
TAG STATE 


- 
TRO (Tag Read Only, 82495XP Tag bit): This bit 
when set indicates that the 1 or 2 lines associat- 
ed with this tag are Read-Only lines. 
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As a function of State Changes the 82495XP 
may execute the following cycles: 


- 
BINV: Execution of a CPU Back Invalidation Cy- 
cle (Snoop with INV active) 


- 
INOR: Execution of a i860 XP CPU Inquire Cy- 
cle(l). 


- 
WBCK: 
82495XP Write-Back Cycle. This is a 
Memory Bus write 
cycle 
generated 
by the 
82495XP when MODIFIED data cached in the 
82495XP needs to be copied back into main 
memory. A write-back cycle affects a complete 
82495XP line. 


- 
WTHR: 82495XP Write Through Cycle. This is a 
system write cycle in response to a processor 
write. It mayor may not affect the cache SRAM 
(update). In a write-through cycle, the 82495XP 
drives the Memory Bus with the same Address, 
Data and Control signals as the CPU does on the 
CPU Bus. Main Memory is updated, and other 
Caches invalidate their copies. 


- 
RTHR: 82495XP Read Through cycle. This is a 
special cycle to support locked reads to lines 
that hit the 82495XP cache. The 82495XP will 
request a Memory Bus cycle for lock synchroni- 
zation reasons, data will be supplied from the 
BUS except for [M] state which will have data 
supplied from the CACHE. 


- 
LFIL: 82495XP Cache line fill. 82495XP will gen- 
erate Memory Bus cycles to fetch a new line and 
deposit into the cache. 


- 
RNRM: 82495XP Read Normal Cycle: This is a 
normal read cycle which will be executed by the 
82495XP for non-cacheable accesses. 


- 
SRUP: 
82495XP SRAM UPDATE. Occurs any 
time new information is placed in the 82495XP 
cache. An SRAM update is implied in the LFIL 
cycle. 


- 
ALLOC: 
82495XP ALLOCATION. Write Miss cy- 
cle that has determined to be cacheable so the 
82495XP issues a line read. 


NOTE: 


1. An inquire cycle may be executed with INV ac- 
tive, performing a back-invalidation simultaneously. 
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STATE TABLES 


Table 4-1. Master 82495XP 
Read Cycle 


Pres. 
Mem 
CPU 


State 
Condition: 
Next State 
Bus 
Bus 
Comments 
Activity 
Activity 


M 
!LOCK:M 
- 
!NWT 
Normal Read Hit [M] 


LOCK:M 
RTHR 
!KEN 
Read Through Cycle, Data From 
Array 


E 
!LOCK:E 
- 
NWT 
Normal Read Hit [E] 


LOCK:E 
RTHR 
!KEN 
Read Through Cycle, Data From 
Memory 


S 
!LOCK.!TRO:S 
- 
NWT 
Normal Read Hit [SI 


!LOCK.TRO:S 
- 
!KEN 
Normal Read to Read-Only 
sector. Stays in [SI state and 
deactivate KEN to prevent CPU 


- 
from caching line 


LOCK:S 
RTHR 
!KEN 
Read Through Cycle, Data from 
Memory 


I 
PCD+ !MKEN+ LOCK:I 
RNRM 
!KEN 
Non-Cacheable 
Read, Locked 
cycles 


!PCD.MKEN.!LOCK.MRO:S 
LFIL 
!KEN 
Cacheable read, Read-Only. Fill 
line to 82495XP. Do not allow 
CPU to cache line by deactivating 
KEN #. Set the 82495XP's TRO 
bit to indicate the sector read only 
attribute 


!PCD.MKEN.!LOCK.!MRO.(PWT+ MWT):S 
LFIL 
NWT 
Cacheable Reads, forced Write- 
Through 


!PCD.MKEN.!LOCK.!MRO.!PWT.!MWT.!DRCTM: 
E 
LFIL 
NWT 
Line not shared, thus enabling the 
82495XP to move into tan 
exclusive state 


!PCD.MKEN.!LOCK.!MRO.!PWT.!MWT.DRCTM: 
M 
LFIL 
NWT 
As before with direct [M] state 
transfer. Keep i860 XP CPU in 
Write Through mode 
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Table 4-2. Master 
82495XP 
Write 
Cycle 


Pres. 
Mem 
CPU 
Condition: 
Next State 
Bus 
Bus 
Comments 
State 
Activity 
Activity 


M 
!LOCK: M 
- 
SRUP, 
Write hit. Write to cache. Allow i860 XP 
!NWT 
CPU to perform 
internal write cycles 
(Enter intor [El, [Ml states). 


LOCK: M 
WTHR 
SRUP, 
Locked Cycle. Write-Through 
updating 
!NWT 
cache SRAM. Most updated 
copy of the 
line is still owned by 82495XP. 
All 
Locked write cycles are posted. 


E 
!LOCK:M 
- 
SRUP, 
Write hit. Update SRAM. Let i860 XP 
!NWT 
CPU execute 
internal write cycles. 


LOCK: E 
WTHR 
SRUP, 
Lock forces cycle to memory bus. Main 
NWT 
memory remains updated. 


S 
TRO:S 
WTHR 
- 
Read-Only. 
Write cycle with write 
through attribute 
from CPU or Memory 
Bus. Locked Cycles. 


!TRO.(PWT+ MWT+ LOCK):S 
WTHR 
SRUP, 
Not Read-Only. 
Write cycle with write 
NWT 
through attribute 
from CPU or Memory 
Bus. Locked Cycles. 


!TRO.!PWT.!LOCK.!MWT.!DRCTM:E 
WTHR 
SRUP, 
Not Read-Only. 
No write-through 
cycle, 
NWT 
no lock request allow going into 
exclusive 
state. 


!TRO.IPWT.!LOCK.!MWT.DRCTM:M 
WTHR 
SRUP, 
Not Read-Only. 
No write-through 
cycle, 
NWT 
no lock request allow going into 
exclusive 
state. DRCTM forces final 
state to M. 


I 
PCD+ !MKEN+ PWT+ LOCK+ MRO: I 
WTHR 
- 
Write Miss Non-Cacheable, 
Write- 
Through, 
locked cycle or Read-Only. 


!PCD.MKEN.!PWT.!LOCK.!MRO:I 
WTHR, 
- 
Write Mis with allocation. 
After the write 
LFIL 
cycle, a line fill (allocation) 
is scheduled. 


!PCD.MKEN.!PWT.!LOCK.MRO:S 
If MKEN and MRO are asserted, 
an 
allocation 
to the [Sl state will occur 
Allocation 
Final State 
ALLOC 
Allocation 
final state as a function 
of 
MWT:S 
line fill attributes. 


!MWT.!DRCTM:E 
!MWT.DRCTM:M 


NOTE: 
The WB/WT# 
pin will only be activated for 82495XP lines that are in the [M) state. In this state, the 82495XP always 
assumes that the line owner MAY be the i860 XP CPU. On all other states the i860 XP CPU will be forced to perform Write- 
Through cycles. This mechanism will make sure that any i860 XP CPU write cycle is seen at least once on the CPU Bus. 
Allocations, which are consequences of write-misses, will disregard the MKEN# and MRO# attributes during the line fill. In 
other words, once an allocation is scheduled, it cannot be cancelled. 
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Pres. 
Condition: 
Mem 
CPU 
Bus 
Bus 
Comments 
State 
Next State 
Activity 
Activity 


M 
!SNPNCA:S 
MTHIT 
INOR 
Snoop hit to modified 
line. 82495XP 
indicates 
tag hit and 
SNPNCA: 
E 
MHITM 
modified 
hit. 82495XP 
schedules 
flushing of the modified 


WBCK 
line to memory. 
If non-cacheable 
device, stay in [El state. 


E 
!SNPNCA: S ' 
MTHIT 
- 
' If snooping 
by cacheable 
device, indicate 
MTHIT and go 
SNPNCA: E 
to shared state. If no caching 
device only indicate 
MTHIT, 


stay exclusive. 


S 
S 
MTHIT 
- 


I 
I 
- 
- 


Table 
4-3. Snooping 
82495XP 
without 
Invalidation 
Request 


NOTE: 
• 
Usage 
of DRCTM 11 to avoid 
[E) states 
may be in conflict 
with the SNPCNA 
cycle 
attribute. 
Note 
in the table 
that 
snoops 
with SNPNCA 
may cause 
an [E) state transition. 


Pres. 
Mem 
CPU 


State 
Next State 
Bus 
Bus 
Comments 
Activity 
Activity 


M 
I 
MTHIT 
INOR, 
Snoop hit to modified 
line. 82495XP 
indicates 
tag hit and 
MHITM 
BINV 
modified 
hit. 82495XP 
schedules 
flushing 
of the modified 


WBCK 
line to memory. 
Invalidate 
CPU. 


E 
I 
MTHIT 
BINV 
Inidicate tag hit, infalidate 
82495XP, 
CPU lines. 


S 
I 
MTHIT 
BINV 
Same as before 


I 
I 
- 
- 


Table 
4-4. Snooping 
82495XP 
with 
Invalidation 
Request 


Pres. 
Mem 
CPU 


State 
Next State 
Bus 
Bus 
Comments 
Activity 
Activity 


M 
E 
WBCK 
INOR 
Get modified 
data from i860 XP CPU, flush to memory 
E 
WBCK 
- 


E 
E 
- 
- 
Memory already synchronized 


S 
S 
- 
- 
Memory already synchronized 


I 
I 
- 
- 


Table 
4-5. SYNC Cycles 
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Table 4-6. FLUSH Cycles 


Pres. 
Mem 
CPU 


State 
Next State 
Bus 
Bus 
Comments 
Activity 
Activity 


M 
I 
WBCK 
INOR, 
Flush and invalidate i860TMXP CPU 
BINV 


E 
I 
- 
BINV 
Invalidate i860 XP CPU 


S 
I 
- 
BINV 
Invalidate i860 XP CPU 


I 
I 
- 
- 


NOTE: 
Usage of DRCTM'" 
to avoid 
[E) states 
may be in conflict 
with the SYNC cycle. 
Note in the table that SYNC cycles 
move an 
[M) state 
line to [E). 


5.0 
CONFIGURATIONS 
line ratio, tag size, lines per sector, DUswidth, and 
cache size. These parameters are sampled at the 
falling edge of RESET and are not dynamically 
changeable. 
The 82495XP/82490XP 
cache system was de- 
signed to fit a variety of applications. For the great- 
est 
performance, each 
application 
requires the 
82495XP/82490XP to be configured differently. The 
82495XP/82490XP 
therefore has many possible 
configurations that are set on RESET and affect the 
82495XP/82490XP 
architecture, 
operation, 
and 
electrical characteristics. 


Because of physical cache constraints, choosing 
one parameter limits the flexibility of other parame- 
ters. The following table summarizes the possible 
i860 XP CPU basic cache configurations. CFGO- 
CFG2 are multiplexed to select one of 5 possible 
line ratio/tag size/lines per sector configurations. 
This information is automatically passed from the 
82495XP to 82490XP during RESET. CFGO-CFG3 
must be valid at least 10 clocks before RESET's fail- 
ing edge. 


5.1 Physical Cache 


The 
physical 
configurations 
of 
the 
82495XP/ 
82490XP consist 
of 
parameters that 
alter 
the 
82495XP/82490XP basic architecture. These are 


I Not Supported 
LR = 82495XP/CPU line Ratio 
'-- 
__ 
...J 
LIS = 82495XP lineslSector 


Trans = Memory Bus Transactions per 82490XP line Fill 


MEM BUS = 64 Bit. 
MEM BUS = 128 Bit. 
Number of 


4 Tran•. 
8 Trans. 
4 Trane. 
8 Trane. 
82490XP Devlcee 


1 
2 


LR = 1 
LR = 2 
Tags = 8k 
Tags = 4k 
8 
LIS = 1 
LIS = 1 


3 
4 
4 
5 
LR = 1 
LR = 2 
LR = 2 
LR = 4 
Tags = 8k 
Tags = 8k 
Tags = 8k 
Tags = 4k 
16 
LIS = 2 
LIS = 1 
LIS = 1 
LIS = 1 


Figure 5-1. 82495XP/82490XP Configurations 
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Table 5-1. CFG Configuration 
Inputs 


Cflg 
Line 
Llnesl 
No. of 
CFG2 
CFG1 
CFGO 
No. 
Ratio 
sec 
Tags 


1 
1 
1 
8K 
0 
0 
1 


2 
2 
1 
4K 
1 
1 
1 


3, 
1 
2 
8K 
0 
0 
0 


4 
2 
1 
8K 
0 
1 
1 


5 
4 
1 
4K 
1 
1 
0 


5.1.1 LINE RATIO (LR) 


Line Ratio (LR) is the ratio of the 82495XP/82490XP 
cache line size to the CPU cache line size. For ex- 
ample, if LR=2 
then the 82495XP/82490XP line 
size is 64 bytes. This information is also used to de- 
termine the number of back invalidations or inquire 
cycles to the i860 XP CPU. 


5.1.2 TAG SIZE (TAGS) 


The 82495XP/82490XP cache tag size may be 4K 
or 8K tag entries. By reducing tag size, the line ratio 
(LR) can be doubled without a change in cache size. 


5.1.3 LINES PER SECTOR (LIS) 


The 82495XP/82490XP may be non-sectored (LIS 
= 1) or contain two lines per sector (LIS = 2). If 
LIS 
= 2, then the 82495XP contains one tag for two 
consecutive cache lines and each cache line has its 
own set of MESI state bits. This allows just one line 
to be filled on replacements or written back on 
snoop hits. Both lines are written back during re- 
placements, if both are modified. 


5.1.4 BUS SIZE 


The 82495XP/82490XP supports 64 and 128 bit 
memory bus widths for the i860 XP CPU. 


5.1.5 CACHE SIZE 


The 82495XP/82490XP may be configured to be 
256K or 512K. Cache size is a direct result of the 
number of 
82490XP devices 
used. It takes 
8 
82490XP's to make a 256K byte cache and 16 
82490XP's for a 512K cache. 


5.1.6 FUNCTION AND ADDRESS 
CONNECTIONS (CFAO-CFA6) 


Table 5-2 lists which address lines should be con- 
nected to each of the CFAO-CFA6 lines for each 
cache 
configuration. 
CFAO-CFA6 
provide 
the 
82495XP with proper multiplexed addresses for 
each of the possible cache configurations. Depend- 
ing on the mode selected, either CFA5 or CFA4 will 
operate as the 82495XP's CTYP input. This input is 
connected to the i860 XP CPU's CTYP output. 


Table 5-2 also lists the connections between the 
82495XP's TAG and SET lines and the remaining 
CPU address lines. 


The 82495XP MCFA, MTAG, and MSET pins con- 
nect to the system memory address bus in the same 
order as the corresponding CFA, TAG, and SET pins 
are connected to the CPU. Note that the MCFA pin 
which corresponds to the CFA pin being used as 
CTYP should be left unconnected. 
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Table 5-2. CFA Address Connections 


Cflg 
Line 
Llnesl 
No. of 
CFA6 
CFA5 
CFA4 
CFA3 
CFA2 
CFA1 
CFAO TAG[11:0] 
SET[10:0] 
No. 
Ratio 
sec 
Tags 


1 
1 
1 
8K 
A5 
CTYP 
A31 
A30 
A29 
A4 
A3 
A28-A17 
A16-A6 


2 
2 
1 
4K 
A5 
CTYP 
A31 
A30 
A29 
A4 
A3 
A28-A17 
A16-A6 


3 
1 
2 
8K 
A6 
A5 
CTYP 
A31 
A30 
A4 
A3 
A29-A18 
A17-A7 


4 
2 
1 
8K 
A6 
A45 
CTYP 
A31 
A30 
A4 
A3 
A29-A18 
A17-A7 


5 
4 
1 
4K 
A6 
A5 
CTYP 
A31 
A30 
A4 
A3 
A29-A18 
A17-A7 


5.2 
Cache Modes 


Cache 
modes 
are 
ways 
of 
configuring 
the 
82495XP/82490XP to operate differently. These op- 
tions are all sampled at RESET and are not dynami- 
cally changeable. If some of these configuration op- 
tions share a pin, such as the 82495XP's SYNC# 
and MEMlDRV, the configuration option must meet 
a specific setup and hold time to RESET's falling 
edge. For the 82495XP, setup time is usually 4 
clocks, and for the 82490XP, setup time is usually 1 
clock. For both parts, the configuration option must 
be held until RESET is detected low. 


elK 


RESET 


Conflg )()OC()()()(:>OE===1'==fY'IfY\ 


Setup 
Hold 
240956-10 


Figure 5-2. Configuration 
Input Sampling 
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5.2.1 MEMORY BUS MODES 


The 82495XP/82490XP may be configured to have 
a clocked or strobed memory bus. Memory bus 
mode is selected by the 82490XP MSTBM pin (same 
as MClK 
pin). If MSTBM is strapped high, the 
82490XP's operate in strobed mode. If MSTBM is 
toggling, ie it is connected to the memory bus clock, 
the 82490XP operates in clocked mode. MClK need 
not be synchronous to ClK. 


5.2.2 SNOOPING MODES 


The 82495XP/82490XP supports three snooping 
modes: synchronous, clocked, and strobed.'Snoop- 
ing mode is selected by the SNPMD (same as 
SNPClK) pin. If SNPMD is low the 82495XP snoops 
synchronously. If SNPMD is high the 82495XP 
snoops in strobed mode. If SNPMD is toggling, 
clocked mode is selected and SNPMD becomes a 
snoop clock source, SNPClK, which clocks in the 
snoop requests. 


These three snooping modes only alter the way the 
memory bus controller may initiate a snoop request 
to the 82495XP. The 82495XP response is always 
synchronous to the CPU CLK. 


5.2.3 
BUS DRIVERS 


The 82495XP/82490XP provide 2 types of memory 
bus drivers: High capacitance drivers and low capac- 
itance drivers. The high capacitance drivers are se- 
lected by driving both the 82495XP and 82490XP 
MEMLDRV pins low at RESET.Similarly, the low ca- 
pacitance drivers are selected with MEMLDRV high. 


With C490LDRV the 82495XP also provides two 
types of drivers when driving the 82490XP's. Refer 
to the interface document to determine C490LDRV. 


5.2.4 
STRONG/WEAK 
WRITE 
ORDERING 


If the 82495XP pin WWOR# is sampled low at 
RESET, the 82495XP enforces weak write-ordering. 
If sampled high, the 82495XP enforces strong write- 
ordering. 
Strong 
write-ordering 
prevents 
the 
82495XP from completing a write cycle that would 
go to 'M' state if a posted write is pending (has not 
been granted the bus with BGT#). By doing this, 
strong ordering ensures that write cycles from the 
CPU are written to memory in the same order that 
they appear in the i860 XP CPU program. 


5.2.5 
1860TM 
XP CPU PFLD SUPPORT 


The i860 XP microprocessor executes PFLD (Pipe- 
lined Floating-Point Load) instructions to implement 
special data handling, typically for vector operations. 
This instruction allows loading of data through a 
FIFO pipeline, to hide memory latency. The i860 XP 
CPU does not cache data returned by a PFLD cycle. 


The 82495XP can be configured to decode the 
i860 
XP 
microprocessor's 
PFLD 
cycles. 
The 
82495XP supports 3 operational modes for PFLD 
cycle decoding: 
Mode # 1. PFLD cycles are cached in the 82495XP. 
This mode is used in applications that 
can fit entirely in the 82495XP/82490XP 
cache. The 82495XP treats PFLD cycles 
as normal read cycles. 
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Mode # 2. PFLD cycles are not cached 
in the 
82495XP, without an external PFLD ex- 
tension FIFO. 
This mode is used when applications are 
too large to fit in the 82495XP/82490XP 
cache. The 82495XP treats PFLD cycles 
as noncacheable, using the same proto- 
col as cycles with PCD= 1 (if data is al- 
ready cached, it will be supplied from the 
cache). 
Mode #3. PFLD cycles not cached in the 82495XP, 
with an external PFLD extension FIFO. 


This mode allows the PFLD FIFO to be 
extended beyond the three stages built 
into the i860 XP CPU by adding external 
FIFO hardware. The 82495XP, treats 2 
PFLD cycles in the same manner as its 
treatment of LOCKed cycles (all cycles 
go to the bus, even if data already pres- 
ent in cache). To support the external 
FIFO, the 82495XP identifies PFLD cy- 
cles by asserting its FPFLD output. For 
proper operation, data which can be ac- 
cessed by PFLD must never be in the 
cache in the Modified state, and software 
must be aware of the length of the com- 
bined PFLD pipeline. Because this mode 
is not software transparent, it must be 
used with extreme care. 


The choice of PFLD mode is largely application de- 
pendent. The PFLD mode of the 82495XP is select- 
ed by configuration pins FPFLDEN and NCPFLD#, 
which are sampled at RESET. FPFLDEN shares a 
pin with FPFLD, and NCPFLD# shares a pin with 
FLUSH#. Depending on the PFLD mode, data for 
reads will either be supplied to the CPU from the 
82495XP, or from the memory bus. Table 5-3 sum- 
marizes, the 82495XP's support for i860 XP CPU 
PFLD cycles. 
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Table 5-3. 82495XP PFLD Modes 


Mode# 
FPFLDEN 
NCPFLD# 
Data Supplied From 
LIne Fill 


[J] 
[SI 
[E] 
[M] 
on [J] 


1 
0 
1 
MEMBUS 
CACHE 
CACHE 
CACHE 
Yes 


2 
0 
0 
MEMBUS 
CACHE 
CACHE 
CACHE 
No 


3 
1 
1 
MEMBUS 
MEMBUS 
MEMBUS 
MEMBUS 
No 


X 
1 
0 
Illegal Mode 


5.3 
82490XP Bus Configuration 


The 82490XP needs to be configured so it knows to 
drive 4 or 8 MDATA lines and whether it should do 4 
or 8 memory transfers per line fill. This is done 
through the MX4/MX8# 
and the MTR4/MTR8# 
configuration inputs. For a given line ratio (memory 
bus line size / CPU line size), they should be sam- 
pled as follows: 


Table 5-4. MX/MTR Configurations 


Line 
MX4/ 
MTR4/ 
Membus 
CPUbus 
Ratio 
MX8# 
MTR8# 
I/O 
I/O 


1 
1 
1 
4 
4 


2 
1 
0 
4 
4 


2 
0 
1 
8 
4 


4 
0 
0 
8 
4 


1 
0 
1 
8 
8 


2 
0 
0 
8 
8 


5.3.1 82490XP PARITY CONFIGURATION 


A 82490XP may be designated as a parity device. 
This is done by strapping the PAR# pin low. In this 
configuration CDATA[0:3] are used to store 4 parity 
bits, and CDATA[4:7] are used as 4 bit enables. The 
four bit enables allow the writing of individual parity 
bits. 


Every mode and 
configuration 
of 
a 
non-parity 
82490XP may be used and selected on the parity 
82490XP device. The 82490XP parity configurations 
are as follows: 


Table 5-5. Parity Configurations 


Cache 
Memory 
Number 
82490XP 


Size 
Bus 
of Parity 
I/O bits 
Width 
Devices 
(CPU/Mem) 


256K 
64 
2 
4/4 


516K 
64 
2 
4/4 


512K 
128 
2 
4/8 


5.3.2 CPU 82490XP ADDRESS 
CONFIGURATIONS 


The 82490XP Address inputs (A) are multiplexed to 
the CPU address lines (CA) according to the cache 
size: 


Table 5-6. 82490XP Address Connections 


82490XP Address Pins 
Size 
A15 
A14 
A13 
A12 
A11 
A10 
A9 
A8 
A7 
A6 
A5 
A4 
A3 
A2 
A1 
AO 


256K 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
Vss 
Vss 
16 
15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 


512K 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
CA 
Vss 
17 
16 
15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 


NC - 
No Connect. 
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V 


Figure 
6·1. Memory 
Bus Controller 
Interface 
Model 


Figure 6·1 shows 
the memory 
bus controller 
(M BC) 
interface 
model. The memory bus controller 
interfac- 
es to the 
i860 
XP CPU, 82495XP, 
82490XP, 
and 
memory 
bus. The MBC interface 
was defined 
with a 
minimal 
set of assumptions 
as to the memory 
bus 
implementation. 
The chipset was designed 
to enable 
flexibility 
in the design of a memory 
bus and control- 
ler. 


The 82495XP 
requests 
control 
of the memory 
bus 
by signalling 
the memory 
bus controller. 
The memo- 
ry bus controller 
is responsible 
for 
arbitrating 
and 
granting 
the bus to the 82495XP. 
Once granted, 
the 
memory 
bus controller 
is responsible 
for' executing 
the requested 
cycle, snooping 
the other caches, 
and 
ending 
the 
cycle. 
The 
82495XP 
supports 
different 
modes of snooping, 
different 
modes of memory 
bus 
operation, 
and various 
special 
cycles. 
Memory 
Bus 
Controller 
design 
dictates 
which 
of these 
features 
are used, and exactly 
how they are used. 
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6.1 
Cycle Attribute and Progress 


CADS"', SNPADS. 
COTS- 


Cycle Request 
SGT- 
KWENO-(ATIRIB: WKEN-. WRO-) 
SWENIl't(ATIRIB: 
WWB/wr_. 
DRCTW-) 
CN.•.- 


CRDYiIII 


Cycle Progress 
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Figure 6-2. Cycle Attribute 
and Progress 
Signals 


CADS# 
indicates the start of the cycle address 
phase. CDTS# 
tracks CADS# 
and indicates the 
start of the cycle data phase. For READ cycles it 
indicates that starting in the next ClK the CPU data 
bus is in read mode under the control of the MBC 
until the last BRDY#. In Read cycles, if the MBC 
already owns the CPU data bus, CDTS# will be actio 
vated with CADS#. For AllOCATE 
cycles the MBC 
does not need the CPU data bus, therefore COTS# 
is activated together with CADS#. 


For Write cycles COTS# indicates that the 1st piece 
of data is available on the memory bus. For write- 
back cycles COTS# indicates that all data is avail- 
able (write-back buffer or snoop buffer loaded with 
correct write-back data). 


As a response to the cycle request, the memory bus 
controller responds with cycle progress signals. All 
cycle progress signals are sampled ONCE in specif- 
ic windows and then ignored until CRDY# of the 
corresponding cycle. BGT# indicates a commitment 
by the memory bus controller to complete the cycle 
execution on the memory bus. Up until this point the 
82495XP owns the cycle. This means that interven- 
ing snoop-write-backs will abort it and the 82495XP 
re-issues the cycle to the MBC. There is only one 
case where the 82495XP will issue a new, not a re- 
issued, cycle; if the original CADS# operation is a 
write-back cycle, and the interrupting snoop cycle 
hits that write-back buffer, then the subsequent 
CADS# will be for a completely new cycle (not a re- 
issuing of the interrupted CADS# operation). 
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After BGT# the memory bus controller owns the cy- 
cle. The 82495XP assumes the cycle will terminate 
and will not re-issue it on snoop-write-backs. Follow- 
ing BGT# comes KWEND# which indicates that the 
cacheability window is closed and that the 82495XP 
can sample MKEN#, MRO# attributes. Those indi- 
cate to the 82495XP cacheability and read-only re- 
spectively. These attributes can be determined by 
decoding the 82495XP address. Based on those at- 
tributes 
the 
82495XP executes 
AllOCATIONS, 
Line-fills, Replacements, etc. 


Following KWEND#, SWEND# is activated. It indi- 
cates 
that 
the 
Snoop Window is closed. 
The 
82495XP samples MWB/WT # and DRCTM# attri- 
butes. These attributes are determined by snooping 
the other caches in the system. At this point the 
82495XP updates its TAGRAM state related to the 
line access in progress. 


Lastly the MBC issues CRDY#, which indicates to 
the 82495XP the end of the transaction data phase. 


The 82495XP allows memory bus pipelining by pro- 
viding CNA# which allows the MBC to request a 
new address phase before the conclusion of the cur- 
rent data phase. The 82495XP supports a 1 level 
deep address pipeline on the Memory Bus. 


6.2 
Snoop Operations 


The 82495XP provides the capability of snooping 
operations on the memory bus to ensure cache con- 
sistency. A snoop operation consists of two phases: 
1) initiation phase and 2) response phase. 


< Initiation> 
< Response> 
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Figure 6·3. 82495XP 
Snooping 
Operations 


During the initiation phase the MBC provides the 
82495XP with the snoop address information. During 
the response phase the 82495XP provides the 
snoop status information. 


82495XP 
Cache Controller/82490XP 
Cache RAM 


NOTE: 
The 82495XP samples the SNPClK[SNPMDj 
sig- 


nal at the falling edge of RESET to determine the 
snoop 
mode. 
If a rising edge 
occurs 
on 
the 
SNPClK[SNPMDj 
after RESET has gone inactive, 


clocked 
mode will be selected. Systems using 
stobed or synchronous mode must ensure that no 
rising edge occur on SNPClK[SNPMDj 
after RE- 


SET has gone inactive. 


Figure 6-4 shows the strobed method of snoop ini- 
tiation. The memory address, SNPNCA, SNPINV, 
and MBAOE# are latched with the falling edge of 
the SNPSTB#. If MAOE# is sampled active (Iow), 
the SNPSTB# will not cause a snoop. The snoop 
initiation is recognized by the 82495XP, is synchro- 
nized in the next clock, and causes a snoop in the • 
following clock. 
,--------- 


6.2.1 
SNOOP 
INITIATION 
PHASE 


The 82495XP provides three modes for initiating 
snoops: 
1. Strobed: the falling edge of SNPSTB# is used. 
2. Clocked: SNPSTB# is sampled with SNPClK. 
3. Synchronous: SNPSTB# is sampled with ClK. 


These three snooping modes are configured as fol- 
lows: 
1. Strobed: The SNPClK[SNPMDj 
signal must be 
strapped high. 


2. Clocked: The SNPClK[SNPMDj 
signal must be 
connected to the snoop clock source. 


3. Synchronous: The 
SNPClK 
[SNPMDj 
signal 
must be strapped low. 


,\-_---£..'_---£..!_----J! 
SNPSTB# 


SNPINV 


SNPNCA 


WSETO-IO 


WTAGO-II 


WCfAO-6 


WBAO[# 


WAO[# 


ClK 


SNPCYC# 


< SNOOP> 


,'-- 
....1' 


t.4THIT# 
,\\\ 
\\\\ 
WHITW# 
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SNPBSY# 


Figure 6-4. Strobed 
Snoop Mode 
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Figure 6-5 shows the clocked method of snoop ini- 
tiation. The memory address, SNPNCA, SNPINV, 
and MBAOE# are latched with the rising edge of 
SNPCLK when SNPSTB# 
is first sampled low. 
SNPSTB# must be sampled high for at least one 


SNPCLK in order to rearm for another snoop. If 
MAOE# is sampled active (Iow), the SNPSTB# will 
not cause a snoop. The snoop initiation is recog- 
nized by the 82495XP, is synchronized in the next 
clock, and causes a snoop in the following clock. 


x.; 
''----..J/ 
, 
/ 
',-~/ 
SNPClK 
\ 
I 


SNPSTB# 
,\.._+--J.' 
..,j!I--__ 
....J! 


SNPINV 


SNPNCA 


~SETO-l0 


t.4TAGO-ll 


t.4CfAO-6 


t.4BAOE# 


ClK 


.........~ 


I 
SNPCYC# 


<SNOOP> 
, 
/ 


t.4THIT# 
\\SS 
\SSs 


SNPBSY# 
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Figure 6-5. Clocked 
Snoop Mode 
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Figure 6-6 shows the synchronous snoop mode. The 
memory address, SNPNCA, SNPINV, and MBADE# 
are latched with the rising edge of ClK 
when 
SNPSTB# is first sampled low. SNPSTB# must be 
sampled high for at least one ClK in order to rearm 


for another snoop. If MADE# 
is sampled active 
(Iow), the SNPSTB# will not cause a snoop. The 
snoop initiation is recognized by the 82495XP, and 
causes a snoop in the next clock. 


CLK 


•• 


SNPSTB# 


SNPINV 


SNPNCA 


WSETO-10 


WTAGO-11 


WCFAO-6 


WBAOE# 


WAOE# 


< SNOOP> 


SNPCYC# 


WTHIT# 


WHITW# 
\SSS 


SNPBSY# 
\SSS 
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Figure 6-6. Synchronous 
Snoop Mode 
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6.2.2 
RESPONSE 
PHASE 


The 
snoop 
response 
phase 
consists 
of two 
parts: 


1) 82495XP 
state indication 
2) 82495XP 
snoop 
pro- 


cessing 
completion. 
The 
response 
phase 
is At- 


WAYS 
synchronous 
with 
the 
CPU 
ClK. 
The 
82495XP 
state indication 
is presented 
on MHITM # 
and 
MTHIT # 
and 
remains 
stable 
until 
the 
next 
snoop. 
These 
signals 
indicate 
the 
state 
of 
the 
82495XP 
line just prior to the snoop 
operation. 
The 
memory 
bus controller 
can predict 
the final state of 
the 82495XP 
line knowing 
the initial 
state 
and the 
SNPINV 
and SNPNCA 
inputs. 
The 
snoop 
comple- 
tion information 
is determined 
by the SNPBSY # out- 
put. The 
SNPBSY # 
output 
inactive 
indicates 
that 
the 82495XP 
is ready to accept 
another 
snoop 
cy- 
cle. 


Figure 6-7 shows 
the 82495XP 
response 
to snoops 
without 
invalidation. 
The first snoop is to a line which 
is not currently 
stored 
in the cache. 


Figure 6-8 shows 
the 82495XP 
response 
to snoops 
with invalidation. 


The 
SNPBSY # 
signal 
will be activated 
for one 
of 


two 
reasons: 
1) a snoop 
hit 
to 
a 
modified 
line, 
SNPBSY # will remain active until the modified 
line 


intelQP 


has 
been 
written 
back. 
2) a Back 
invalidation 
is 
needed 
and there 
is a back invalidation 
in process. 


The SNPBSY # minimum 
active time is two ClK 
pe- 


riods. This allows 
an external 
logic to trap-hold 
ac- 


tive SNPBSY # using ClK. 
The external 
logic must 
first 
look 
for active 
SNPCYC# 
and then 
trap-hold 


SNPB&Y#. 


6.2.3 
PIPE LINED 
SNOOPS 


The 82495XP 
allows 
the memory 
bus controller 
to 


pipeline 
snoop 
operations. 
The 82495XP 
allows 
the 


next 
snoop 
address 
to 
be supplied 
and 
the 
next 


snoop requested 
before the last snoop has complet- 


ed. 


There are a set of rules which govern 
the operation 


of pipelined 
snoops. 
These 
rules are as follows: 


(1) For strobed 
mode snoops, the memory 
bus con- 


troller 
cannot 
cause 
a second 
falling 
edge 
of 


SNPSTB# 
until 
after 
the 
falling 
edge 
of 


SNPCYC#. 


(2) For clocked 
mode snoops, 
the memory 
bus con- 


troller 
cannot 
cause 
a second 
falling 
edge 
of 


SNPSTB# 
to be sampled 
by SNPClK, 
until after 


the falling edge of SNPCYC#. 


~ 
I stat.~ 
E, S state, 


SHPSTSjJ 
\'-- 
__ 
J! 
" 
. 


lot State 
\ 
/ 
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Figure 6-7. Snoops without Invalidation 
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Figure 6-8. Snoops with Invalidation 
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Figure 6-9. Fastest Possible Synchronous Snooping 
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Figure 6-10. Fastest Possible Asynchronous 
Snooping 


(3) For synchronous mode snoops, the memory bus 
controller cannot cause a second falling edge of 
SNPSTB*" to be sampled by ClK, until the ClK 
after SNPCYC*" is active. 


6.2.4 OVERLAPPING SNOOPS WITH MEMORY 
BUS CYCLES 


The 82495XP allows snoops to be overlapped with 
data transfers. The 82495XP divides the memory 
bus cycle into 4 main regions as shown below: 


CRDY# 
CADS# 
BGT# 
SWEND# CRDY# 
CADS# 


L-..:...1 ----L---=2~1=u 
4 
I 
1 


Region 1 is after a previous memory bus cycle (i.e. 
after CRDY*,,) and before the new memory bus cy- 
cle starts (before CADS*,,).A snoop in this region is 
looked up immediately and serviced immediately. 


Region 2 is after a memory bus cycle has started 
(CADS*,,)but before the 82495XP has been granted 
the bus (BGT*,,).A snoop in this region is looked up 
immediately and serviced immediately. CADS*" is 
re-issued for the aborted cycle once the snoop com- 
pletes. 


Region 3 is after the 82495XP has been granted the 
bus and before the SWEND*" is completed. A snoop 
in this region has its lookup blocked until after the 
SWEND*". After SWEND*", the snoop response is 
given, but no write-back will be initiated until after 
CRDY*". 


Region 4 is after SWEND*" and before CRDY*". A 
snoop in this region is looked up immediately but 
serviced after CRDY*". This snoop is logically treat- 
ed as if it occurred after CRDY*" (snoop hits to mod- 
ified data will schedule a write-back which will be 
executed after the conclusion of the current memory 
bus cycle). Note that the result of the snoop 
MHITM#, 
MTHIT# 
will be available immediately 
with the look-up. 
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6.2.5 SNOOP INTERLOCK 


The 82495XP uses two interlock mechanisms to en- 
sure that Snoops are identified within the proper re- 
gion. The first interlock ensures that once a BGT# 
has been given snoops are blocked until after 
SWEND#. The second interlock ensures that once 
a snoop has been started BGT# cannot be given 
until after the snoop has been serviced. 


Figure 6-11 shows how once the 82495XP sees a 
BGT# it blocks all snoops until after SWEND#. If a 
snoop has been initiated, and no SNPCYC# has 
been issued before BGT# assertion, the snoop has 
been blocked. 


Figure 6-12 shows a snoop occurring before BGT# . 
Once the 82495XP has honored a snoop, the 
82495XP, depending on the result of the snoop, may 
ignore BGT# 
until the snoop is serviced. The 
82495XP will always ignore BGT# when SNPCYC# 


is active. If the snoop result is a hit to a modified line 
(MHITM# active), the 82495XP will ingore BGT# as 
long as both SNPBSY# and MHITM# remain ac- 
tive. In this case, it is the memory bus controller's 
responsibility to hold BGT# until SNPBSY# goes 
inactive or reassert it after SNPBSY# becomes in- 
active. If the snoop result is not a hit a modified line 
(MHITM# inactive), the 82495XP is capable of ac- 
cepting BGT# even when SNPBSY# is active. This 
allows the memory bus controller to preceed with a 
memory bus cycle by asserting BGT# while the 
82495XP is performing back-invalidations. 


These two interlock mechanisms provide a flexible 
method of ensuring predictable handling of over- 
lapped snoops. 


NOTE: 
Even when snoops are delayed, address latching is 
performed with SNPSTB# activation. 
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Figure 6-11. BGT# Blocking a Snoop 


ClK 


SNPCYC- 


__ 
--_--_---_~ 
<SNOOP> 
\.J...J~--------~----------- 


I 


SNPBSY- 
x 
: 
K~~~\~ 
-- __--~I 


I 


I 
I 
I 
X: 
BGT- :IGNOREO 


240956-22 


Figure 6·12. Snoop Occurring before BGT# 
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6.2.6 SNOOPS CONCURRENT WITH LINE FILL 
CYCLES 


During snoops concurrent with line-fills/allocates, 
the following responsibility boundaries must be full- 
filled in order to insure data consistency: 


• If a snoop happens before BGT#, more precisely 
if SNPCYC# is active before BGT#, it is the sys- 
tem's responsibility not to return stale data within 
the line-fill/allocation. 
• If a snoop happens after BGT#, more precisely if 
SNPCYC# 
is 
active 
after 
BGT#, 
then 
the 
82495XP insures data consistency by providing 
interlocks with the CPU which avoid caching of 
stale data. 


6.3 Memory Bus Controller Interface 
Rules 


To begin a cache cycle, the 82495XP outputs the 
CADS# signal. The cache address and other cycle 
parameters 
are 
guaranteed 
to 
be 
stable 
with 
CADS# assertion. These parameters are guaran- 
teed to be stable until CNA# or CRDY# of that cy- 
cle. After CNA# or CRDY# these parameters are 
undefined. 


Either during, or after CADS# the CDTS# signal is 
asserted. Data is guaranteed to stable with CDTS# 
assertion, or the data path is available. 


82495XP Cache Controller/82490XP 
Cache RAM 


BGT# and CRDY# are required for all (non-snoop) 
cycles. KWEND# and SWEND# are only required 
for those cycles which sample them. 


Once a signal has been sampled, it is a "don't care" 
until CRDY# of that cycle. Additionally, these sig- 
nals plus the attributes MRO#, 
MKEN#, 
MWB/ 
WT#, and DRCTM# need only follow setup and 
hold times when they are being sampled. 


For pipelined cycles, the cycle attributes (BGT# , 
KWEND#, ... ) will only be sampled after CRDY# 
of the previous cycle. 


Note that there are many other rules that govern 
when signals may be asserted in relation to one an- 
other. These may be found in the specific pin de- 
scriptions of each signal in chapter 7. 


Snoop-Write-Back cycles are a subset of the normal 
cycles. Snoop-Write-Back cycles are requested as a 
consequence of snoop hits to Modified lines. Those 
are intervening cycles and are requested by activat- 
ing SNPADS# instead of CADS#. For those cycles, 
the 82495XP only samples the CRDY# response. 
The 82495XP assumes that the memory bus con- 
troller owns the bus to perform the intervening write- 
back (restricted back-off protocol) and that no other 
agents will snoop this cycle. Also the 82495XP will 
ignore ,CNA# during Snoop-Write-Backs. 


182~95 
Output 
Signals I 


CAOS* 
COTS* 
'\. \.- -------------- -'.,. 
f 
f 
f 
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Time 


Figure 6-13. Cycle Progress 
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82495 Output Signals 
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I 82495 Input Signals I 
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Tim. 


CRDY* 


6.4 
LOCK# Protocol 


Figure 6-14. Cycle Progress 
for Snoop Cycles 


The 82495XP provides a LOCK signal for the memo- 
ry bus called KLOCK11. KLOCK11 is generated by 
the 82495XP whenever the CPU generates the 
LOCK11 signal. KLOCK11, like the other cycle attri- 
butes, is valid with CADSII assertion. 


When the 
CPU generates 
a 
LOCK cycle, the 
82495XP always generates a bus cycle. LOCK cy- 
cles are non-cacheable to both the 82495XP and 
CPU, so the information is passed through the 
82490XPs to the CPU with BRDYs generated by the 
MBC. If the LOCKed read cycle is a hit in the 
82495XP; the 82495XP ignores the data that it is 
receiving and supplies data from the 82490XP array / 
(in accordance with the BRDYs supplied by the 
MBC). Locked writes are posted like any other write. 
LOCKed cycles, 
both 
reads and writes, 
never 
change the 82495XP tag state. 


During a LOCKed cycle, the MBC must prevent oth- 
er masters from snooping the 82495XP. Specifically, 
the MBC must prevent SNPSTBII between BGTII 
of the first LOCKed transfer, and SWENDII of the 
last LOCKed transfer. 


6.4.1 
SEMAPHORE 
CONSISTENCY 


The 82495XP is optimized for high performance. In 
order to achieve this high performance the 82495XP 
overlaps back-invalidations of the primary cache 
with other activities. Normally this overlapping does 
not cause any problems. However, in systems where 
locked semaphores are used to insure mutual exclu- 
sion of processor access to shared data, special 
care is needed to insure that once the semaphore 
has been obtained, that all back-invalidation of the 
primary cache have been completed. 
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The 82495XP provides two write-ordering modes: 
weak write-ordering and strong write-ordering. In the 
weak write-ordering mode, the 82495XP does NOT 
guarantee that writes will be issued on the memory 
bus in the same order that they were issued by the 
processor. In the strong write-ordering mode the 
82495XP, together with the CPU, guarantees that 
writes will be issued on the memory bus in exactly 
the same order as they were issued by the proces- 
sor. 


The weak write-ordering mode fits the weak-consist- 
ency model and relies on synchronization events 
visible to the hardware (locked cycles) to insure cor- 
rect program execution. The strong write-ordering 
mode fits the Processor-Consistency model and in- 
sures data consistency for most applications. 


The i860 XP CPU uses weak consistency during nor- 
mal operations and strong consistency for LOCK11 
operations. Thus special care is needed to ensure 
that all data accessed within a CRITICAL REGION 
surrounded by a locked semaphore is strongly con- 
sistent. 
Therefore, 
the 
Memory 
Bus 
Controller 
(MBC) must obey an additional rule when handling 
locked read cycles. This additional rule is as follows: 
• The MBC must not provide the first BRDY11 of a 
locked memory read until the C5 has finished all 
pending back-invalidations. The MBC must moni- 
tor CAHOLD to determine when all pending back- 
invalidations have completed. 


• The first BRDY11 of a locked memory read must 
be at least three CLKS after the last SNPCYCII, 
and CAHOLD must be inactive. 


NOTE: 
The 
82495XP guarantees write-ordering 
to 
the 
memory bus, but write-ordering from the memory 
bus to the memory is a system responsibility. 
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6.5 Cycle Length 


When CADS# is generated, the 82495XP outputs 
CW/R# and MCACHE#. These signals provide the 
MBC with enough information to determine the type 
of 82495XP cycle. Table 6-1 summarizes the cycle 
types for the 82495XP/82490XP. All line-fills and 
write-backs to the 82495XP/82490XP cache oper- 
ate on the entire length of a cache line. 


In addition to the length of the cycle from the 
82495XP/82490XP, the memory bus controller may 
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need to determine the length of the cycle to the 
CPU. Specifically, for those 82495XP cycles where 
RDYSRC= 1, the MBC must decode the i860 XP 
CPU's W/R#, 
LEN, and CACHE# outputs to deter- 
mine the number of BRDY#s which the MBC will 
provide to the CPU. These signals are captured for 
the current cycle by a user-provided BE latch (see 
Section 7.2 for details). Table 6-2 presents the CPU 
cycle length definitions; see the i860 XP microproc- 
essor Data Sheet (Order #240874) for further de- 
tails. 
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Figure 6-15. Snooping During LOCKed Cycles 


, 
, 
, 


SNPCYCIII r---T'""\. 
, 
J 
, 
,,""-", 


CAHOLD 


BRDYIII 
••, -------, 
•••••G!..!~..•,----- 


EARLIEST 
240956-61 


Figure 6-16. BRDY# Timing for Semaphore Consistency 
Snooping During LOCKed Cycles 
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Table 6-1. 82495XP/82490XP Cycle Determination 


Cycle Type 
CW/RII 
RDYSRC 
MCACHEII 
MKENII 


Posted Write 
1 
0 
1 
X 


Write Backs 
1 
0 
0 
X 


Non-Cacheable Read 
0 
1 
1 
X 


Non-Cacheable Read 
0 
1 
0 
1 


Cacheable Read 
0 
1 
0 
0 


Allocation 
0 
0 
0 
X 


Table 6-2. 1860XP CPU Cycle Determination 


W/RII 
LEN 
CACHE 11 
MKENII 
Cycle Description 
Burst Length 


0 
0 
1 
- 
Non-Cacheable 54-Bit Read 
1 


0 
0 
- 
1 
Non-Cacheable 64-Bit Read 
1 


1 
0 
1 
- 
64-Bit Write 
1 
- 
0 
1 
- 
1/0 and Special Cycles 
1 


0 
1 
1 
- 
Non-Cacheable 128-Bit Read 
2 


0 
1 
- 
1 
Non-Cacheable 128-Bit Read 
2 


1 
1 
1 
- 
128-Bit Write 
2 


0 
- 
0 
0 
Cache Line Fill 
4 


1 
- 
0 
- 
Cache Write-Back 
4 


NOTE: 
If MRO# 
is asserted 
to the 82495XP, 
the effect 
on i860 XP CPU cycle 
determination 
is the same as when 
MKEN# 
= 1. 


6.6 Consecutive Cycles 


Because a 82495XP line can be longer than a CPU 
line, there are circumstances where a read miss will 
be to a line that is currently being filled. If this is the 
case, the 82495XP treats this like a read hit, but 
supplies data after CRDY11 for the line fill. Data is 
supplied from the 82490XP array. 


6.7 CPU/Memory Bus Concurrency 


The 82495XP allows concurrency between the CPU 
and memory buses. CPU bus cycles will either be 
serviced locally by the 82495XP (hits) or require 
memory bus service. Whenever a CPU cycle re- 
quires memory bus service, it will be scheduled to 
run on the memory bus, and CPU bus activity will be 
allowed to continue. 


Examples of concurrency are: 
- 
Snoops and CPU bus operations 
- 
Posted writes with CPU and memory bus opera- 
tions 
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- 
CPU bus operation on the back of long line fills 
(82495XP line longer than the CPU line) 
- 
Allocations 
and replacements with CPU and 
memory bus operations. 


In certain cases, consistency of data and prevention 
of deadlocks preclude concurrency. Problems may 
occur when the current memory bus cycle changes 
the tag state and therefore affects the operation of 
the next CPU cycle request. In those cases the 
82495XP will hold concurrency to ensure data con- 
sistency. Handling of those cases is completely 
transparent to the MBC. 


The 82490XP supports two modes of memory bus 
operation: clocked mode and strobed mode. In 
clocked mode, memory bus signals are sampled by 
the 82490XP on rising edges of MClK. Similarly, 
memory bus data and signals are output by the 
82490XP with respect to MClK (or MOClK) rising 
edge transitions. 


In strobed mode, memory bus signals are sampled 
or output with respect to rising and falling edges of 
other signals. Strobed mode has the advantage of 
not requiring setup and hold times to a ClK or MClK 
edge. 


6.8 Memory Bus Modes 


Clocked Memory Bus Mode 


ClK~ 


WClK~ 
I 
I 
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1- 
~ 
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Strobed Memory Bus Mode 
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Figure 6-16a. Clocked and Strobed 
Mode Sampling 


6.8.1 CLOCKED MODE 


In clocked mode operation MClK is used to refer· 
ence 
the 
signals 
MDATAO-MDATA7, 
MSEl#, 
MFRZ#, MZST#, MSRDY#, and MEOC#. Clocked 
mode will be selected if the 82490XP detects a 
clock at its MClK input after RESET. MClK need 
not have any relation to ClK. If this is the case, the 
memory bus is said to be operating in "clocked 
asynchronous" mode. If MClK = ClK, the memory 
bus is operating in "clocked synchronous" mode. If 
MClK x N = ClK (where N = 2, 3, 4 ... ), the 
memory bus is operating in "clocked divided syn- 
chronous" mode. These three clocked modes, asyn- 
chronous, synchronous, and divided synchronous, 
are not differentiated by the 82490XP. 


MOClK controls a transparent latch at the 82490XP 
data output pins. If a clock is provided at this input, 
data is latched with MOClK going low. This clock is 
available in clocked mode only. MOClK allows the 
system to provide a greater MDATA hold time by 
skewing MOClK from MClK. If MOClK is tied high, 
MDATA is driven from MClK. 


6.8.1.1 Synchronous 
Clocked Mode 


In synchronous clocked mode MClK 
= ClK. This 
means 
the 
CPU clock 
is 
used 
for 
82495XP, 
82490XP, and the memory bus. A synchronous 
memory bus allows memory to communicate with 
the 
82495XP 
without 
synchronizers 
since 
the 
82495XP runs with ClK. With a synchronous design, 
however, high clock frequencies must be routed to 
all parts of a system with minimal skew. As with 
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any synchronous design, synchronous memory sys- 
tem and memory bus controller must be redesigned 
when future speed upgrades are required. 


6.8.1.2 Asynchronous 
Clocked Mode 


In asynchronous clocked mode, MClK 
is not the 
same frequency as ClK. 
Some memory signals, 
since they reference MClK, must be synchronized 
to ClK to communicate with the 82495XP. For ex- 
ample, when a cycle completes, the memory system 
asserts a signal, driven from MClK, to the memory 
bus controller which will be synchrbnized to ClK to 
become CRDY#. This is because CRDY# is syn- 
chronous to ClK and not MClK. 


Asynchronous mode allows the rest of the system to • 
run at a lower frequency than the CPU ClK. Not only 
does this simplify system design, but allows the de- 
. 


signer to place hooks to allow the same design to 
scale easily to a higher frequency. If all the features 
of the 82495XP are used properly, an asynchronous 
memory design does not have to incur much syn- 
chronization penalty. For example, MEOC# 1s syn- 
chronous to the memory environment (MClK). This 
allows the memory system to end the current cycle 
and start the next before CRDY# is synchronized in 
the CPU environment. 


6.8.1.3 Divided Synchronous 
Clocked Mode 


Divided synchronous clocked mode is a subset of 
synchronous clocked mode. It allows two things to 
happen: One, the memory system is capable of 
communicating with the 82495XP without synchroni- 
zation. Two, a slower frequency clock may be routed 
around the system. 


Divided synchronous mode still requires clock skew 
restrictions. It also carries the same scalability draw- 
backs that full synchronous mode does. 


6.8.2 STROBED MODE 


Strobed mode is configured on the 82490XP by 
strapping MClK high. In strobed mode: 
- 
MDATAO-MDATA7 are sampled with respect to 
edges of MEOC#, MISTS, and MOSTS. 
- 
For write cycles, MFRZ# 
is sampled when 
MEOC# goes active. 
- 
MZST# 
is sampled when MSEl# 
is inactive, 
and 
is 
latched 
when 
MSEl# 
goes 
active. 
MZST# is also sampled for the next operation 
when MSEl# 
is active and MEOC# goes active. 
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6.9 
Memory Bus Operation 


All data is handled by the 82490XP cache RAMs. 
The 82495XP instructs the 82490XP whether to use 
the data array or buffers, and specifically which buff- 
er to use. The MBC is responsible for bursting data 
in and out of the 82490XP's, in and out of the CPU 
during miss cycles, and indicating when the opera- 
tion 
is 
finished. 
Communication 
between 
the 
82490XPs and memory bus may be done in a 
clocked mode or strobed mode. See the Memory 
Bus Modes section for more details. 


A 82490XP has 4 memory buffers. It has 2 memory 
cycle buffers, one write-back buffer, and one snoop 
buffer. Each buffer is capable of holding an entire 
82495XP line of the longest configurable length. 


The memory cycle buffers of the 82490XP are used 
for 
posting 
writes 
and 
holding 
data 
during 
82495XP/82490XP line-fills. The write-back buffer is 
used for holding data from a cache replacement. 
This data is ready to be written out, and the write- 
back buffer is snoopable. The snoop buffer is used 
to hold modified data that has.been hit by a snoop. 
Since snoop hits are the highest priority cycle, this 
buffer will be emptied before any other cycle or 
snoop request begins. 


6.9.1 
82490XP 
BUFFERS 
AND MUXES 


The four 82490XP memory buffers are all multi- 
plexed (muxed) to the memory bus. The mux is used 
to select which buffer is on the bus, and specifically 
which slice of that buffer is on the bus. MBRDY11 
assertion increments a counter for this mux which 
selects the next slice of that buffer. 


The counter used to increment through the buffer . 
slices is called the memory burst counter. The mem- 
ory burst counter follows the CPU burst order de- 
pending on the subline address of the initial slice. 
Once the MBC is finished with a buffer, MEOCII is 
asserted to switch the mux to the next buffer to be 
used. MEOCII will also reset the counter and latch 
the last slice of data. 


On the CPU side, the 82490XP contains a CPU buff- 
er and mux. The CPU buffer captures data from the 
appropriate,memory buffer or 82490XP array to feed 
it to the CPU. The mux selects which slice is muxed 
to the CPU bus. The counter for this mux is incre- 
mented with BRDY11. 


The 82490XP array contains a mux that selects 
which way, based on the MRU algorithm, will be 
read during hit cycles. This mux is used during write 
cycles to write to the correct way. 
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6.9.2 
MEMORY 
CYCLE 
BUFFERS 


There are 2 memory cycle buffers in the 82490XP. 
They are used for line-fills, allocates, and memory 
writes. The buffers are 64-bits wide (per 82490XP) to 
support 8 transfers with 8 memory bus I/O pins 
(maximum configuration). The 82490XP alternates 
use of these buffers. When one buffer has a posted 
write or is being used for a memory read, the other 
one is available for the next cycle. 


During allocation cycles, read for ownership may be 
implemented by using the MFRZII signal. If MFRZII 
is sampled active during the write cycle, the memory 
cycle buffer will freeze the write data in the buffer so 
the subsequent line-fill fills around it. This way the 
write cycle need not be written to memory. The line 
must be tagged as modified. 


6.9.3 
WRITE 
BACK 
BUFFER 
AND SNOOP 
BUFFER 


The write back buffer and snoop buffer are both 
64-bits to handle the maximum 82495XP line length. 
The write back buffer is used when replaced data 
must be written back to main memory (including 
FLUSH and SYNC cycles) and the snoop buffer is 
used when data must be written out from a snoop 
hit. 


Before a line fill begins, the 82495XP checks to see 
if it must remove a modified line to make room for 
the new line. If so, the modified line is placed in the 
write back buffer and the new line is filled into a 
memory cycle buffer. Should the line-fill be selected 
as non-cacheable, both buffer contents are discard- 
ed and the 82490XP array value remains as it was 
before the line-fill. 


There is no need to run the line-fill, replacement 
(write back), FLUSH, or SYNC cycles contiguously. 
If a snoop is requested between the two cycles, the 
write back buffer is snoopable, and data can be writ- 
ten directly out of it if need be. 


6.9.4 
MEMORY 
BUS CONTROL 
SIGNALS 


The main memory bus control signals are MSELII, 
MEOCII, 
MBRDYII, 
and CRDYII. 
These signals 
control the 82490XP data path, buffers, and muxes. 


MSELII selects which 82490XPs are being used in 
the current cycle by qualifying the MBRDY11 signal. 
If MSEL11 is inactive, MBRDY11 is not recognized for 
that 82490XP. MSELII 
is also used to reset the 
memory burst counter. If MSEL11 goes inactive, the 
counter is initialized to its starting value. This is use- 


ful for aborted/restarted cycles. MSEL# may remain 
active for many or all cycles. MSEL# must, howev- 
er, be inactive sometime after RESETto initialize the 
memory burst counter for the first time. 


MEOC# is asserted by the MSC to end finish with 
the current buffer, and switch the memory bus to the 
next buffer to be used. MEOC# latches in the last 
piece of data and resets the memory burst counter 
before switching to the new buffer. 


MSRDY# is used to increment the memory burst 
counter to select the next slice of data. This will 
strobe data out of the 82490XP (write cycles) or load 
data into the 82490XP (read cycles). MSRDY# is 
ignored by the 82490XP if MSEL# is inactive. 


CRDY# finishes the current cycle. Once CRDY# is 
asserted, the 82490XP disposes of the information 
in the buffers used in that cycle, and loads informa- 
tion into the 82490XP array. CRDY# must be as- 
serted on the clock or after MEOC# is asserted for 
a particular cycle. 


6.9.4.1 Memory Burst Counter and Burst 
Latches 


The burst order for replacements, allocates, and in- 
quire cycles is always 0-1-2-3 for a 4 transfer bus 
and 0-1-2-3-4-5-6-7 for an 8 transfer bus. Write 
backs caused by snoop hits can be zero burst based 
(MZST# sampled active), or start at the snoop sub- 
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line address sent by the 82495XP. Memory reads, 
snoops and write throughs start at the address de- 
termined by the CPU and use the following burst 
orders: 


4 Transfers 
8 Transfers 


0-1-2-3 
1-0-3-2 
2-3-0-1 
3-2-1-0 


0-1-2-3-4-5-6-7 
1-0-3-2-5-4-7-6 
2-3-0-1-6-7-4-5 
3-2-1-0-7-6-5-4 
4-5-6-7-0-1-2-3 
5-4-7-6-1-0-3-2 
6-7-4-5-2-3-0-1 
7-6-5-4-3-2-1-0 


6.9.5 82490XP DATA PATH 


An example 82490XP read data path is shown in 
Figure 6-17. The path between the CPU and memo- 
ry bus is a flow-thru' path, not a clocked path. Each 
entire 82495XP cache line of data in the CPU buffer 
is available at the memory buffer with some propa- 
gation delay. Likewise, each entire 82495XP cache 
line of data in the memory buffer is available in the 
CPU buffer with some propagation delay. Data is 
burst into and out of the memory buffer using 
MSRDY# or MISTS/MOSTS. Data is burst into and 
out of the CPU buffer using SRDY#. This means 
there is no synchronization required between memo- 
ry and CPU data paths. 
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Figure 6-17.82490XPRead Data Path 
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To give an example how the path works, during a 
CPU line fill, data may be returned to the CPU in two 
different fashions. One, each time the memory buff- 
er fills a dword, BRDY# may be asserted a clock 
later to burst it back to the CPU. Two, the memory 
buffer can be filled and then BRDY# asserted on 
four consecutive clocks to burst data back to the 
CPU. 


6.9.6 
WRITE 
CYCLES 


There are 3 basic types of write cycles: CPU gener- 
ated write cycles, write back cycles caused by a 
cache replacement, and snoop write back cycles 
caused by a snoop hit. All write cycles, except the 
snoop write back, begin with CADS# assertion. The 
snoop write back cycle begins with SNPADS#. 


6.9.6.1 
CPU Generated 
Write Cycles 


When the CPU begins a write cycle, four things can 
happen to it. One, the CPU write is a hit to a modi- 
fied or exclusive line. In this case the write is termi- 
nated by the cache immediately and invisibly to the 
MBC. 


Two, the write is to a shared location. This type of 
write is posted to the 82490XP memory cycle buffers 
and the cycle is terminated by the cache. If a memo- 
ry cycle buffer is occupied with a write cycle, the 
CPU waits until the previous write completes. The 
write cycle must be written to the memory bus so 
that other copies of the write in other caches be 
invalidated. 


Three, the write is a cache miss. This type of write is 
posted to a memory cycle buffer if the 82490XP is 
not waiting for another posted write to complete. If 
PALLC# is asserted, the write may be turned into an 
allocation. 


Four, the write is a LOCKed write. LOCKed writes 
are posted regardless of the tag state. The write is 
then treated as if it were a miss except that there is 
no Changein the tag state and no allocation allowed. 


6.9.6.2 
Cache Generated 
Write Cycles 


The 82495XP/82490XP will generate a write cycle in 
three situations: a line fill or allocation causing a 
cache replacement, a snoop hit to a modified loca- 
tion, and write backs caused by FLUSH or SYNC. 
Write back caused by FLUSH or SYNC are indestin- 
guishable from write-back cycles caused by replace- 
ment. Cache generated write cycles are the length 
of a cache line. 


Cache 
replacements 
and 
FLUSH/SYNC 
cycles 
cause a line (or two lines if sectored) of cache data 
to be placed in the write-back buffer of the 82490XP. 
If no cycle is pending, CADS# is asserted and the 
data is written out. If a snoop hits the write-back 


2-320 


intel~ 


buffer, the data is written out via SNPADS# like a 
normal snoop hit. The write back is then cancelled 
since the data was written through the snoop hit. 


A snoop hit to a modified location causes a line of 
cache data to be written out to memory. Snoop hits 
are the highest priority cycle and must be serviced 
immediately.A snoop hit to a modified location caus- 
es the snooped line to be written to the snoop buffer 
of the 82490XP. SNPADS# is then asserted and the 
snoop is written out. 


6.9.6.3 
Memory 
Bus Controller 
Responsibility 


The MBC recognizes a write cycle with CADS# and 
CW/R# 
(or 
SNPADS# 
for 
snoop 
cycles). 
If 
MCACHE# is active, the MBC knows the cycle is a 
write back cycle, otherwise it is a CPU-generated 
cycle. 


CPU-generated write cycles are written to the main 
memory bus so that other caches can invalidate 
their copies of this information. The other caches do 
this by snooping with SNPINV active during snoop 
initiation if they detect a write cycle on the bus. 


Once the MBC detects CDTS# active, the data will 
be available for writing in the next clock in the appro- 
priate 82490XP buffer. The MBC should assert 
MSEL# so bursting is enabled, and burst through 
the write using MBRDY# (or MOSTB). MSEL# acti- 
vation also caused MZBT# to be sampled. MZBT# 
must be inactive at this time if the data will be written 
according to CPU burst order. 


Once the write cycle is complete, MEOC# must be 
asserted to end the write cycle and switch to the 
next pending cycle. If this write cycle is turned into 
an allocation, MFRZ# is sampled with MEOC# to 
freeze the write data in the 82490XP. 


MEOC# simply switches buffers from the current 
one in use to the buffer of the next pending cycle. 
CRDY# needs to be asserted to actually end the 
cycle and allow the 82495XP and 82490XP to dis- 
pose of the information. 


6.9.6.4 
Write Allocation 
and Read for 
Ownership 


The 82495XP/82490XP supports write allocation. 
An allocation cycle is a read of a cache line caused 
by a write miss to the same location. In its simplest 
form, a write miss is written to memory, then the 
82495XP requests a line from that same location. 
Meanwhile, the CPU only sees the write cycle. 


Write allocation may only be done if PALLC# is ac- 
tive during CADS# of the write cycle. For the alloca- 
tion to occur, MKEN# must be returned active dur- 
ing KWEND# of the write cycle. The write cycle may 


be an actual write or a "dummy" 
write. Dummy 
writes are write cycles that are terminated in the 
82495XP and 82490XP as if they were normal 
writes, but the data is not actually written to memory. 
This saves a data write to memory. 


During write allocation, the write cycle will progress 
like a normal write cycle except MKEN# must be 
active during KWEND# activation. If the write cycle 
is a dummy write, MFRZ# 
must be used with 
MEOC# so that the line filled data is read around 
the write data into the 82490XP buffer. The line fill 
cycle is like any other line fill cycle except the CPU 
doesn't get any data. If a dummy write was per- 
formed, 
DRCTM# 
must 
be 
asserted 
during 
SWEND# activation to fill the line to the M state, 
and any cache supplying the data must invalidate its 
copy. 


Using dummy write cycles and filling data to the M 
state from another cache or memory is called Read 
For Ownership. This is because ownership is being 
transferred. In read for ownership cycles, memory is 
avoided as much as possible. First, the dummy write 
cycle avoids memory. Second, a line fill is performed 
as a cache to cache transfer with DRCTM# assert- 
ed. All caches were snooped with invalidate to elimi- 
nate their copies. 


For allocation cycles, SWEND# is not sampled for 
the write portion of the allocation. 


6.9.7 
READ CYCLES 


The CPU initiates all read cycles. These are usually 
line 
fills 
to 
the 
CPU 
and 
line 
fills 
to 
the 
82495XP/82490XP. The signal MCACHE# is output 
with CADS# to indicate whether this cycle mayor 
may not be cacheable. If cacheable, MKEN# is re- 
turned by the MBC to ultimately determine cachea- 
bility. 


Read hit cycles are serviced by the cache without 
MBC intervention. The only read cycles seen by the 
MBC (except 1/0 or special) are read misses and 
locked read cycles. 


Read misses cause CADS# to be asserted at most 
two clocks after ADS# of the CPU read cycle. If 
cacheable, as determined from MCACHE#, 
the 
MBC will return 4 BRDYs back to the CPU and 4 or 8 
MBRDYs to the 82495XP/82490XP. If the transfer is 
non-cacheable, the i860 XP CPU lEN and CACHE# 
outputs indicate the number of transfers to be given 
to the CPU. MBRDY# 
need not be used in the 
transfer if only a single piece of data is required by 
the CPU. 
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If the read cycle is cacheable, it may cause another 
cached line to be bumped out of the cache. This is 
called a replacement and, if modified, causes a write 
back cycle. While one of the 82490XP memory buff- 
ers is being filled for the line fill, the write back buffer 
is loaded. If the line fill turns out to be non-cache- 
able at the end of the transfer, the write-back buffer 
is discarded, and the line in the cache remains valid. 
Otherwise, CADS# will be generated after the read 
cycle so the write back can be performed. The write 
back need not happen immediately after the line fill 
since the write-back buffer is snoopable. 


All locked reads go to the memory bus. If the read is 
a cache hit to M', the 82495XP/82490XP will ignore 
the data that the MBC returns, and provide it from its 
array. locked reads are not cacheable by the CPU 
or the 82495XP/82490XP. Snoop write-backs that 
are a result of a lOCKed read/write request must 
update memory. 


6.9.7.1 
Memory 
Bus Controller 
Responsibility 


Once the MBC sees a read cycle on the memory 
bus, it must determine whether the read is cache- 
able or non-cacheable using MCACHE# and its own 
address decoding. If non-cacheable, the CPU ex- 
pects a number of transfers as determined by its 
lEN and CACHE# outputs. If cacheable, the CPU 
expects 4 transfers, and the cache expects 4 or 8 
(configuration dependent). 


MKEN# is sampled during KWEND# to determine 
cacheability. Before MKEN# is sampled, KEN# is 
active assuming cacheability for the CPU. MKEN# 
must be sampled 1 clock before the first BRDY# to 
make the cycle non-cacheable. 


Once the read cycle is given to the memory system, 
all 82495XP/82490XP caches snoop to see if they 
contain the data in modified form. If so, the MBC 
must abort the cycle in memory and receive the data 
directly from the 82495XP/82490XP that has it, or 
wait until that cache writes it to memory. If the data 
transfer avoids memory, ie goes cache to cache, 
DRCTM# must be asserted with SWEND# to place 
the line in the M' state and the cache giving the data 
must invalidate its copy. 


MSEl# 
is activated and MBRDY# (or MISTB) used 
to sample input data from the read cycle. Once 
COTS# has been seen active, the CPU read data 
path is clear. BRDY# may be returned to the CPU 
sometime after each MBRDY# for each piece of in- 
put data (see MDATA setup to ClK). 
Once the 
transfer completes, MEOC# and CRDY# are as- 
serted to complete the cycle in the 82495XPI 
82490XP. 


• 
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6.9.8 I/O AND SPECIAL CYCLES 


I/O and special cycles (flush, etc) are decoded by 
the 82495XP and not posted. These cycles wait until 
all buffers have been written, and all cycles have 
been completed, before they cause CADS# asser- 
tion. The CPU waits until the special cycle ends with 
the MBC's BRDY# assertion before it continues. 


When 
the 
82495XP/82490XP 
is 
performing 
a 
FLUSH or SYNC, many write back cycles are re- 
quired. These cycles look like ordinary write back 
cycles, and should be handled as such. FSIOUT# is 
active during these write back cycles, so when FSI- 
OUT# goes inactive the cycle is complete and the 
memory bus controller can supply BRDY# to the 
CPU. 


6.10 
Different Bus Widths 


The 82490XP is capable of supporting either 64- or 
128-bit memory bus widths. Depending on the con- 
figuration, the 82490XP's CPU and I/O busses may 
be multiplexed. The following diagram shows how 
an i860 Xp· CPU may be connected to a 128-bit 
memory bus: 


0124-0127 
060-063 
060-063 
--~ 
~ 
--~ 
~ 
--~ 
~ 


--~ 


--~ 


068-071 
04-07 
04-07 


064-067 
00-03 
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Figure 6-18. 82490XP On Wide Bus 
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In this example, the CPU port of the 82490XPs is in 
x4 mode and the memory bus port is in x8 mode. 
This allows all 128 bits of the memory bus to be 
multiplexed to the 64-bit CPU bus. 


For read cycles, each MBRDY# loads 8 bits into 
each 82490XP. This is 128-bits of data. It will take 2 
BRDY# assertions to load this into the CPU. The 
first BRDY# assertion loads the first 4 bits onto the 
CPU bus, and the next BRDY# assertion loads the 
remaining 4 bits. 


For a 64·bit write cycle, the data is available at the 
on the appropriate data bits. On the i860 XP CPU 
with a 128-bit bus, this is determined by CPU ad- 
dress bit A3. The other data bits are undefined. For 
write-back cycles, all 128 bits are available at once. 
MBRDY# assertion will strobe the next 128 bits on 
the memory bus. 


7.0 
DETAILED PIN DESCRIPTIONS 


The following chapter provides a detailed descrip- 
tion of each pin of the 82495XP and 82490XP. The 
pins have been categorized by function. Each pin 
description has a heading which summarizes the 
most important aspects of the pin. The heading is 
organized as: 


Pin Name 


Name Meaning 
Pin Function 
liD, 
82495XP/82490XP/i860 
XP CPU, (locatton) 
Signal Type 
Synchronous! Asynchronous 


for example, 


CADS# 


Cache Address Strobe 
Indicates beginning of cache cycle 
Output from 82495XP (pin E3) Cycle Control Signal 
Synchronous to ClK 


Following the heading are three sections. The first 
section, Signal Description, provides information of 
what the signal does, how to use it, and in what 
modes it operates. The second section, When Sam- 
pled or When Driven, indicates all the exact places 
where the part samples the signal, generates the 
signal, or neither. The third section, Relation to Oth- 
er Signals, mentions the other signals that are af- 
fected by this signal, synchronization requirements, 
and shared pins. 


All specific information about each pin is provided in 
this chapter. 


7.0.1 
CONFIGURATION 
SIGNALS 


These signals are inputs to the 
82495XP and 
82490XP that are sampled at RESET and alter the 
configuration and operation of the cache. 


Setup 
Hold 
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Figure 7-1. Configuration 
Input Setup and Hold 


Each set of configuration inputs may have different 
setup times, but all signals have the same hold time: 
The signals may be released on the CPU clock edge 
that RESET is detected inactive. There are some 
configuration signals mat are strapping options and 
cannot change their value during 82495XP opera- 
tion. 


7.0.2 
CPU BUS INTERFACE 
SIGNALS 


These pins comprise the interface between CPU 
and 82495XP/82490XP. The signals in this interface 
are not flexible; Chapter 10 addresses the use of 
these signals. The following are the CPU bus inter- 
face signals: 


SETO-SET10 
ADS# 
M/IO# 
PWT 
BRDYC1# 
EADS# 
BOFF# 


TAGO-TAG11 
W/R# 
HITM# 
PCD 
KEN# 
BEO-BE7#< 


CFAO-CFA6 
D/C# 
lOCK # 
lEN 
AHOLD 
INV 


The majority of these signals must be connected 
strictly between the i860 XP CPU and the 82495XP. 
However, a subset of these signals is needed by the 
MBC to decode the i860 XP CPU cycle in cases 
where the MBC provides BRDYs to the CPU. For 
these purposes the following signals must also be 
inputs to a latch controlled by the 82495XP's BlE # 
output: 


BEO#-BE7# 
lEN 
PWT 


CACHE# 
PCD 
CTYP 
PCYC 
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7.0.3 
82495XP/82490XP 
INTERFACE 
SIGNALS 


These pins comprise the interface between the 
82495XP and 82490XP. The 82495XP uses these 
pins to control the 82490XP and its buffers. The sig- 
nals in this interface are not flexible; Chapter 10 ad- 
dresses the use of these signals. The following are 
the 82495XP/82490XP interface signals: 


WRARR# 
BUS# 
WBA[SEC2] 
BLAST# 


WAY 
MCYC# 
WBTYP[lRO] 
BOFF# 


MAWEA# 
WBWE#[lR1] 
BRDYCU 


SIGNAL DESCRIPTIONS 


7.1 
BGT# 


Bus Guaranteed Transfer 
Signals 82495XP of memory bus controller's com- 
mitment to complete the bus cycle. 
Input to 82495XP (pin M4) Cycle Progress Signal 
Synchronous 


7.1.1 
SIGNAL 
DESCRIPTION 


The 82495XP owns all bus cycles (initiated by 
CADS#) until the memory bus controller accepts 
ownership. During this time cycles may be aborted 
due to a snoop. The memory bus controller signals 
its acceptance of ownership by driving BGT# active 
into the 82495XP. Once BGT# is driven active, the 
memory bus controller is responsible for completing 
the cycle on the memory bus. CRDY# signals com- 
pletion of the cycle. 


Once BGT# is asserted, other devices may not per- 
form snoops into the 82495XP until the end of the 
snooping window, SWEND# activation. The snoop 
address is latched if SWEND is asserted between 
BGT# and SWEND#, but the snoop does not begin 
until after SWEND# is asserted. SNPCYC# will not 
be asserted until the snoop window ends with 
SWEND# 
asserted. The advantage of asserting 
BGT# early is that it allows the 82495XP to start 
inquiries to the CPU, load the write-back buffer, and 
progress forward in the CPU bus pipeline. The disad- 
vantage is that snooping of this 82495XP is now 
blocked until SWEND# is asserted. 


7.1.2 
WHEN 
SAMPLED 


After the 82495XP asserts CADS#, it begins sarn- 
pling BGT# until it is sampled active. 


BGT# is a "Don't Care" after it has been recog- 
nized for cycle N and prior to the assertion of 
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CADS# 
for cycle N+ 1. In addition, BGT# is a 


"Don't Care" once a cycle started by CADS# 
is 
aborted by a snoop, until the cycle is restored by the 
re-issueing of CADS#. 


7.1.3 
RELATION 
TO OTHER 
SIGNALS 


When implementing BGT# in the MBC the following 
rules should be used: 
1. BGT# 
must follow every assertion of CADS#, 


unless the cycle is aborted due to a snoop. 
2. It must preceed CRDY# (for line fills and alloca- 
tions BGT# must preceed CRDY# by at least 3 
ClKS). 
3. In addition BGT# must be asserted with or be- 


fore the assertion of KWEND# and SWEND#. 


4. BGT# must be asserted with or before the asser- 
tion of BRDY# by the MBC. 


5. BGT# is not required following the assertion of 
SNPADS#. 


6. BGT# must be asserted with or before MEOC# 


is asserted. 


7.2 BLE# 


BE latch Enable 
Controls latching of i860 XP CPU's byte enable and 
cycle attribute signals 
Output of 82495XP (pin C16) 
Cycle Control Signal 
Synchronous to ClK 


7.2.1 
SIGNAL 
DESCRIPTION 


BlE# 
is used to control the enable line of an exter- 
nal latch (clock edge triggered '377 type). This latch 
is used to capture the i860 XP CPU's byte enables 
(BEO#-BE7#) 
and CPU cycle attribute signals 
which do not go through the 82495XP.The 82495XP 
manages the opening and closing of this latch: when 
BlE # is active, new values from the CPU enter the 
latch at each rising edge of ClK. 


The 82495XP latches the byte enables after ADS# 
of a memory bus bound cycle. It relatches this infor- 
mation with CRDY# or CNA# of that cycle if anoth- 
er cycle is pending. 


7.2.2 
WHEN 
DRIVEN 


The 82495XP latches the BE latch signals 1 clock 
after ADS# of a memory-bound cycle. Thus latched 
BEO#-BE7# 
are valid with CADS#. The 82495XP 
opens, then closes this latch if a cycle is pending 
and CNA# 
or CRDY# 
is asserted. Thus latched 
BEO#-BE7# 
are valid two clocks after CNA# or 
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CRDY#, which is one clock AFTER CADS#' for 
back-to-back cycles. The signals latched in the BE 
latch are only valid for CPU generated memory bus 
cycles (ie, not a 82495XP generated writeback or 
allocation). 


7.2.3 
RELATION 
TO OTHER 
SIGNALS 


The following CPU signals must be latched in the BE 
latch: 


BEO#-BE7# 
lEN 
PWT 


CACHE# 
PCD 
CTYP 
PCYC 


All other Signals in the 82495XP to CPU interface 
(listed in sec. 7.0.2) must be connected only be- 
tween the i860 XP CPU and the 82495XP. 


7.3 BRDY# 


Burst Ready 
Memory 
Bus 
Controller 
Burst 
Ready 
input 
to 
82495XP, 82490XP, and i860 XP CPU 
Input to 82495XP and 82490XP (82495XP pin P1, 
82490XP pin 60) Cycle Progress Signal 
Input to i860 XP CPU (BRDY2#, pin U1) 
Synchronous to ClK 


7.3.1 SIGNAL 
DESCRIPTION 


The 
BRDY# 
input to 
both 
the 
82495XP and 
82490XP must be connected to the BRDY# signal 
which the MBC is providing to the i860 XP CPU's 
BRDY2# pin. The signal is used by the 82495XP for 
burst tracking purposes. In the 82490XP, it incre- 
ments the CPU latch burst counter. 


During CPU read cycles, BRDY# allows the next 32 
or 64-bit slice of read data to be available at the 
82490XP's CDATA outputs (CPU bus) by advancing 
the CPU latch burst counter. At the same time, 
BRDY# is latching the previous slice of data into the 
i860 XP CPU. Refer to chapter 6 for more details. 


During CPU write cycles, BRDY# is used to latch 
each slice of write data into the CPU latches and 
advance the latch counter. 


During CPU special and 1/0 cycles (which are not 
posted) BRDY# is used to end the cycle. 


BRDY# must not be asserted until the bus is grant- 
ed (BGT# asserted) and until the data path is ready 
for transferring (CDTS# is asserted). 


7.3.2 
WHEN 
SAMPLED 


BRDY11 is sampled by the CPU, the 82495XP, and 
the 82490XP at every ClK 
edge. It must always 
meet proper setup and hold times to ClK. 
Even 
though the CPU latch may not be in use, BRDY11 
assertion will still advance the latch counter. 


7.3.3 
RELATION 
TO OTHER 
SIGNALS 


BRDY11 controls the CPU and 82490XP CPU latch- 
es. BRDY11 has the following implication rules: 
1. The last BRDY11 for cycle N must be asserted 2 
clocks before MEOC11 for cycle N+ 1. 
2. BRDYII 2 BGTII 
3. BRDYII > CDTSII 


7.4 
C490LDRV 


82490XP low Drive Buffer 
Selects the 82495XP low capacitance driving buffers 
Input to 82495XP (pin M3) Configuration Signal 
Synchronous to ClK 


7.4.1 
SIGNAL 
DESCRIPTION 


C490lDRV 
selects 
the 
driving strength 
of 
the 
82495XP buffers that interface to the 82490XP. Re- 
fer to the layout specifications for information how 
C490lDRV should be connected. 


7.4.2 
WHEN 
SAMPLED 


C490lDRV is a configuration input sampled like Fig- 
ure 7-1. C490lDRV requires a setup time of 4 CPU 
clocks. After sampling, C490lDRV is a "don't care" 
until it is sampled as the BGT11 pin after the first 
CADS11 assertion. 
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7.4.3 
RELATION 
OT OTHER 
SIGNALS 


C490lDRV shares a pin with BGT11. 


7.5 
CADS# 


Cache Address Strobe 
Indicates beginning of cache cycle 
Output from 82495XP (pin E3) Cycle Control Signal 
Synchronous to ClK 


7.5.1 
SIGNAL 
DESCRIPTION 


CADSII requests the execution of a memory bus 
cycle to the MBC, and indicates that the cycle attri- 
butes (ie. CD/CII, 
CM/lOll, 
CW/RII, 
PAllCII, 


etc.) are valid. 


If the 82495XP receives a snoop hit to an [M] state 
line before BGT11 is asserted by the MBC, the cur- 
rent CADS11 is aborted and reissued after the snoop 
has completed. If the current line (issued by the 
stalled CADS11) is invalidated by the snoop, then 
that CADSII is cancelled ( ie. will not be reissued 
after the snoop is completed). 


CADSII is a glitch-free signal. 


7.5.2 
WHEN 
DRIVEN 


CADSII is asserted by the 82495XP for exactly one 
ClK, and is always a valid logic level. 


7.5.3 
RELATION 
TO OTHER 
SIGNALS 


CADS11, when asserted, indicates that the cache 
cycle control and attribute signals (ex. CD/CII, 
NENEII, CW/RII, etc.) are valid. 


Since allocations do not require BRDY11s to the 
CPU, the CDTSII of an allocation cycle will always 
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occur with CADS# of the allocation. In normal cy- 
cles the 82495XP will generate CADS# followed by 
CDTS#. 


CADS# = = CDTS# for all write-through cycles. 


Once CADS# is active, PAllC#, 
CWAY, CDTS#, 
and BUS# are valid. Address and cycle specifica- 
tion signals (MSETO-MSET10, MTAGO-MTAG11, 
MCFAO-MCFA6, 
CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, MCACHE#,NENE#,SMlN#,KlOCK#, 
and CPlOCK#) 
will be valid with CADS# active as 
well. 


Every CADS# initiated cycle requires a BGT# and 
CRDY# input from the MBC. 


CADS# and SNPADS# will never be asserted on 
the same ClK. 


7.6 
CAHOLD 


82495XP AHOlD Output 
Self-test result and AHOlD output status 
Output of 82495XP (pin G4) Test Signal 
Synchronous to ClK 


7.6.1 
SIGNAL 
DESCRIPTION 


CAHOlD has two functions. One, it indicates the re- 
sult of the built-in self-tests of the 82495XP. Two, it 
represents the 82495XP AHOlD 
into the i860 XP 
CPU. 


The 82495XP drives CAHOlD after the 82495XP 
self-tests 
have completed. 
CAHOlD 
should be 
latched when FSIOUT# goes inactive after reset. If 
CAHOlD is high, the self-tests have passed, other- 
wise they have failed. 


When the 82495XP drives AHOlD to the i860 XP 
CPU, it also drives CAHOlD, thus providing a means 
of tracking inquire cycles and back invalidations for 
performance monitoring. 


7.6.2 
WHEN 
DRIVEN 


CAHOlD is always at a valid logic level. During self- 
test, CAHOlD is held until the clock edge that FSI- 
OUT# is sampled inactive. After self-test, or reset, 
CAHOlD 
is asserted whenever the 82495XP as- 
serts AHOLD. 
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7.6.3 
RELATION 
TO OTHER 
SIGNALS 


CAHOlD reflects the value of AHOlD except during 
self-test. During self-test, the value of CAHOlD 
should be latched with the falling edge of FSIOUT# 
to determine pass/fail. 


7.7 
CD/C# 


Cache Data/Code 


Indicates whether current cycle is Code or Data 
Output from 82495XP (pin D3) Cycle Control Signal 
SynChronousto ClK 


7.7.1 
SIGNAL 
DESCRIPTION 


CD/C#, 
along with CW/R# 
and CMIIO#, 
is a 
82495XP cycle definition signal. It indicates the type 
of bus cycle being requested of the MBC. CD/C# 
can be pipelined by the memory bus controller (by 
using the CNA# inl1utto the 82495XP). 


7.7.2 
WHEN 
DRIVEN 


CD/C# 
is valid in the same ClK as CADS# and 
remains valid until CRDY# or CNA#. C/DC# 
is al- 
ways a valid logic level. 


7.7.3 
RELATION 
TO OTHER 
SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, 
MTAGO-MTAG11, 
MCFAO-MCFA6, 
CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, MCACHE#, 
NENE#, SMlN#, 
KLOCK#, and CPlOCK#) will be 
valid with CADS#. 


7.8 CDATAO-CDATA7 


CPU Data Bus Connection 
Data Bus Connection from 82490XP to CPU 
Input/Output to 82490XP (pins 48, 54, 49, 55, 46, 
51,52,57) 
Isolated Interface 
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7.8.1 SIGNAL DESCRIPTION 


CDATAO-7 is the 82490XP data bus connection to 
the CPU. All or part of these 8 pins will be used in 
connecting the 82490XP to the CPU depending on 
the cache configuration. See layout information for 
details. 


7.9 
CDTS# 


Cache Data Strobe 
Indicates availability of CPU data/data bus 
Output from 82495XP (pin F4) Cycle Control Signal 


Synchronous to ClK 


7.9.1 SIGNAL DESCRIPTION 


For read cycles, CDTSlIf, when asserted, indicates 
that in the next CPU clock the data bus path is avail- 
able. This is the earliest time in which BRDYlIf may 
be supplied to the CPU. For CPU initiated write cy- 
cles, it indicates that the data is available on the 
memory bus. For i860 XP CPU inquire cycles, 
CDTSlIf informs the MBC that the last piece of in- 
quire data is valid on the CPU bus. 


Usage of this signal allows complete independence 
between address strobes (CADSlIf and SNPADSlIf) 
and data strobe. CDTSlIf allows the 82495XP to sig- 
nal the MBC that a new cycle has begun as soon as 
addresses are available. This allows memory bus cy- 
cles to start before data is ready to be given/taken. 


CDTSlIf is a glitch-free signal. 
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7.9.2 WHEN DRIVEN 


CDTSlIf is asserted for one ClK, at the same time or 
later than CADSlIf for any given cycle. 


7.9.3 RELATION TO OTHER SIGNALS 


When the MBC samples CDTSlIf asserted, it can 
begin providing BRDYlIfs for the read cycle to the 
CPU in the next ClK. CDTSlIf must always be as- 
serted before CRDYlIf and must be asserted prior to 
the first BRDYlIf. 


The CDTSlIf of an allocation will always occur with 
CADSlIf of the allocation. In normal cycles the 
82495XP will generate CDTSlIf following CADSlIf. 


COTSlIf will be asserted at least one ClK 
after 
SNPADSlIf. 


7.10 
CFGO-CFG2 


Configuration Pins 
Determine Cache Characteristics 
Input to 82495XP (pins l4, Q1, M4,) Configuration 
Signals 
Synchronous to ClK 


7.10.1 SIGNAL DESCRIPTION 


CFGO-CFG2 are the 3 cache configuration inputs 
that determine cache characteristics such as line ra- 
tio, tag size, and lines per sector. During RESET,this 
information is passed on to the 82490XPs. The fol- 
lowing table maps CFGO-CFG2 to their respective 
configurations for the i860 XP CPU: 


Conflg 
Line 
Llnesl 
No. of 
CFG2 
CFG1 
CFGO 
No. 
Ratio 
Sector 
Tags 


1 
1 
1 
8K 
0 
0 
1 


2 
2 
1 
4K 
1 
1 
1 


3 
1 
2 
8K 
0 
0 
0 


4 
2 
1 
8K 
0 
1 
1 


5 
4 
1 
4K 
1 
1 
0 
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7.11 ClK 


i860 XP CPU, 82495XP, 82490XP Clock 
Input to the 82495XP (011) 


7.11.1 
SIGNAL 
DESCRIPTION 


The CLK input determines the execution rate and 
timing of the 82495XP, 82490XP, and CPU. Pin tim- 
ings are specified relative to the rising edge of this 
signal. The i860 XP CPU, 82495XP, and 82490XP 
requires TTL levels on CLK for proper operation. 


7.12 CM/IO# 


Cache Memory/lO 
Indicates whether current cycle is Memory or 10 
Output from 82495XP (04) Cycle Control Signal 
Synchronous to CLK 


7.12.1 
SIGNAL 
DESCRIPTION 


CM/IO#, 
along with CW/R# 
and Co/C#, 
is a 
82495XP cycle definition signal. It indicates the type 
of bus cycle being requested of the MBC. CM/IO# 
can be pipelined by the memory bus controller 
(CNA# input to the 82495XP). 


7.12.2 
WHEN 
DRIVEN 


CM/IO# 
is valid in the same CLK as CAoS#, and 
remains active until CRoy# 
or CNA#. 


7.12.3 
RELATION 
TO OTHER 
SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, MTAGO-MTAG11, MCFAO-MCFA6, CWI 
R#, 
CM/IO#, 
Co/C#, 
RoYSRC, 
MCACHE#, 
NENE#, SMLN#, KLOCK#, and CPLOCK#) will be 
valid with CAoS# assertion. 
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CNA# is an optional input for all cycles initiated with 
CAoS#. 


7.13.2 
WHEN 
SAMPLED 


CNA# 
is sampled with the later of BGT# 
and 
CoTS#. If CoTS# was given for a cycle, CNA# will 
be sampled from the clock of BGT# until CRoy # is 
sampled active. If BGT# is given before COTS# , 
CNA# will be sampled from the clock of COTS# 
until CRoy # is sampled active. Once CNA# is sam- 
pled active, or once CRoy # has been given for a 
cycle, CNA# is ignored until the later of BGT# or 
COTS# for the next cycle. 


CNA# is ignored during snoop write-back cycles. 


7.13.3 
RELATION 
TO OTHER 
SIGNALS 


Once the 82495XP samples this signal active, it is- 
sues the CADS# for the next memory bus cycle as 
soon as one begins. 


1.14 
C,RDY# 


Cache Ready 
Ends a cycle in the 82495XP/82490XP 
Input to 82495XP and 82490XP (pins M2, 43) Cycle 
Progress Signal 
Synchronous to CLK 


7.14.1 
SIGNAL 
DESCRIPTION 


CRoy # is used by the 82495XP and 82490XP to 
end a memory bus cycle. CRoy # indicates full com- 
pletion 
of 
the 
cycle 
and 
allows 
the 
82495XP/82490XP to free internal resources for the 
next cycle. In the 82490XP, this means that the cur- 
r~nt memory buffer in use is emptied (put in array, 
discarded, etc). In the 82495XP, CRoy # assertion 
allows 82495XP cycle progress signals (BGT#, 
KWEND#, SWEND#) to be sampled for the next 
cycle if pipelining is used. 
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CRDY# is required for all 82495XP/82490XP mem- 
ory bus cycles, including snoop cycles. CRDY# 
must be asserted to the 82495XP and 82490XP at 
the same time. 


7.14.2 
WHEN 
SAMPLED 


CRDY# for a given cycle is ignored until KWEND# 
is returned for that cycle. If KWEND# is not required 
for the cycle, CRDY# is ignored until BGT#. When 
CRDY# is ignored, it may violate setup and hold 
times. 


7.14.3 
RELATION 
TO OTHER 
SIGNALS 


CRDY'" 
must be sampled by the 82495XP and 
82490XP at the same time. For the 82495XP, 
CRDY# has many cycle implication rules: 
1. CRDY# > CDTS# 
2. CRDY# > BGT# . 
3. CRDY# > BGT# + 2 clocks if cycle is a line-fill 
or allocation 
4. CRDY# > KWEND# if cycle is a line-fill or write- 
through with potential allocation (PAllC# 
= 0) 


For the 82490XP, CRDY# has three basic rules: 
1. MEOC# for cycle N must be sampled with or be- 
fore CRDY# for cycle N. 
2. MEOC# for cycle N+ 1 must be sampled at least 
2 CPU clocks after CRDY# for cycle N. 
3. CRDY# for cycle N+ 1 must be after the last 
BRDY# for cycle N. 


MBRDY# fills the current 82490XP memory buffer. 
CRDY# emties this buffer and makes it available for 
new cycles. CRDY# may be asserted on the same 
clock as MEOC# which may be asserted on the 
same clock as MBRDY#. 


CRDY# shares a pin with SlFTST #. 


7.15 CWAY 


Cache Way 
Indicates WAY used by the current cycle 
Output from 82495XP (pin J3) Cycle Control Signal 
Synchronous to ClK 
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7.15.1 
SIGNAL 
DESCRIPTION 


CWAY is a cycle definition signal which indicates to 
the MBC the WAY used by the requested cycle. On 
line-fills it indicates the way the line will be loaded. 
For write-hits (to [S] state or lOCKed) it indicates 
the way which was a hit. For write-backs it indicates 
the way that was written-back. 


CWAY is utilized by external tracking machines in 
order for the 82495XP tags to be accurately dupli- 
cated. 


7.15.2 
WHEN 
DRIVEN 


CWAY is valid together with CADS# and remains 
valid until CRDY# or CNA#. 


7.15.3 
RELATION 
TO OTHER 
SIGNALS 


CWAY is valid with CADS#. 


7.16 CW/R# 


Cache Write/Read 
Indicates whether current cycle is write or read 
Output from 82495XP (pin E4) Cycle Control Signal 
Synchronous to ClK 


7.16.1 
SIGNAL 
DESCRIPTION 


CW/R#, 
along with CD/C# 
and CM/IO#, 
is a 
82495XP cycle definition signal. It indicates the type 
of bus cycle being requested of the MBC. CW/R# 
can be pipelined by the memory bus controller 
(CNA# input to the 82495XP). 


7.16.2 
WHEN 
DRIVEN 


CW/R# 
is valid in the same ClK as CADS# and is 
valid until CRDY# or CNA# . 


7.16.3 
RELATION 
TO OTHER 
SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, 
MTAGO-MTAG11, 
MCFAO-MCFA6, 
CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, MCACHE#, 
NENE#, SMlN#, 
KlOCK#, 
and CPlOCK#) will be 
valid with CADS#. 
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7.17 DRCTM# 


Memory Bus Direct to [M] State 


Signals 82495XP to tag data direct to the [M] state, 
skipping the [E] and [S] states. 


Input to the 82495XP (pin M1) Cycle Attribute Signal 
Synchronous to CLK 


7.17.1 SIGNAL DESCRIPTION 


DRCTM# is an input to the 82495XP from the memo 
ory bus. When sampled active at the end of the 
snooping 
window 
(SWEND# 
activation), 
the 
82495XP moves the line fill in progress directly to 
the [M] state. 


There are three cases in which this is useful. 
1. Simplifies External State Tracker 
External trackers can only track the [Ml, [Sl, and 
[I] states. The [E] state can not be tracked exter- 
nally since cache write hits internally change [E] 
state lines to [M] state. DRCTM# can be used to 
eliminate the [E] state from the MESI protocol. 


2. Read For Ownership 
During a write miss with allocation the write may 
go to the memory buffer and not be written to 
memory. A read from memory, in conjunction with 
the MFRZ# signal asserted, reads the data to fill 
around the bytes written by the CPU. The con- 
tents of the memory buffer are then entered into 
the cache. The cache would normally tag this 
data in the [E] state (The cache assumes the 
write went to main memory). The system has the 
option of never completing the write to memroy 
(increases performance by completing the alloca- 
tion quicker). If the write is not performed to 
memory, the cache is the only owner of the new 
data and therefore the cache entry must be 
tagged to the [M] state. 
3. Cache to Cache Transfer 
A cache to cache transfer may occur as a result 
of a snoop. For example, if CPU/Cache 1 per- 
forms a read from main memory and CPU/Cache 
2 flags it as a snoop hit to an [M] state line. To 
expedite the transfer, the system may perform 
the writeback from CPU/Cache 
2 directly to 
CPU/Cache 1, bypassing memory. CPU/Cache 1 
assumes the write-back went to memory and 
would normally tag the line to the [S] state. Since 
the system did not perform the write to memory, 
the system should drive DRCTM# to force the 
line to the [M] state. In addition, the line should 
be 
invalidated 
in 
CPu/Cache 
2 
by 
driving 
SNPINV. 


infel~ 


7.17.2 WHEN SAMPLED 


DRCTM# is synchronous to CLK. It is only sampled 
when SWEND# is active (the end of the snooping 
window). When SWEND# is inactive DRCTM# is 
ignored and does not have to meet setup and hold 
times. 


7.17.3 RELATION TO OTHER SIGNALS 


DRCTM# (direct to [M]) and MWB/WT # (write polt- 
cy) combine to define the memory bus attributes and 
are sampled on ClK at the end of the snooping wln- 
dow (SWEND# activation). 


If MRO# 
is sampled active 
during KWEND#, 


DRCTM# is ignored. 


7.18 FLUSH# 
' 


Flush 


Causes a 82495XP Cache Flush 
Input to 82495XP (N4) Cache Synchronization Sig· 
nal 
Asynchronous input 


7.18.1 SIGNAL DESCRIPTION 


This signal causes the 82495XP to flush all its modi- 
fied lines to main memory. The flushing of modified 
lines require the 82495XP to perform back-invalida- 
tion and inquire cycles to the CPU. At the end of 
flush, the 82495XP tag array will be completely inval- 
idated. 


FlUSH# 
will invalidate the entire 82495XP tag ar- 
ray. It takes two clocks to look-up and invalidate a 
tag entry. The 82495XP will also invalidate tags in 
the CPU cache by running back-invalidation cycles. 
If the 82495XP tag state is modified, the 82495XP 
will run inquire cycles to the i860 XP CPU to see is 
the line is modified in its cache. If so, the i860 XP 
CPU will write back the line into the 82495XP write 
buffer. All modified 82495XP cache lines must be 
written to memory. 


7.18.2 WHEN SAMPLED 


FLUSH# can be asserted at any time. The 82495XP 
will complete all outstanding transactions on the 
CPU 
and 
memory 
bus 
before 
beginning 
the 
FLUSH# process. The memory bus controller does 
not have to prevent FLUSH# during locked cycles 
because the 82495XP will complete its locked trans- 
action before the FLUSH# process will begin. 


Once a FLUSH# operation has begun, the FLUSH# 
signal is ignored until the operation completes. If 
RESET is activated while the FLUSH# operation is 
in progress, the FLUSH# operation will be aborted 
and the RESET immediately executed. 


FLUSH# is an asynchronous jnput. FLUSH# must 
have a pulse width of 2 ClK's in order to guarantee 
82495XP recognition. 


7.18.3 RELATION TO OTHER SIGNALS 


To initiate a FLUSH#, the 82495XP will complete all 
pending cycles and prohibit the processor from issu- 
ing any further ADS#'s 
while the FlUSH# 
is in 
progress. The FSIOUT# output signal is used to in- 
dicate the start and end of the FLUSH# operation. It 
will become active when the FLUSH# signal is inter- 
nally recognized (all outstanding cycles have com- 
pleted) and will de-activate with the CRDY# of the 
last FLUSH# write-back. 
' 


The memory bus controller supplies BRDY# to the 
CPU once FSIOUT# 
has gone inactive and the 
FLUSH is complete. Once FLUSH# has begun, and 
FSIOUT# active, all CADS#'s and CRDY#'s corre- 
spond to write-backs caused by the FLUSH# opera- 
tion. 


The 82495XP can be snooped during FLUSH# cy- 
cles and the snooping protocols will be the same as 
that for any memory bus cycle. 


7.19 
FPFLD# [FPFLDEN] 


External FIFO PFlD 
Indicates PFLD cycle during external PFLD FIFO 
mode 
Output of the 82495XP (J4) Cycle Control Signal 
Sync to ClK 


7.19.1 SIGNAL DESCRIPTION 


During RESET, this pin functions as the FPFlDEN 
configuration signal. The 82495XP can be config- 
ured to decode the i860 XP microprocessor's PFlD 
cycles. The 82495XP supports 3 operational modes 
for PFlD cycle decoding, as defined by FPFlDEN 
and NCPFlD#: 
Mode # 1. PFlD cycles are cached in the 82495XP. 
Mode # 2. PFlD 
cycles are not cached in the 
82495XP, without an external PFLD ex- 
tension FIFO. 
Mode #3. PFlD cycles not cached in the 82495XP, 
with an external PFlD extension FIFO. 
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Mode 
FPFLDEN 
NCPFLD# 


1 
0 
1 


2 
0 
0 


3 
1 
1 


Illegal Mode 
1 
0 


If mode 3 has been selected, the 82495XP allows 
the PFlD pipeline to be extended with an external 
FIFO. After RESET, when this mode has been se- 
lected, the FPFlD output will indicate that the re- 
quested cycle is a PFlD cycle. See Section 5.2.5 for 
more details. 


7.19.2 WHEN DRIVEN 


FPFlDEN is sampled on RESET as in figure 7-1, 
with a setup time of 4 CPU clocks. In PFlD mode 
#3, the FPFlD# 
output is valid in the same ClK as 
CADS# and remains valid until CRDY# or CNA#. 


7.19.3 RELATION TO OTHER SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, 
MTAGO-MTAG11, 
MCFAO-MCFA6, 
CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, MCACHE#, 
NENE#, SMlN#, 
KlOCK#, and CPlOCK#) will be 
valid with CADS#. 


7.20 
FSIOUT# 


Flush, Sync, Initialization Output 
Indicates the start and end of the Flush, 
Sync, anp Initialization operations. 
Output of the 82495XP (01) Cache Synchronization 
Signal 
Sync to ClK 


7.20.1 SIGNAL DESCRIPTION 


This signal indicates the start and the end of either a 
Flush, Sync, or Initialization (inciuding self-test if re- 
quested) operation. These operations are mutually 
exclusive. This signal is activated when the 82495XP 
begins the operation and goes inactive upon com- 
pletio~ of the operation. 


7.20.2 WHEN DRIVEN 


This signal will be asserted whenever a Flush, Sync, 
or Initialization operation is internally recognized by 
the 82495XP and is in progress. 
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7.20.3 
RELATION 
TO OTHER 
SIGNALS 


FSIOUT# active indicates that either Flush, Sync, or 
Initialization operation is in progress. Only one of 
these operations can be run within the 82495XP at a 
time. 


The table below shows the priorities of these three 
operations: 


Operation 
Trigger 
Priority 


Initialization 
RESET 
Highest 


Flush 
FLUSH# 


Sync 
SYNC# 
lowest 


If a trigger of higher priority occurs while a lower 
priority operation is running, the lower priority opera- 
tion is aborted and the higher priority one executed. 
If a trigger of lower priority occurs when a higher 
priority one is running, the lower priority trigger is 
ignored. Once a FlUSH# 
or SYNC# operation has 
begun, its trigger is ignored until the operation com- 
pletes. 


When a higher priority operation aborts a lower prior- 
ity one, FSIOUT# remains active. 


Since RESET, FlUSH# 
and SYNC# are all asyn- 
chronous, FSIOUT# 
will be activated when the 
82495XP is actually internally executing the opera- 
tion. 


7.21 HIGHZ# 


High Impedance Outputs 
Causes 82495XP outputs to be tristated 
Input to 82495XP (pin P4) Test Signal 
Synchronous to ClK 


7.21.1 
SIGNAL 
DESCRIPTION 


The 82495XP will enter self-test if both SlFTST # is 
active and HIGHZ# 
is inactive during reset. If 
SlFTST# 
is sampled active and HIGHZ# is sam- 
pled active during reset, the 82495XP floats all its 
outputs until the 82495XP is reset again. Activation 
of HIGHZ# without SlFTST # does nothing. 


7.21.2 
WHEN 
SAMPLED 


HIGHZ# is sampled like figure 7-1 with a setup time 
of 10 CPU clocks. HIGHZ# is then a don't care until 
the 82495XP reset sequence is complete (with FSI- 
OUT# going inactive) where it becomes the MBAlE 
pin. 
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7.21.3 
RELATION 
TO OTHER 
SIGNALS 


HIGHZ# shares a pin with MBAlE. 82495XP out- 
puts are tristated if both HIGHZ# and SlFTST # are 
sampled active during reset. 


7.22 KLOCK# 


82495XP lOCK # 
Request to MBC of lOCKed cycle 
Output from 82495XP (pin C3) Cycle Control Signal 


Synchronous to ClK 


7.22.1 
SIGNAL 
DESCRIPTION 


KlOCK # indicates to the MBC that there is a re- 
quest to execute a locked cycle. This signal follows 
the CPU lock request. 


KlOCK # is simply a one-clock flow-through version 
of the CPU lOCK # signal. The 82495XP will acti- 
vate KlOCK# 
with CADS# of the first cycle of a 
lOCKed operation and it will remain active until the 
CADS# of the last cycle of the lOCKed operation. 


Note that if the memory bus is pipelined, there may 
be a situation in which KlOCK # deactivation is in 
the same ClK as its new activation (together with 
CADS#). In this case KlOCK# 
won't go inactive 
between back-to-back locked sequences. KlOCK# 
will never go inactive if the CPU lOCK # does not go 
inactive. The 82495XP will not open arbitration win- 
dows between back-to-back locked sequences; it is 
the memory bus controller's responsibility to imple- 
ment this functionality by detecting a lOCKed write 
followed by a lOCKed read. 


KlOCK # activation is not qualified by the tag array 
look-up (hit/miss indications); therefore, KlOCK# 
can be active before CADS# is asserted. 


7.22.2 
WHEN 
DRIVEN 


KlOCK# 
assertion is a flow-through of 1 ClK from 
the CPU lOCK # after the 82495XP completes all 
pending cycles. KlOCK # 
deassertion is a flow- 
through of 1 ClK from the CPU lOCK # signal, and 
must be at least 1 ClK after the last CADS# of a 
lOCKed sequence. KlOCK # is always driven to a 
valid logic level. 


7.22.3 
RELATION 
TO OTHER 
SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, MTAGO-MTAG11, MCFAO-MCFA6, CW/ 
R#, 
CM/IO#, 
CD/C#, 
RDYSRC, 
MCACHE#, 
NENE#, SMlN#, 
KlOCK#, 
and CPlOCK#) will be 
valid with CADS#. 


7.23 
KWEND# 


Cacheability Window End 
Closes 82495XP Cacheability Window 


Input to 82495XP (pin M4) Cycle Progress Signal 
Synchronous to CLK 


7.23.1 
SIGNAL 
DESCRIPTION 


KWEND# is a cycle progress input to the 82495XP 
that, when active, closes the cacheability window 
and causes the cacheability attributes MKEN# and 
MRO# to be sampled. 


KWEND# is sampled by the 82495XP after BGT# 
has been sampled active. KWEND# should be as- 
serted by the MBC once the memory address has 
been decoded and cacheability (MKEN#) and read- 
only (MRO#) attributes have been determined. 


The sampling of KWEND# active allows SWEND# 
to be sampled. Resolving KWEND# quickly allows 
the non-cacheable window between BGT# 
and 
SWEND# to be closed more quickly. KWEND# ac- 
tivation also allows the 82495XP to start allocations 
and begin replacements. 


7.23.2 
WHEN 
SAMPLED 


KWEND# is sampled by the 82495XP on the clock, 
or after, BGT# has been sampled active. Once 
KWEND# is sampled active it is not sampled again 
until BGT# of the next cycle. KWEND# need not 
follow setup and hold times if it is not being sampled. 


BGT#, KWEND# and SWEND# may be asserted 
on the same clock edge. 


KWEND# need only be activated for those cycles 
which require the sampling of MKEN# and MRO#. 
These are line-fills and write cycles with potential 
allocation. 


7.23.3 
RELATION 
TO OTHER 
SIGNALS 


KWEND# is sampled on or after BGT# and allows 
the sampling of SWEND#. 
KWEND# 
activation 
causes the sampling of MKEN# and MRO#. 


According 
to 
cycle 
progress 
implication 
rules, 


CRDY# must be at least one clock after KWEND# 
for line fills and write-through cycles with potential 
allocate. 


KWEND# shares a pin with CFG2. 
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7.24 
MALE 


Memory Address Latch Enable 
Tristates/Enables Memory Address Outputs 


Input to 82495XP (pin 02) Cycle Control Signal 
Asynchronous 


7.24.1 
SIGNAL 
DESCRIPTION 


The 82495XP contains an address latch which con- 
trols the last stage of the 82495XP address output. It 
is controlled by four signals: MAOE#, 
MBAOE#, 


MALE, and MBALE. The signals MALE and MBALE 
control the latching of the entire 82495XP address 
where MBALE controls the subline portion and 
MALE controls the rest. 


MALE is provided so that the memory bus controller 
can control when the next pipelined address is driv- 
en. With MALE high, the 82495XP address latch is in 
'flow-through' mode and the 82495XP address is 
available at the 
memory bus. Changes in the 
82495XP address are seen immediately at the mem- 
ory bus. When MALE is driven low the address at 
the latch input is latched. Any subsequent address 
driven by the 82495XP will not be seen at the memo- 
ry bus outputs until MALE is driven high again. 


MALE will latch 82495XP.addresses regardless of 
the state of MAOE#. If MAOE# is inactive, MALE 
will still operate the latch properly, but the memory 
bus will be tristated. 


7.24.2 
WHEN 
SAMPLED 


MALE is asynchronous and can be asserted and 
deasserted at any time. MALE should always be 
driven to a valid state since it directly controls the 
operation of the address latch. 


7.24.3 
RELATION 
TO OTHER 
SIGNALS 


MALE together with MBALE control the latching of 
the entire 82495XP output address. The other latch 
control signals, MAOE# and MBAOE#, provide the 
memory bus controller complete command over the 
address outputs. MAOE# and MBAOE# do not af- 
fect the operation of MALE or MBALE. 


MALE shares a pin with the WWOR# configuration 
pin. 
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7.25 MAOE# 


Memory Address Output Enable 
Tristates/Enables Memory Address Outputs 
Input to 8249SXP (pin 84) Cycle Control Signal 


Asynchronous except during snoop cycles 


7.25.1 SIGNAL DESCRIPTION 


The 8249SXP has an address latch which is con- 
trolled by a latch input, MALE, and an output enable 
input, MAOE#. MAOE# has two main functions. 
One, driving MAOE# active will enable the 8249SXP 
to drive it's address lines MTAGO-11, MSETO-10, 
and MCFAO-6. Two, MAOE# is a qualifier for snoop 
cycles and must be inactive for the 8249SXP to 
snoop. 


In general, MAOE# should be active if its 8249SXP 
is the current bus master. When that 8249SXPgives 
up the bus, MAOE# should be inactive to float the 
address lines and allow another master to snoop. 


MAOE# controls the output of the 8249SXP ad- 
dress except the subline (burst) portion. This portion 
has a separate output control: MBAOE#. 


7.25.2 WHEN SAMPLED 


MAOE# 
is an asynchronous input (except during 
snoop cycles) and always has full control over the 
address output. For this reason, MAOE# must al- 
ways be driven to a valid state. 


The 8249SXP does, however, sample MAOE# dur- 
ing snoop cycles. When sampled, MAOE# must 
meet proper setup and hold times. In synchronous 
snoop mode MAOE# is sampled on a ClK edge. In 
clocked mode MAOE# is sampled on a SNPCLK 
edge. In strobed mode MAOE# is sampled with the 
falling edge of SNPSTB#. If MAOE# is sampled ac- 
tive, 
the 
snoop 
will 
be 
ignored. 
This 
allows 
SNPSTB# 
to share a common line for multiple 
8249SXPs. 


MAOE# need not meet any setup or hold time if it is 
not being sampled during a snoop cycle. 


7.25.3 RELATION TO OTHER SIGNALS 


MAOE# together with MBAOE# control the entire 
8249SXP address. Both signals are asynchronous 
and thus need never be synchronized to any clock. 
Both signals are, however, sampled during snoop 
cycles and require proper setup and hold times in 
these situations. 
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MALE and MAOE# together provide full control 
over the 8249SXP address output latch. 


7.26 MBALE 


Memory Burst Address Latch Enable 
Tristates/Enables Memory Burst Address Outputs 
Input to 8249SXP (pin P4) Cycle Control Signal 
Asynchronous 


7.26.1 SIGNAL DESCRIPTION 


The 8249SXPaddress latch is controlled by four sig- 
nals: MAOE#, MBAOE#, MALE, and MBALE. The 
signals MALE and MBALE control the latching of the 
entire 8249SXP address where MBALE controls the 
subline portion and MALE controls the rest. 


MALE and MBALE are provided so that the memory 
bus controller has complete flexibility when the next 
address is driven. With MBALE high, the subline por- 
tion of the, 8249SXP address latch is in "flow- 
through" mode and the 8249SXP subline address is 
available at the 
memory bus. Changes in the 
8249SXP subline address are seen immediately at 
the memory bus. When MBALE is driven low the 
subline address at the latch input is latched. Any 
subsequent subline address driven by the 8249SXP 
will not be seen at the memory bus outputs until 
MBALE is driven high again. 


MBALE will latch 8249SXP addresses regardless of 
the state of MAOE# or MBAOE#. If MBAOE# is 
inactive, MBALE will still operate the latch properly, 
but the subline portion of the memory bus will be 
tristated. 


Separate line and subline address latch controls are 
provided so that the latch outputs may be driven at 
different times. The table below indicates the subline 
address bits for each line size. 


Line Size (Bytes) 
Subllne Address 


32 
A3,A4 


64 
A4,AS 


128 
AS,A6 


7.26.2 WHEN SAMPLED 


MBALE is asynchronous and can be asserted and 
deasserted at any time. MBALE should always be 
driven to a valid state since it directly controls the 
operation of the address latch. 


7.26.3 
RELATION 
TO OTHER 
SIGNALS 


MALE together with MBALE control the latching of 
the entire 82495XP output address. The other latch 
control signals, MAOE11 and MBAOE11, provide the 
memory bus controller complete command over the 
address outputs. MAOEII and MBAOEII do not at- 
fect the operation of MALE or MBALE. 


MBALE shares a pin with the HIGHZII configuration 
pin. 


7.27 MBAOE# 


Memory Burst Address Output Enable 
Tristates/Enables Memory Subline Address Outputs 
Input to 82495XP (pin P6) Cycle Control Signal 
Asynchronous except during snoop cycles 


7.27.1 
SIGNAL 
DESCRIPTION 


The 82495XP address latch is controlled by four sig· 
nals: MAOEII, 
MBAOEII, 
MALE, and 
MBALE. 


MAOEII and MBAOEII are the output enables of 
this latch for the entire 82495XP address. Specifical· 
Iy, MBAOEII controls the subline address portion 
and MAOE11 controls the rest. 


MBAOEII has two functions. One, it can tristate the 
subline portion of the address separately from the 
rest of the address. Since the 82495XP does not 
sequence through burst addresses, the memory sys- 
tem may wish to provide the burst count. This re· 
quires that the 82495XP address burst portion be 
tristated after the first transfer. The Subline Address 
table appears in Section 7.26, MBALE. 


TWo, MBAOE11 is sampled during snoop cycles. If 
MBAOE11 is sampled inactive, the snoop write back 
cycle, if any, will begin at the subline address provid- 
ed. If MBAOEII is sampled active, the snoop write 
back will begin at subline address O. This allows 
snoop write backs to begin at the snooped subline 
address and progress through the normal burst or- 
der. 


7.27.2 
WHEN 
SAMPLED 


Like MAOE11, MBAOE11 is asynchronous except 
during snoop cycles and can be asserted or deas- 
serted at any time. Since MBAOEII has direct con- 
trol over the address latch, it must always be driven 
to a valid state. 


MBAOE# 
is ,however, sampled during snoop cy- 
cles. In synchronous snooping mode, MBAOEII 
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must meet proper setup and hold times to CLK's 
rising edge. In clocked mode, MBAOEII must meet 
setup and hold times to SNPCLK's rising edge. In 
strobed mode, MBAOEII must meet setup and hold 
times to SNPSTBII's falling edge. 


If MBAOE11 is not being sampled for a snoop, ie. 


. SNPSTBII is not asserted, MBAOEII need not meet 
any setup or hold time. 


7.27.3 
RELATION 
TO OTHER 
SIGNALS 


MAOE11 and MBAOE11 control the entire 82495XP 
address output asynchronously. This address latch 
is completely controlled by MALE, MBALE, MAOE11, 
and MBAOEII. 
. 


MBAOEII is only sampled by the 82495XP during 
snoop cycles with SNPSTBII. 


7.28 MBRDY# 


Memory Burst Ready 
Burst Ready input to 82490XP memory buffers 
Input to 82490XP (pin 22) Cycle Progress Signal 
Synchronous to MCLK 


7.28.1 
SIGNAL 
DESCRIPTION 


When in clocked memory bus mode, MBRDY11 (with 
MSELII 
active) is used to advance the memory 
burst counter for the 82490XP buffer in use. This 
causes either new data to be latched from the mem- 
ory bus (read cycle), or new data to be driven from 
the 82490XP buffer (write cycle). MBRDY11 is sam- 
pled on all MClK edges in which MSElII 
is sampled 
active and has no relation to ClK. In strobed mode, 
MBRDYII 
must be tied high as MISTB/MOSTB 
strobes data in/out of the 82490XP. 


For write cycles, the first piece of write data is avail- 
able at the MDATA pins. MBRDYII assertion with 
MSEL11 active causes the next 32, 64, or 128-bit 
slice of write data to be available. If only one slice is 
required, MSELII and MBRDYII need never go ac- 
tive. 


For read cycles, the first piece of read data flows 
through 
to 
the 
CPU. MBRDY11 
assertion 
with 
MSEL11 active causes the next slice of memory data 
to be latched in the 82490XP buffer. BRDY11 asser- 
tion will allow this data'lio be available on the CPU 
bus and latch it into the CPU. For cacheable cycles, 
MBRDY11 needs to be asserted 4 or 8 times de- 
pending on the cache configuration. 
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7.28.2 WHEN SAMPLED 


MBRDY# is sampled on all MClK 
edges where 
MSEl# 
is sampled active. In this way MSEl# 
quali- 
fies the MBRDY# input. If MSEl# 
is sampled mac- 
tive, MBRDY# need not follow setup and hold times 
to MClK. 


7.28.3 RELATION TO OTHER SIGNALS 


MBRDY# 
is 
qualified 
by 
the 
MSEl# 
input. 


MBRDY# advances the memory burst counter for 
the 82490XP in use which either inputs or outputs 
data through MDATA. 


MEOC# switches the 82490XP buffers to the next 
pending cycle, so the last MBRDY# must come be- 
fore or on the,clock of MEOC# assertion. 


7.29 
MCACHE# 


82495XP Internal Cacheability 
Indicates cycle cacheability attribute 
Output from 82495XP (pin C2) Cycle Control Signal 
Synchronous to ClK 


7.29.1 SIGNAL DESCRIPTION 


MCACHE# is driven by the 82495XP and indicates 
that the current cycle may be cached. Data caches- 
bility is determined later in the cycle by MKEN# as- 
sertion. MCACHE# is asserted for allocation, re· 
placement write-back cycles, and during cacheable 
read-miss cycles. (ie. read-miss cycles in which PCD 
is not asserted). It is not asserted for 10, special, or 
locked cycles, 


Cycle Type 
MCACHE# 


Posted Writes 
1 


Write Backs 
0 


Read,pCD = 0 
0 


Read, PCD = 1 
1 


Allocation 
0 


1/0 Cycles 
1 


locked Cycles 
1 


7.29.2 WHEN DRIVEN 


MCACHE# is valid in the same ClK as CADS# and 
remains valid until CRDY# or CNA#. 
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7.29.3 RELATION TO OTHER SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, 
MTAGO-MTAG11, 
MCFAO-MCFA6, 


CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, MCACHE#, 


NENE#, SMlN#, 
KlOCK#, 
and CPlOCK#) will be 
valid with CADS#. 


7.30 
MCFAo-MCFA6 
MSETo-MSET10 
MTAGo-MTAG11 


MCFAO-MCFA6 Memory Configuration Address 1/0 


MSETO-MSET10 Memory Set Address 1/0 
MTAGO-MTAG11 Memory Tag Address 1/0 
82495XP Memory Address Inputs/Outputs 
Input/Output of 82495XP (pins N14, P7-P15, 06- 
016, R4, R14-R17, S14-S17) 
Cycle Control Sig- 
nals 
Input Synchronous to ClK, SNPClK, or SNPSTB#. 
Output from ClK, MAOE# active or MALE high. 


7.30.1 SIGNAL DESCRIPTION 


MSETO-10, MTAGO-11, and MCFAO-6 provide the 
complete 30 bit address input/ output interface of 
the 82495XP to the memory bus. Together they 
span the entire CPU address range A2-A31. 
De- 
pending on the cache configuration, each pin repre- 
sents a different CPU address line (see configura- 
tion section for details). 


MSETO-10, 
MTAGO-11, 
and 
MCFAO-6 
pass 
through a 82495XP output latch. The latching of this 
latch is controlled by MAlE/MBAlE, 
and the output 
of this latch is controlled by MAOE#!MBAOE#. 


With 
MAOE#/MBAOE# 
active, 
MSET/MTAGI 
MCFA are 82495XP outputs. They are valid at the 
start of a memory bus cycle at the input of the 
82495XP address latch. If MAlE/MBAlE 
is high 
(flow-through) and 
MAOE#!MBAOE# 
is active 
(outputs enabled), they are driven to the memory 
bus with CADS# . 


If a new cycle starts and MAlE/MBAlE 
is low, the 
previous address remains valid at the 82495XP 
MSET/MTAG/MCFA outputs. Once MAlE/MBAlE 
goes high, the new address flows through with the 
appropriate 
propagation 
delay 
(MSET/MTAGI 
MCFA address valid delay from MAlE/MBAlE 
go- 
ing high). The new address will be driven to the 
82495XP MSET/MTAG/MCFA outputs if MAOE#! 
MBAOE# is active. 


If a new cycle starts, MAlE/MBAlE 
is high, and 
MAOE# IMBAOE # is inactive, the 82495XP MSETI 
MTAG/MCFA outputs will remain tristated. Once 
MAOE#!MBAOE# 
is asserted, the new address 
flows through with the appropriate propagation delay 
(MSET/MTAG/MCFA address valid from MAOE#! 
MBAOE# going active). 


MSETO-10, MTAGO-11, and MCFAO-6 are used 
as inputs to the 82495XP during snoop cycles. Here, 
MAOE#/MBAOE# 
is 
inactive. 
MSET/MTAGI 
MCFA are sampled by the 82495XP during snoop 
initiation just like the other snoop attributes. 


7.30.2 
WHEN 
SAMPLED 


If MAlE/MBAlE 
is high and MAOE# IMBAOE # is 
low, MSETO-10, MTAGO-11, and MCFAO-6 are 
valid with CADS# with a timing reference to ClK. 
Otherwise, 'they are asserted with a delay from 
MAlE/MBAlE 
high or MAOE#/MBAOE# 
active. 


MSETO-10, 
MTAGO-11, and MCFAO-6 change 
once CNA# or CRDY# is sampled active. MSETO- 
10, MTAGO-11, and MCFAO-6 have a float delay 
from MAOE#/MBAOE# 
going inactive. These out- 
puts are undefined after CRDY# ICNA # assertion 
and before the next CADS# assertion. 


As inputs during snoop cycles (SNPSTB# asserted), 
they must be sampled like other snoop attributes 
with proper setup and hold times. In synchronous 
snoop mode this is with respect to ClK; in clocked 
mode, this is with respect to SNPClK; 
and in 
strobed mode this is with respect to SNPSTB# fail- 
ing edge. 


If MAOE# is inactive and SNPSTB# is not asserted 
(no snoop), MSETO-10, MTAGO-11, and MCFAO- 
6 need not meet any setup or hold time. 


7.30.3 
RELATION 
TO OTHER 
SIGNALS 


MSETO-10, MTAGO-11, and MCFAO~6are assert- 
ed with CADS# so they are valid when CADS# is 
sampled active. This is true as long as MAlE/MBA- 
lE 
is high and MAOE#/MBAOE# 
is active. If 
MSETO-10, MTAGO-11, and MCFAO-6 have been 
asserted but are blocked 
by MAlE/MBAlE 
or 
MAOE#!MBAOE#, 
they are asserted from MAlEI 
MBAlE going high or MAOE#/MBAOE# 
going ac- 
tive. 


MSETO-10, MTAGO-11, and MCFAO-6 are deas- 
serted or changed with CADS# or CNA# active. 
They may also be floated with MAOE# going inac- 
tive. 
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MSETO-10, MTAGO-11, and MCFAO-6 are used 
as inputs during snoop cycles. They are sampled 
with SNPSTB# like any other snoop attribute signal. 


7.31 
MCLK 


Memory Bus Clock 
Input to the 82490XP (Pin 26) 


7.31.1 
SIGNAL 
DESCRIPTION 


In a clocked memory bus mode, this pin provides the 
memory bus clock. Memory bus signals and memory 
bus data are sampled on the rising edge of MClK. 
Memory bus write data is driven off 
MClK 
or 
MOClK depending upon the configuration. MClK 
has no relation to ClK. 


7.31.3 
RELATION 
TO OTHER 
SIGNALS 


MClK shares a pin with MISTB. 


In clocked memory bus mode, the MDATA7-MDA- 
TAO, MSEl#, 
MFRZ#, 
MBRDY#, 
MZBT#, 
and 
MEOC# pins are sampled synchronously with the 
rising edge of MClK. In a clocked memory bus write, 
MDATA7-MDATAO are driven synchronous with 
MClK or MOClK. 


MOClK is a delayed version of MClK. If a clocked 
memory bus configuration 
is 
chosen, 
and 
the 
MOClK rising edge is detected by the 82490XP af- 
ter RESET, data will be driven off of MOClK rather 
then MClK. 
Only data is effected 
by MOClK. 


MOClK is used to allow the system designer to in- 
crease the minimum output time of MDATA relative 
to MClK. 


7.32 
MDATAO-MDATA7 


Memory Bus Data Pins 
82490XP Connection to the Memory Bus 
Input/Output of 82490XP (pins 18, 14, 10, 6, 16, 12, 
8,4) 
Synchronous to ClK or MClK or MOClK or MISTB 
or MOSTB. 


7.32.1 
SIGNAL 
DESCRIPTION 


MDATAO-7 is the 82490XP data bus connection to 
the memory bus. All or part of these pins will be used 
depending on the cache configuration. These pins 
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Ing MBRDY# or MISTS. 


7.32.2 
WHEN 
DRIVEN 


When the CPU or 82495XP initiates a write cycle, 
the write data is written to the appropriate 82490XP 
buffer and CDTS# is asserted. If MDOE# is active, 
that first piece of write data will be available at the 
MOATA pins with some delay from the CPU ClK 
edge that COTS# is asserted. Subsequent pieces of 
write data are output with some delay from MClK or 
MOClK 
(mode dependent) from the edge that 
MSRDY# is sampled active. In strobed mode, sub- 
sequent data is output with MOSTS assertion. 


MDATA has no value before CDTS# assertion, after 
MEOC# with no pending cycle, or with MDOE# in- 
active. 


For read cycles, the 82495XP asserts CDTS# the 
clock before the MDATA path is available for read 
data. MDOE# must be inactive for the 82490XP to 
read data. Read data is strobed into the 82490XP by 
asserting MSRDY# on MClK edges. MEOC# will 
latch the last piece data as it switches buffers. In 
strobed mode, data is read by MISTS. Data that is 
read into MDATA must meet proper setup and hold 
times. 


Data at the MOATA inputs need not follow setup and 
hold times to MClK edges that sample MSRDY# 
inactive. 


7.32.3 
RELATION 
TO OTHER 
SIGNALS 


CDTS# indicates that write data is in the 82490XP 
buffers. If MDOE# is active, write data is available at 
MDATA some time after CDTS# or MEOC# is sam- 
pled active. Subsequent write data is available at 
MDATA after MSRDY# assertion or MOSTS chang- 
ing. 


2-338 


Memory Data Output Enable 
Tristates/Enables Memory Data Outputs 
Input to 82490XP (pin 20) Cycle Control Signal 


Asynchronous 


7.33.1 
SIGNAL 
DESCRIPTION 


MOOE# is an input to the 82490XP that, when as- 
serted, causes the 82490XP to drive its MDATAO- 
MDATA7 outputs. When MOOE# is inactive, these 
lines are floated and may be used as inputs to the 
82490XP. MDOE# is not sampled by any clock and 
is a direct connection to the 82490XP memory ouput 
driver. 


7.33.2 
WHEN 
SAMPLED 


Since 
MDOE# 
is 
a 
direct 
connection 
to 
the 
82490XP memory output drivers, MDOE# must al- 
ways be driven to a valid level. With MDOE# inac- 
tive, data in the 82490XP's may be driven to MDATA 
outputs with some propagation delay from MDOE# 
going active. Similarly, there is some float delay from 
MDOE# going inactive. 


MOOE# must be inactive for the 82490XP to read 
memory data. 


7.33.3 
RELATION 
TO OTHER 
SIGNALS 


MDOE# 
hat' no relation to MClK, 
MOClK, 
or 
MOSTS. Since MDOE# controls the final stage of 
the MDATA output buffers, it has no effect on any 
other signal of the 82490XP. 


7.34 
MEMLDRV 


Memory low Capacitance Drivers 
Selects 
the 
low 
Capacitance 
Drivers 
for 
the 
82495XP and the 82490XP 


Inputs t'o 82495XP and 82490XP (pins Q4, 24) Con- 
figuration Signal 
Synchronous to CLK 


7.34.1 
SIGNAL 
DESCRIPTION 


MEMLDRV is a pin on both the 82495XP and 
82490XP that, when high during reset, select normal 
driving memory output buffers. If this pin is driven 
low at reset, the high capacitance drivers are select- 
ed. Specifically, these are the 82495XP address out- 
puts to the memory bus, and the 82490XP MDATA 
outputs. The normal output drivers are designed to 
drive up to 50 pF loads. The high capacitance driv- 
ers can drive up to 100 pF without derating. 


7.34.2 
WHEN 
SAMPLED 


MEMLDRV is sampled like figure 7-1 with a setup 
time of 4 CPU clocks for the 82495XP and 1 CPU 
clock for the 82490XP. On the 82495XP, MEMLDRV 
becomes the SYNC# input once FSIOUT# goes 
inactive. On the 82490XP, MEMLDRV becomes the 
MFRZ# signal which is sampled after the first mem- 
ory cycle begins. 


7.34.3 
RELATION 
TO OTHER 
SIGNALS 


MEMLDRV shares a pin with 
SYNC# 
on the 
82495XP and MFRZ# on the 82490XP. 


7.35 MEOC# 


Memory End of Cycle 
Ends a cycle in 82490XP by switching buffers 
Input to 82490XP (pin 23) Cycle Control Signal 
Synchronous to MCLK or Asynchronous (strobed 
mode) 


7.35.1 
SIGNAL 
DESCRIPTIONS 


MEOC# is an input to the 82490XP that ends the 
current cycle and switches memory buffers for new 
cycle. Switching to the next cycle does not cause 
information to be lost in the memory or CPU buffers 
in the 82490XP, but rather switches new buffers to 
the memory I/O bus of the 82490XP. 


MEOC# is provided so that the memory system, 
which is synchronous to MCLK, can switch to a new 
cycle without synchronization. In clocked memory 
bus mode MEOC# is sampled with the rising edge 
of MCLK. In strobed memory bus mode the MEOC# 
function is performed with rising or falling edges of 
MEOC#. 
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For read or write cycles, MEOC# may be activated 
on or after the clock edge of the last MBRDY# of 
the current cycle. If a cycle is pending (pipelining is 
used), the next cycle will flow-through with a propa- 
gation delay from MEOC# assertion. MEOC# is re- 
quired for all memory bus cycles. 


In addition to switching memory buffers, MEOC# 
does three other things. One, MEOC# activation 
causes the memory burst counter to be reset to its 
start value and if MSEL# is active, MZBT# is sam- 
pled. This allows MSEL# to stay active between cy- 
cles. Two, MEOC# activation during a write cycle 
causes MFRZ# to be sampled for the a subsequent 
allocation (line-fill). Three, MEOC# latches in the 
last slice of data (like MBRDY#) before switching 
buffers. 


7.35.2 
WHEN 
SAMPLED 


In clocked memory bus mode, MEOC# is sampled 
on every MCLK edge. It must always observe setup 
and hold times to MCLK. In strobed memory bus 
mode, MEOC# is always sampled and must meet 
proper active/inactive times. 


7.35.3 
RELATION 
TO OTHER 
SIGNALS 


MEOC# is provided so that a cycle may end on the 
memory bus before CRDY# can be asserted. The 
implication rules surrounding MEOC# are: 
1. MEOC# 
:<:; CRDY# 


2. MEOC# for cycle N+ 1 ~ 2 clocks after CRDY# 
of cycle N 
3. MEOC# for cycle N+ 1 ~ 2 clocks after last 
BRDY# of cycle N 
4. MEOC# ~ BGT# 


MEOC# active with MSEL# active causes the sam- 
pling of MZBT# and MFRZ#. 


7.36 MFRZ# 


Memory Data Freeze 
Freezes Memory Write Data in 82490XP Buffer 
Input to 82490XP (pin 24) Cycle Control Signal 
Synchronous to MCLK or Strobed 


7.36.1 
SIGNAL 
DESCRIPTION 


MFRZ# is an input to the 82490XP that when active 
causes the 82490XP to "freeze" write data in the 
82490XP memory buffer and allow a subsequent al- 
location to fill a cache line around it. MFRZ# is pro- 
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vided so that an actual write to memory need not be 
done to perform an allocation. Using MFRZ 11 to per- 
form this dummy write cycle requires that the memo- 
ry bus controller put the allocated line into the "M" 
state. 


PAllCII 
must be active and MKENII must be re- 
turned active for the write cycle to be turned into an 
allocation. MFRZII is sampled when MEOCII goes 
active at the end of the write cycle. The subsequent 
line fill is then filled around the write data to com- 
plete the allocation. 


7.36.2 
WHEN 
SAMPLED 


In clocked memory bus mode, MFRZ 11 is sampled 
with the MClK rising edge that MEOCII is sampled 
active for all CPU write cycles. MFRZ 11 need only 
follow a proper setup and hold time in this situation. 


" 
In strobed mode, MFRZII is sampled with the falling 
edge of MEOCII for write cycles. MFRZ 11 need only 
follow a proper setup and hold time in this situation. 


7.36.3 
RELATION 
TO OTHER 
SIGNALS 


MFRZII is sampled with the MEOCII going active or 
being active for write cycles. MFRZ 11 is used so that 
a dummy write cycle can be performed. If an alloca- 
tion is done, DRCTM11 must be asserted during the 
SWENDII window of the line fill to put the allocated 
line in the "M" state. 


MFRZII shares a pin with the MEMlDRV configura- 
tion input. 


7.37 MHITM# 


Memory Sus Hit [M] 
Indicates snoop hit to modified line 
Output from 82495XP (pin H4) Snooping Signal 
Sync to ClK 


7.37.1 
SIGNAL 
DESCRIPTION 


The MHITM11 output is driven by the 82495XP dur- 
ing a snoop cycle to indicate that the snooping ad- 
dress has hit a Modified line. If the signal is logic 
high, the snoop has not hit a modified line; if the 
signal is logic low, the snoop has hit a modified line. 
When a snoop hits a modified line, the 82495XP au- 
tomatically schedules a write-back of the hit modi- 
fied line to the memory bus. 
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When the device which controls the memory bus 
(the master) performs a memory access, a snoop is 
requested of all other caching devices on the bus 
(snoopers). An asserted MHITMII pin from any of 
the snooper 82495XPs alerts the master that main 
memory's data is stale, and that the bus must be 
temporarily given to the snooper which has its 
MHITM11 asserted so that the modified line can be 
written out to the memory bus. 


7.37.2 
WHEN 
DRIVEN 


The snoop lookup is performed in the clock in which 
SNPCYCII is asserted. The MHITMII result for the 
snoop is driven on the ClK following SNPCYC11 , 
and 
remains valid 
until 
the 
next 
assertion 
of 
SNPSTSII. The MHITMII signal is not valid from 
SNPSTSII until the ClK after SNPCYCII. 


7.37.3 
RELATION 
TO OTHER 
SIGNALS 


MHITMII and MTHITII outputs together indicate the 
results of a snoop lookup in the 82495XP. 


A 82495XP can accept a snoop request while per- 
forming memory bus transfers of its own. If a snoop 
is requested of a 82495XP while it is performing a 
data transfer of its own, the results of the snoop may 
be delayed. If SNPSTS11 is sampled at a 82495XP 
after it has received SGT11 for its own cycle, the 
snoop lookup is performed (SNPCYCII active) after 
the SWENDII of its own cycle, and MHITMII is driv- 
en with valid results one ClK after SNPCYCII (see 
Sections 6.2.4 and 6.2.5). 


7.38 MISTB 


Memory Sus Input Strobe 
Strobes data into the 82490XP 
Input to 82490XP (pin 22) Cycle Control Signal 
Asynchronous 


7.38.1 
SIGNAL 
DESCRIPTION 


MISTS is an input to the 82490XP that, on rising or 
falling edges, causes the 82490XP to latch its MDA- 
TA inputs. MISTS is used in strobed memory bus 
mode. In clocked memory bus mode, MISTS is the 
MSRDY11 input. 


7.38.2 
WHEN 
SAMPLED 


MISTS is always sampled by the 82490XP. MISTS 
must meet proper strobed mode active and inactive 
times. 


7.38.3 
RELATION 
TO OTHER 
SIGNALS 


MISTS causes the latching of the 82490XP MDATA 
inputs in strobed mode. MISTS shares a pin with 
MSRDY#. 


7.39 
MKEN# 


Memory Cache Enable 
Determines 82495XP and CPU cacheability 
Input to 82495XP (pin R1) Cycle Attribute Signal 


Synchronous to CLK 


7.39.1 
SIGNAL 
DESCRIPTION 


MKEN# is an input to the 82495XP that is sampled 
at the closing of the cacheability window (KWEND# 
is sampled active). The 82495XP drives KEN# back 
to the CPU one clock after sampling the value of 
MKEN#. MKEN# thus determines whether the cur- 
rent cycle is cacheable in the 82495XP and in the 
CPU. 


For read cycles, if MCACHE# is active (cacheable), 
KEN# is driven out of the 82495XP to the CPU to 
indicate cacheability. If MKEN# is sampled inactive 
during KWEND# activation, KEN# is brought inac- 
tive by the 82495XP, and the line will not be cache- 
able by the CPU or 82495XP. If MCACHE# is inac- 
tive, the line will be non-cacheable regardless of 
MKEN#. PCD active will cause MCACHE# to be 
inactive. 


MKEN# is sampled during write-through cycles that 
are potentially allocatable (PALLC# is active during 
the write cycle). If MKEN# is sampled active during 
KWEND# activation of the write cycle, an allocation 
will occur, and a line-fill will follow the write cycle. 
MKEN# during the line-fill is ignored. The MSC indi- 
cates to the 82495XP that it intends to perform an 
allocation by asserting MKEN#. 


MKEN# must be sampled 1 clock before the first 
SRDY# assertion to make a line-fill non-cacheable 
to the CPU. 


7.39.2 
WHEN 
SAMPLED 


MKEN# 
is 
sampled 
on 
the 
clock 
edge 
that 
KWEND# is first sampled active. In all other places 
MKEN# may violate setup and hold times. 
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7.39.3 
RELATION 
TO OTHER 
SIGNALS 


MKEN# and MRO# are sampled with KWEND# 
active. MKEN# must be sampled at least 2 clocks 
before SRDY# assertion to make a line-fill non- 
cacheable. 


7.40 
MOCLK 


Memory Data Output Clock 


Separate Clock Reference for Memory Data Output 
Input to 82490XP (pin 27) 
Asynchronous 


7.40.1 
SIGNAL 
DESCRIPTION 


MOCLK is the latch enable for the 82490XP memory 
data outputs (MDATA). MOCLK controls the latching 
of a transparent latch which, when high, causes 
MDATA to be driven from MCLK. When low, MDATA 
is latched. MOCLK may only be used in clocked 
memory bus mode and only affects output data. It is 
provided so that a greater MDATA output hold time 
can be generated. 


To be used effectively, MOCLK must be a clock in- 
put that is skewed from MCLK. The following picture 
shows how MOCLK has increased the hold time of 
the output burst data: 


WCLK 
~ 
I 
I 


WOCLK \:/\:/\ 
'-+-' 
'-+-' 
, 


WDATA 
i':~( f x:: 
~~~:------~------ 


WBRDY* 


240956-32 


7.40.2 
WHEN 
SAMPLED 


MOCLK is sampled during and after RESET to de- 
termine whether output data should be driven from 
MCLK or MOCLK. If toggling, MOCLK controls the 
MDATA outputs with MCLK. If high, data is driven 
from MCLK alone. Regardless, input data is never 
referenced to MOCLK. 


In strobed memory bus mode the MOCLK signal be- 
comes MOSTS. MOCLK is only used in clocked 
memory bus mode. 
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7.40.3 
RELATION 
TO OTHER 
SIGNALS 


To be used effectively, MOCLK must be the same 
frequency as MCLK but be skewed. This effectively 
increases MDATA hold time to main memory. Main 
memory must sample the data on MCLK edges. 


MOCLK shares a pin with the MOSTS signal. 


7.41 MOSTB 


Memory Sus Output Strobe 


Strobes data out of 82490XP 
Input to 82490XP (pin 27) Cycle Control Signal 
Asynchronous 


7.41.1 
SIGNAL 
DESCRIPTION 


MOSTS is an input to the 82490XP that, on rising 
and falling edges, causes the 82490XP to output 
data through its MDATA outputs. MOSTS is only 
used in strobed memory bus mode. In clocked mem- 
ory bus mode, MOSTS is the MOCLK input. 


7.41.2 
WHEN 
SAMPLED 


MOSTS is always sampled by the 82490XP. MOSTS 
must meet strobed mode active and inactive times. 


7.41.3 
RELATION 
TO OTHER 
SIGNALS 


MOSTS strobes data out of the 82490XP through 
MDATA. MOSTS shares a pin with MOCLK. 


7.42 MRO# 


Memory Read-Only 
Designates current line as read-only 
Input to 82495XP (pin J1) Cycle Attribute Signal 
Synchronous to CLK 


7.42.1 
SIGNAL 
DESCRIPTION 


MRO# is an input to the 82495XP that is sampled at 
the closing of the cacheability window (KWEND# 
activation). If sampled active, it causes the current 
line fill to the 82495XP to be put in the read-only 
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state, and causes the line to be non-cacheable to 
the CPU. Writes to read-only lines in the 82495XP 
are treated as write-misses that are non-allocatable 
(PALLC# 
is inactive). MRO# 
is a bit in each 
82495XP tag entry. 


Once MRO# is sampled active during KWEND# ac- 
tivation, KEN# to the CPU is driven inactive regard- 
less of the state of MKEN#. MKEN# does, howev- 
er, determine whether the 82495XP will cache the 
read-only line. Once MRO# is returned active, the 
CPUwill only require the number of transfers as indi- 
cated by LEN and CACHE#. If MKEN# is returned 
active, the 82495XP will require an entire cache line. 
82495XP read-only cache lines are filled to the [SI 
state. 


The line-fill portion of an allocation may be filled to 
the read-only state by returning MRO# active during 
KWEND# of the line-fill. MRO# is ignored during 
the write portion. 


If MRO# 
is returned 
active 
during 
KWEND#, 


DRCTM# 
and 
MWS/WT # 
are 
ignored 
during 
SWEND#. 


MRO# must be returned to the 82495XP at least 2 
clocks before SRDY# is returned to the CPU so 
KEN# can be sampled properly. 


There is one Read-Only bit per tag in the 82495XP. 


7.42.2 
WHEN 
SAMPLED 


MRO# is sampled on the first clock that KWEND# 
is sampled active. In all other clocks, MRO# need 
not follow setup and hold times. 


7.42.3 
RELATION 
TO OTHER 
SIGNALS 


MRO# and MKEN# are sampled with KWEND# 
activation. MRO# must be returned at least 2 clocks 
prior to the first SRDY#. 


7.43 MSEL# 


Memory Suffer Chip Select 
Selects 82490XP, Causes Sampling of MZST# 
Input to 82490XP (pin 25) Cycle Control Signal 
Synchronous to MCLK or Strobed 


intel~ 


7.43.1 
SIGNAL 
DESCRIPTION 


MSEl# 
is an input to the 82490XP that has 3 main 
functions. 
One, 
MSEl# 
active 
qualifies 
the 
MSRDY# input to the 82490XP. If MSEl# 
is inac- 
tive for a particular 82490XP, MSRDY# will not be 
recognized by that 82490XP. 


Two, MSEl# 
going active causes the sampling of 
MZST# for the next transfer. 


Three, MSEl # going inactive resets the 82490XP 
internal memory burst counter. The 82490XP con- 
tains a memory burst counter that counts through 
the CPU burst order with each MSRDY# assertion 
and increments a pointer to the 82490XP memory 
buffer being accessed. 


MSEl # going inactive will reset this burst counter to 
its original burst value. Sy resetting this counter be- 
fore MEOC# assertion, all information currently be- 
ing read into the 82490XP is lost, but information 
that is being written out is maintained and may be 
rewritten. 


In general, MSEl# 
may stay inactive for single 
transfer cycles such as posted 64-bit write cycles. 
Once active, MSEl# 
need not go inactive as the 
burst counter is reset with MEOC# activation. Since 
MZST# 
may also be' sampled with MEOC#, it is 
possible to leave MSEl# 
asserted throughout most 
basic transfers. 


MSEl# 
or MEOC# must be used to reset the burst 
counter before any transfer begins. If transfers are 
interrupted (by a snoop hit before SGT# assertion 
for example), MSEl# 
must be brought inactive so 
the burst counter may be reset for the snoop write 
back. 


MSEl # must be sampled inactive for at least 1 
MClK 
after reset. This resets the memory burst" 


counter for the first transfer. 


7.43.2 
WHEN 
SAMPLED 


In clocked memory bus mode, MSEl# 
is sampled 
with all rising edges of MClK. 
In this mode, if 
MSEl # 
is sampled inactive, the memory burst 
counter is reset and MZST# is sampled. If MSEl # 
is sampled active and MSRDY# is sampled active, 
the memory burst counter is incremented. Since it is 
constantly sampled with MClK, 
MSEl # must al- 
ways be driven to a known state and must always 
meet setup and hold times to every MClK edge. 


82495XP 
Cache Controller/82490XP 
Cache RAM 


In strobed mode, MSEl# 
falling edge causes the 
sampling of MZST#. While MSEl# 
is active, MISTS 
and MOSTS cause the memory burst counter to be 
incremented. The rising edge of MSEl# 
causes the 
memory burst counter to be reset. 


MSEl# 
must be inactive sometime after RESET be- 
fore the first transfer to initialize the burst counter. 


7.43.3 
RELATION 
TO OTHER 
SIGNALS 


MSEl# 
causes the sampling of MZST#, and quali- 
fies the use of MSRDY#, 
MOSTS, and MISTS. 
Since MSEl # acts as a qualifier for these signals, 
MSEl # 
may be asserted at the same time as 
MSRDY#, MOSTS, or MISTS. 


7.44 
MTHIT# 


Memory Sus Tag Hit 
Indicates snoop hit 
Output from 82495XP (pin G3) Snooping Signal 
Sync to ClK 


7.44.1 
SIGNAL 
DESCRIPTION 


The MTHIT# output is asserted by the 82495XP 
during snoop cycles to indicate that the snoop ad- 
dress has hit a line in the 82495XP cache. An as- 
serted MTHIT# signal from any of the snooping 
82495XP's alerts a bus master that the data being 
accessed resides in another cache. If SNPINV was 
not asserted on the snoop request, the copy of the 
data in a 82495XP asserting MTHIT# will remain 
valid and in the Shared state-so 
a caching master 
must also place his copy of the data in the Shared 
state. 


7.44.2 
WHEN 
DRIVEN 


The snoop lookup is performed in the ClK in which 
SNPCYC# is asserted. The MTHIT# result for the 
snoop is driven on the next ClK and remains valid 
until the next assertion of SNPSTS#. The MTHIT# 
signal is not valid from SNPSTS# until the ClK after 
SNPCYC#. 


7.44.3 
RELATION 
TO OTHER 
SIGNALS 


MTHIT# and MHITM# together indicate the results 
of a snoop lookup in the 82495XP. 
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An 82495XP can accept a snoop request while per- 
forming memory bus transfers of its own. If a snoop 
is requested while it is performing a transfer of its 
own, the results of the snoop may be delayed. If 
SNPSTB# is sampled at a 82495XP after it has re- 
ceived BGT# for its own cycle, the snoop lookup is 
performed (SNPCYC# active) after the SWEND# of 
its own cycle, and MTHIT# is driven with the valid 
result one ClK after SNPCYC# (see Sections 6.2.4 
and 6.2.5). 


Because an asserted MTHIT# from any snooping 
82495XP requires the master to place the fetched 
line in the Shared state (unless it is an invalidating 
snoop), the memory bus controller should include 
the MTHIT# signals of other processors when gen- 
erating the MWB/WT# 
signal to its own 82495XP. 


7.45 MWB/WT# 


Memory Write-back/Write-through 
Forces lines to be filled to the [S] state 
Input to 82495XP (pin K3) Cycle Attribute Signal 
Synchronous to ClK 


7.45.1 SIGNAL DESCRIPTION 


MWB/WT # is an input to the 82495XP that is sam- 
pled at the closing of the snoop window (SWEND# 
activation). If sampled active, the current line-fill is 
filled to the [S] state in the 82495XP. The [S] state 
is a write-through state in the 82495XP. 


MWB/WT # is used in many cases. If a cache to 
cache transfer updates memory and leaves the data 
valid in the other cache, the line must be filled to the 
[S] state instead of the [E] state default. A portion of 
memory may be designated as write-through by as- 
serting MWB/WT # for appropriate addresses. 


MWB/WT # 
has no effect 
on the 
82495XP if 
DRCTM# is sampled active or MRO# 
has been 
sampled active during KWEND#. If PWT is active, 
MWB/WT # has no effect and the line is filled to the 
[S] state. 


7.45.2 WHEN SAMPLED 


MWB/WT # is sampled on the first clock edge that 
SWEND# is sampled active. If MWB/WT# 
is not 
being sampled, it need not follow setup and hold 
times. 
' 


7.45.3 RELATION TO OTHER SIGNALS 


Both MWB/WT# 
and DRCTM# are sampled with 
SWEND#. 
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7.46 MX4/MX8# 
MTR4/MTR8# 


Memory 4/8 I/O bits 
Memory 4/8 Transfers 
Selects MDATA Input/Output width and number of 
memory bus transfers 
Inputs to 82490XP (pins 21, 25) Configuration Sig- 
nals 
Synchronous to ClK 


7.46.1 SIGNAL DESCRIPTION 


MX4/MX8# 
configures the 82490XP to use MDA- 


TA[O:3] or MDATA[O:7] memory bus 
1/0 
pins. 


MTR4/MTR8# selects whether the a cache line will 
take 4 or 8 transfers. These selections depend on 
the line ratio (82495XP line size I CPU line size) and 
must be configured according to the following table: 


Line 
MX41 
MTR41 
Membus 
CPUbus 
Ratio 
MX8# 
MTR8 # 
1/0 Pins 
1/0 Pins 


1 
1 
1 
4 
4 


2 
1 
0 
4 
4 


2 
0 
1 
8 
4 


4 
0 
0 
8 
4 


1 
0 
1 
8 
8 


2 
0 
0 
8 
8 


7.46.2 WHEN SAMPLED 


These signals are sampled like Figure 7-1 with a set- 
up time of 1 clock. Once the first CADS# is issued 
by the 82495XP these signals are sampled for the 
MZBT# and MSEl # functions. 


7.46.3 RELATION TO OTHER SIGNALS 


MX4/MX8 # shares a pin with MZBT# and MTR41 
MTR8# shares a pin with MSEl # . 


7.47 MZBT# 


Memory Zero Base Transfer 
Forces cycles to begin at subline address 0 
Input to 82490XP (pin 21) Cycle Control Signal 
Synchronous to MClK or Strobed 


intel~ 


7.47.1 SIGNAL DESCRIPTION 


MZST# is an input to the 82490XP that forces a 
read or write cycle to begin with burst address 0 
regardless of the CPU generated address. 


MZST# 
is sampled before the transfer begins. 
MZST# 
is sampled with MSEL# 
and MEOC#. 


MZST# is sampled with MSEL# going active for the 
current cycle. If MSEL# stays active between cy- 
cles, MZST# is sampled with MEOC# going active 
for the previous cycle. 


Once sampled, data input to the 82490XP's will start 
at burst address 0 and continue through 4, 8, C, etc. 
If the CPU is requesting a burst location other than 
0, the memory bus controller must hold off any 
SRDY# until that bursted item is read from the 
memory bus. 


7.47.2 WHEN SAMPLED 


In clocked mode, MZST# is sampled in two loca- 
tions. First, MZST# is sampled on all MCLK rising 
edges where MSEL# 
is sampled inactive. Once 
MSEL# is sampled active, the value of MZST# that 
was sampled one MCLK before is used for the next 
transfer. 


Second, MZST# is sampled on MCLK rising edges 
where MEOC# is sampled active with MSEL# ac- 
tive. The MZST# value sampled will be used for the 
next transfer. This allows MSEL# to stay asserted 
between transfers if so desired. 


In strobed mode, MZST# is sampled with the same 
two signals. First, it is sampled with the falling edge 
of MSEL#. Second, it is sampled with the falling 
edge of MEOC# if MSEL# is active. 


In clocked memory bus mode MZST# must follow 
setup and hold times to all MCLK edges where 
MSEL# is sampled inactive or MEOC# is sampled 
active with MSEL# active. 


In strobed memory bus mode MZST# must meet 
setup and hold times to MSEL# falling edge and 
MEOC# falling edge if MSEL# is active. 
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7.47.3 RELATION TO OTHER SIGNALS 


MZST# is sampled with MSEL# and MEOC# and 
has no effect otherwise. In systems that will never 
force a zero-based transfer, MZST# may be driven 
high after RESET. 


MZST# shares a pin with the MX4/MX8# 
confiqu- 
ration input. 


7.48 
NCPFLD 11 


Non-Cacheable PFLD 
Enables Non-Cacheable Floating Point Loads 
Input to 82495XP (N4) Configuration Signal 
Asychronous 


7.48.1 SIGNAL DESCRIPTION 


During RESET, this pin functions as the NCPLFD# 
configuration signal. The 82495XP can be config- 
ured to decode i860 XP CPU PFLD (Pipelined Float- 
ing Point Load) cycles. The 82495XP supports 3 op- 
erational modes for PFLD cycle decoding as defined 
by FPFLDEN and NCPFLD#: 
Mode # 1. PFLD cycles that are cached in the 
82495XP. 


Mode #2. PFLD cycles not cached in the 82495XP, 
without 
an 
external 
PFLD 
extension 
FIFO. 
Mode #3. PFLD cycles not cached in the 82495XP, 
with an external PFLD extension FIFO. 


Mode # 
FPFLDEN 
NCPFLD# 


1 
0 
1 


2 
0 
0 


3 
1 
1 


Illegal Mode 
1 
0 


See Section 5.2.5 for details. 
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7.49 NENE# 


Next Near 
Indicates current cycle address is near previous one. 
Output from 82495XP (pin 05) Cycle Control Signal 


Synchronous to ClK 


7.49.1 SIGNAL DESCRIPTION 


NENE# indicates to the MBC that the address of 
the requested memory cycle is "near" the address 
of the previously generated one (in the same 2K 
DRAM page). This information may be used by the 
MBC to optimize access to paged or static column 
DRAMs. 


7.49.2 WHEN DRIVEN 


NENE# is valid together with CADS# and will stay 
valid until CNA# or CRDY#. 


7.49.3 RELATION TO OTHER SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, 
MTAGO-MTAG11, 
MCFAO-MCFA6, 
CW/R#, 
CMIIO#, 
CD/C#, 
RDYSRC, MCACHE#, 
NENE#, SMlN#, 
KlOCK#, and CPlOCK#) will be 
valid with CADS#. 


NENE# may change state after CNA# or CRDY# 
are asserted to the 82495X~. 


7.50 PALLC# 


Potential Allocate 
Indicates 82495XP intent to allocate current cycle 
Output from 82495XP (pin 02) Cycle Control Signal 
Synchronous to ClK 
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Read-Only line. 


7.50.2 WHEN DRIVEN 


PAllC# 
is valid in the same ClK as CADS# and is 
valid until CRDY# or CNA#. 


7.50.3 RELATION TO OTHER SIGNALS 


PAllC# 
is valid with CADS#. 


7.51 PAR# 


Parity Selection 
Selects 82490XP as a Parity Device 
Input to 82490XP (pin32) Configuration Signal 
Synchronous to ClK 


7.51.1 SIGNAL DESCRIPTION 


PAR# is a strapping option on the 82490XP that, 
when strapped 'Iow, configures that 82490XP device 
to be a dedicated parity device. A 82490XP parity 
device must be configured the same as all the other 
devices, however, the data lines are defined differ- 
ently. CDATA[O:3] are 4 parity bit I/O lines and 
CDATA[4:7] are 4 bit select lines so each parity line 
may be written individually. Parity devices must be 
used as follows: 


Cache 
Memory 
Number 
82490XP 
Bus 
of Parity 
1/0 Bits 
Size 
Width 
Devices 
(CPU:Mem) 


256K 
64 
2 
4:4 


512K 
128 
2 
4:8 


7.51.2 
WHEN 
SAMPLED 


PAR# is a strappin~ option and must be tied either 
high or low. 


7.51.3 
RELATION 
TO OTHER 
SIGNALS 


PAR# affects the definition of the CDATA and MDA- 
TA lines of the 82490XP. 


7.52 
RDYSRC 


Ready Source 
Cycle control signal to the MBC 
Output from 82495XP (pin C1) Cycle Control Signal 
Synchronous to ClK 


7.52.1 
SIGNAL 
DESCRIPTION 


RDYSRC serves as a cycle control signal to the 
MBC. It indicates the source of the BRDY# genera- 
tion (either 82495XP or MBC) for the CPU. When 
high it indicates that the MBC should generate the 
BRDY#s to the CPU, when low it indicates that the 
82495XP will provide the BRDY#s. 


RDYSRC is asserted for line-fill and not asserted for 
the write portion of allocation cycles. RDYSRC is 
also asserted (high) for all I/O cycles. 


7.52.2 
WHEN 
DRIVEN 


RDYSRC is valid in the same ClK as CADS# and is 
valid until CRDY# or CNA#. 


7.52.3 
RELATION 
TO OTHER 
SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, 
MTAGO-MTAG11, 
MCFAO-MCFA6, 


CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, MCACHE#, 
NENE#, SMlN#, 
KlOCK#, and CPlOCK#) will be 
valid with CADS#. 


7.53 
RESET 


Reset 
Forces the 82495XP to begin execution in a known 
state 
Input to 82495XP (05) 
Asynchronous 
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7.53.1 
SIGNAL 
DESCRIPTION 


The falling edge of this signal tells the 82495XP to 
sample all configuration inputs and initializes the 
82495XP to a known state. See the specific configu- 
ration Signals for setup and hold times relative to 
RESET's falling edge. RESET can be asserted at 
any time. 


During initlialization, the 82495XP lRU bits are set 
to 1 indicating that the 82495XP lRU way is way 1. 
The 82490XP MRU bits are initlialized to 0 as are all 
tag array bits. 


RESET takes about 4100 clocks in the 82495XP. 
RESET with self-test takes about 80,000 clocks. 


7.53.2 
WHEN 
SAMPLED 


RESET is an asynchronous input. RESET must have 
a pulse width of at least 8 ClK's in order to guaran- 
tee 82495XP recognition. 


7.53.3 
RELATION 
TO OTHER 
SIGNALS 


The following signals are sampled at RESET: 


CNA#[CFGO): 
CFGO line of 82495XP 
configuration 
inputs 


SWEND# 
[CFG1): 
CFG1 line of 82495XP 
configuration 
inputs 


KWEND# 
[CFG2): 
CFG2 line of 82495XP 
configuration 
inputs 


FLUSH# 
[NCPFLD#): 
If low, enables 
decoding 
of 


i860XP non-cacheable 
PFLD 
mode. 


FPFLD# 
[FPFLDEN): 
If high, enables 
the extemal 
FIFO for i860XP PFLD mode. 


BGH 
[C490LDRV): 
Indicates 
the driving strength 
of 
the 82495XP/82490XP 
interface. 


SYNC# 
[MEMLDRV): 
Indicates 
the memory 
bus 
driving strength. 


SNPCLK# 
[SNPMD): 
Indicates 
the snooping 
mode; 
synchronous 
or strobed. 


CFG2-CFGO 
Configure 
cache 
parameters 
such as lines/sector, 
line ratio, 
and number 
of tags. 
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7.54 
SLFTST # 


Self Test 
Executes 82495XP self-test 
Input to 82495XP (pin M2) Test Signal 
Synchronous to ClK 


7.54.1 
SIGNAL 
DESCRIPTION 


If SlFTST# 
is sampled low and HIGHZ# is sam- 
pled high, the 82495XP will perform a self-test after 
reset. The results of the self-tests are given by CA- 
HOLD when FSIOUT# goes inactive. 


7.54.2 
WHEN 
SAMPLED 


SlFTST # is sampled with reset like figure 7-1 with a 
setup time of 10 CPU clocks. SlFTST # is then a 
"don't care" until after the first CADS# activation 
when it becomes the CRDY# pin. 


7.54.3 
RELATION 
TO OTHER 
SIGNALS 


SlFTST # shares a pin with CRDY#. The 82495XP 
enters self-test if both SlFTST # is sampled active 
and HIGHZ# is sampled inactive. 


7.55 
SMLN# 


Same Line 
Current cycle is same 82495XP line as previous one. 
Output from 82495XP (pin C6) Cycle Control Signal 
Synchronous to ClK 


7.55.1 
SIGNAL 
DESCRIPTION 


SMlN# 
is used to indicate to the MBC that the cur- 
rent cycle is accessing the same 82495XP cache 
line as the previous cycle. This indication can be 
used 
by 
the 
MBC 
to 
selectively 
activate .its 
SNPSTB# signal to other caches in the system. For 
example, back-to-back snoop hits to the same line 
may be snooped only once. 


7.55.2 
WHEN 
DRIVEN 


SMlN# 
is asserted with CADS# and will stay valid 
until CNA# or CRDY#. 
' 
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7.55.3 
RELATION 
TO OTHER 
SIGNALS 


Address and cycle specification signals (MSETO- 
MSET10, 
MTAGO-MTAG11, 
MCFAO-MCFA6, 
CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, MCACHE#, 
NENE#, SMLN #, KlOCK #, and CPlOCK #) will be 
valid with CADS#. 


7.56 
SNPADS# 


Cache Snoop Address Strobe 
Initiates a snoop write back cycle 
Output from 82495XP (pin F3) Snooping Signal 


Sync to ClK 


7.56.1 
SIGNAL 
DESCRIPTION 


The SNPADS# signal indicates valid cache control 
and 
attribute 
signals, 
functioning 
identically 
to 
CADS#, 
but is generated only on snoop write- 
backs. The separation of address status signals for 
normal and snoop write-back cycles eases memory 
bus controller implementation. When SNPADS# is 
activated, the memory bus controller should abort all 
pending cycles for which BGT# has not been is- 
sued. The 82495XP reissues these non-committed 
cycles after the snoop write-back has completed. 


7.56.2 
WHEN 
DRIVEN 


SNPADS# is produced when a snoop hits a modi- 
fied line. A modified line condition exists when a line 
in the cache has been updated, and copies of that 
memory location in other devices are no longer val- 
id. A snoop is initiated by the master of a shared bus 
when accessing a memory location on the shared 
bus. 


The response of the 82495XP to a snoop appears 
on the MTHIT# and MHITM# pins in the clock after 
SNPCYC# is active. If these pins are both driven 
low, the snoop resulted in a hit to a modified line, 
and a snoop write-back is initiated with the assertion 
of SNPADS#. SNPADS# is driven, at earliest, two 
clocks after SNPCYC#. Like CADS# , SNPADS# is 
active for one ClK, and is always valid. 


7.56.3 
RELATION 
TO OTHER 
SIGNALS 


Cycles initiated by SNPADS# require only CRDY#; 
they do not require the other cycle progress signals 
(BGT#, KWEND#, SWEND#). 


The SNPADS# signal is driven by the 82495XP to 
indicate the 
start 
of the 
write-back 
cycle; the 
82495XP drives the following address and cycle 
specification signals valid with SNPADS#: CW/R#, 
CD/C#, 
CM/IO#, 
MCACHE#, RDYSRC, NENE#, 
SMlN#, 
and 
the 
address 
on 
MSET[O:101, 


MTAG[O:111, and MCFA[O:6]. Upon assertion of 
SNPADS#, the memory bus controller should can- 
cel all pending cycles for which BGT# has not yet 
been asserted, because they will be reissued after 
the snoop write-back. The 82495XP will ignore 
BGT# while SNPBSY# and MHITM# are active (ie, 
during the write-back). 


The 82495XP can accept a snoop request while per- 
forming memory bus transfers of its own. If a snoop 
is requested while it is performing a transfer of its 
own, the results of the snoop and any necessary 
snoop write-backs may be delayed. If SNPSTB# is 
sampled at a 82495XP after it has received BGT# 
for its own cycle, and the snoop hits a modified line, 
the snoop write-back will occur after CRDY# for the 
82495XP's own cycle. See Sections 6.2.4 and 6.2.5 
for details. 


7.57 
SNPBSY# 


Snoop Busy 
Indicates additional snoop processing in progress 
Output from 82495XP (pin F1) Snooping Signal 
Sync to ClK 


7_57.1 
SIGNAL 
DESCRIPTION 


SNPBSY# and SNPCYC# indicate a snoop in prog- 
ress. The SNPCYC# signal is asserted on the actual 
snoop look-up to the 82495XP tags. If the snoop 
look-up indicates a valid line is hit and the snoop is 
invalidating, the 82495XP must perform a back inval- 
idation on the CPU. If a snoop hit occurs to a modi- 
fied line, a snoop write-back must occur. SNPBSY# 
is asserted and remains active while either a back 
invalidation or a snoop write-back is in progress. 


7.57.2 
WHEN 
DRIVEN 


SNPBSY# 
is activated for two conditions. First, 
SNPBSY# is activated whenever a back invalidation 
is necessary: the snoop returns MTHIT# active and 
SNPINV was asserted on the snoop initiation. Sec- 
ond, SNPBSY# is activated when a modified cache 
line is hit on a snoop, as indicated by MHITM#, until 
the modified line has been written back (CRDY# re- 
turned for the write-back). 


SNPBSY# is valid in the ClK following SNPCYC#, 
and if active, remains active for a minimum of two 
ClKS. 
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7.57.3 
RELATION 
TO OTHER 
SIGNALS 


After SNPCYC# occurs for a snoop, a new snoop 
may be initiated. If SNPBSY# is asserted for the 
initial snoop, the SNPCYC# of the second snoop is 
delayed until the SNPBSY# signal is deasserted for 
the initial snoop, indicating that its snoop processing 
has completed. 


7.58 
SNPCLK [SNPMD] 


Snoop Clock [Snooping Mode] 


Selects 82495XP snooping mode 
Input to 82495XP (pin S3) Snooping Signal 
SynChronousto ClK 


7.58.1 
SIGNAL 
DESCRIPTION 


SNPMD selects whether the 82495XP snoop initia- 
tion be in synchronous, clocked, or strobed mode. 
82495XP snoop response is always synchronous to 
ClK. 


Synchronous mode (to ClK) is selected by SNPMD 
sampled low during reset. Strobed mode is selcted 
by SNPMD sampled high during reset. Clocked 
mode is selected by connecting the snoop clock 
source to SNPMD, and thus SNPMD becomes the 
actual snoop clock (SNPClK). 


7.58.2 
WHEN 
SAMPLED 


SNPMD is sampled like figure 7-1 with a setup time 
of 4 CPU clocks. SNPMD is then not used unless 
clocked mode is being selected. If clocked mode is 
selected, SNPMD becomes SNPClK 
to clock in 
snoop requests. 


7.58.3 
RELATION 
TO OTHER 
SIGNALS 


SNPMD becomes SNPClK if a clock signal is de- 
tected at reset. In this clocked mode, SNPClK is 
then used to clock-in SNPSTB#, 
the snoop ad- 
dress, and all snoop attributes. 


7.59 SNPCYC# 


Snoop Cycle 
Indicates snoop look-up occurring in 82495XP tags 
Output from 82495XP (pin H3) Snooping Signal 
Sync to ClK 


2-349 


82495XP 
Cache Controller/82490XP 
Cache RAM 


7.59.1 
SIGNAL 
DESCRIPTION 


SNPCYC# is asserted by the 82495XP during the 
clock when the actual tag look-up for the snoop is 
performed. SNPCYC# may appear as early as the 
ClK following SNPSTB# assertion, or may be de- 
layed several clocks while a snoop write-back or 
82495XP memory bus cycle take place. 


7.59.2 
WHEN 
DRIVEN 


SNPCYC# is always a valid 82495XP output. It is 
asserted once, for a single clock, for every snoop 
which is initiated in the 82495XP. 


7.59.3 
RELATION 
TO OTHER 
SIGNALS 


A snoop is initiated by assertion of the SNPSTB# 
input if MAOE# is not asserted. The actual snoop, 
signalled by the assertion of SNPCYC#, can be de- 
layed by a prior snoop's write-back in progress 
(SNPBSY# asserted) or by a 82495XP memory cy- 
cle in progress (SNPSTB# occurs after BGT#)- 
see SNPSTB# for details. If neither of these is oc- 
curring, strobed and clocked snooping modes can 
also delay snoop look-up for a clock while the snoop 
address and attributes are synchronized. 


In the clock following SNPCYC#, 
MHITM# 
and 
MTHIT# report valid snoop results. 


7.60 
SNPINV 


Snoop Invalidation 
Forces invalidation of snoop hits 
Input to 82495XP (pin P5) Snooping Signal 
Sampled with SNPSTB# (see SNPSTB#) 


7.60.1 
SIGNAL 
DESCRIPTION 


Assertion of the SNPINV signal during the initiation 
of a snoop request forces a snoop hit for that re- 
quest into the Invalid state. 


The SNPINV pin is sampled upon initiation of a 
snoop request with SNPSTB# activation, depending 
on snooping mode: rising edge of first ClK when 
SNPSTB is asserted (synchronous snooping mode), 
or rising edge of first SNPClK when SNPSTB# is 
asserted (clocked mode), or falling edge of strobed 
SNPSTB# (strobed mode). 
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7.60.2 
WHEN 
SAMPLED 


When a bus master performs a bus access, the 
SNPSTB# of all other 82495XPs is asserted to initi- 
ate a snoop for that address. If the master's access 
is one which is modifying the data (a write to memo- 
ry, etc.), the SNPINV pin of all snooping 82495XPs 
must be asserted during SNPSTB# so that the line 
is properly marked Invalid. 
' 


SNPINV is not asserted during SNPSTB# assertion 
if snoop hits are to remain valid: the master issuing 
the snoop does not require their invalidation (a 
read). 


SNPINV assertion forces all snoop hits to be invali- 
dated, overriding other 
inputs 
or 
attributes 
(ie 
SNPNCA). When SNPINV is not asserted, cache 
states change according to normal protocol. 


SNPINV is only sampled with SNPSTB#, which may 
be qualified by ClK or SNPClK depending on the 
snooping mode, and must meet setup and hold 
times for the edge of its sampling. When SNPSTB# 
is not being asserted, SNPINV is a don't care and 
need not follow setup and hold times. 


7.60.3 
RELATION 
TO OTHER 
SIGNALS 


SNPINV is sampled according to SNPSTB#, which 
may be qualified by SNPClK or ClK, depending on 
the snooping mode. SNPINV overrides the SNPNCA 
input, which may also be asserted with SNPSTB#. If 
MAOE# 
is active with SNPSTB# 
sampling, the 
snoop request is ignored. 


7.61 
SNPNCA 


Snoop Non Caching device Access 
Indicates to snooping 82495XP that the initiating 
master is a non- caching device 
Input to 82495XP (pin 03) Snooping Signal 
Sampled with SNPSTB# (see SNPSTB#) 


7.61.1 
SIGNAL 
DESCRIPTION 


SNPNCA indicates that the master which is initiating 
the snoop request will not cache the data. If the 
SNPNCA pin is not asserted and the snoop is nonin- 
validating (where noninvalidating = SNPINV not as- 
serted), a snoop hit line must be placed in the 
Shared state, since the data will exist in another 


cache. If SNPNCA is asserted and the snoop is non- 
invalidating, a snoop hit line will not be entered into a 
new cache, so a hit Exclusive or Modified line will be 
placed in the Exclusive state by the 82495XP. A 
noninvalidating snoop hit to a Shared line must keep 
the hit line in the Shared state, regardless of 
SNPNCA. 


SNPNCA is sampled upon initiation of a snoop re- 
quest with SNPSTB# activation, depending on the 
snooping mode: rising edge of first ClK 
when 
SNPSTB# asserted (synchronous snooping mode), 
or the rising edge of SNPClK when SNPSTB# is 
asserted (clocked snooping mode), or the falling 
edge of SNPSTB# (strobed snooping mode). 


7.61.2 
WHEN 
SAMPLED 


To achieve maximum processor performance and 
minimum bus traffic, SNPNCA should be asserted 
when the noninvalidating snoop is caused by an ac- 
cess from a non-caching device like a DMA. 


If the snoop is being caused by a device which will 
also be caching the data, SNPNCA must not be as- 
serted, so that the 82495XP does not leave the hit 
line in an Exclusive state-subsequent 
writes to 
lines in this state do not appear on the bus, and stale 
data would result in the cache which incorrectly as- 
serted SNPNCA. 


If SNPNCA is asserted on a noninvalidating snoop 
request, the following outlines the behavior of the 
cache for a snoop hit in each of the MESI states: 
Modified 
The data is written to the bus, and the 
line is placed in the Exclusive state 
The line remains in the Exclusive state 
The line remains in the Shared state 
This is a cache miss. The line remains 
Invalid. 


Exclusive 
Shared 
Invalid 


If SNPNCA is NOT asserted on a noninvalidating 
snoop request, an M, E, or S state hit line will be 
placed in the Shared state. Again, M state causes a 
write to the bus, Invalid lines remain Invalid. 


SNPNCA is only sampled with SNPSTB#, which 
may be qualified by ClK or SNPClK depending on 
the snooping mode, and must meet setup and hold 
times 
for 
the 
edge 
of 
this 
sampling. 
When 
SNPSTB# is not being sampled, SNPNCA is a don't 
care and need not follow set-up and hold times. 


7.61.3 
RELATION 
TO OTHER 
SIGNALS 


SNPNCA ls sampled with SNPSTB#, which may be 
qualified by SNPClK or ClK, depending on snoop- 
ing mode. The assertion 
of 
SNPINV overrides 
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SNPNCA, and places all snoop hit lines into the In- 
valid state. If MAOE# is active on SNPSTB# sam- 
pling, the snoop request is ignored. 


7.62 SNPSTB# 


Snoop Strobe 
Initiates 82495XP snoop and latches snoop address 
& attributes 
Input to 82495XP (pin R3) Snooping Signal 
Sync to ClK or SNPClK, or strobed 


7.62.1 
SIGNAL 
DESCRIPTION 


Snoop strobe initiates a 82495XP snoop request. It 2 
controls the latching of the snoop address and 
snoop attribute signals, in the manner specified by 
one of three snooping modes: 


Snooping 
Modes 


Mode 
Snoop Addressl 
Attributes 
Sampled on: 


Strobed 
falling edge of SNPSTB# 
Clocked 
rising edge of SNPClK when 
SNPSTB# sampled active 
Synchronous 
rising edge of ClK when 
SNPSTB# sampled 


SNPSTB# must be asserted to initiate a snoop re- 
quest. Snoops are initiated by a bus master for all 
memory accesses, to ensure that data residing in 
other caches is flushed if modified and invalidated if 
necessary. 


SNPSTB# 
must be deasserted for at least one 
SNPClK 
or ClK 
when clocked or synchronous 
snooping mode (respectively) is used, in order to 
rearm for the next snoop. 


SNPSTB# can be asserted while a snoop is in prog- 
ress, allowing one level of pipelining. However, the 
reassertion of SNPSTB# while snooping is in prog- 
ress must not occur until after SNPCYC#-precise- 
Iy, after the falling edge of SNPCYC# for strobed 
and clocked modes, or in the clock after SNPCYC# 
is active for synchronous mode. SNPSTB# must not 
be asserted between the first and last BGT# of a 
locked sequence. Similarly, SNPSTB# must not oc- 
cur after the BGT# of the write through and before 
the BGT# of the allocation when a Read-for-Owner- 
ship transaction is occurring. 


SNPSTB# itself does not affect the cache contents 
or states, 
but the 
snoop signals SNPINV and 
SNPNCA, latched upon SNPSTB#, 
force various 
changes in the cache on a snoop hit. 
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7.62.2 
WHEN 
SAMPLED 


SNPSTB# is sampled on every SNPClK or ClK in 
clocked or synchronous modes, and is sampled con- 
stantly in strobed mode. While a snoop is in prog- 
ress, a new SNPSTB# is recognized as a new, pos- 
sibly pipelined, snoop request. After the assertion of 
a pipelined SNPSTB#, the SNPSTB# signal must 
not be reasserted until after the nextSNPCYC#. 


SNPSTB# should always meet proper set-up and 
hold times when operating in clocked or synchro- 
nous modes. When operating in strobed mode, it 
must meet minimum active/inactive 
times to be 
properly recognized in the next clock. 


7.62.3 
RELATION 
TO OTHER 
SIGNALS 


SNPSTB# latches the following signals: SNPINV, 
SNPNCA, MBAOE#, and MAOE#, and the address 
on the MSET, MTAG, and MCFA pins. The address 
which appears on the MSET, MTAG, and MCFA ad- 
dress pins is to be snooped in the 82495XP. 
MAOE# acts as a qualifier for a snoop; if MAOE# is 
active when sampled on a SNPSTB# assertion, the 
snoop request is ignored. SNPINV and SNPNCA 
provide the 82495XP with snoop attributes which af- 
fect the state of a snoop hit cache entry. 


If MBAOE# is active during SNPSTB# assertion, 
the 82495XP forces all bits in the subline address 
(those address bits which MBAOE# controls) to 0 
on a snoop write back for that snoop. 


Snoops and memory accesses are interlocked, such 
that after BGT# for a memory access has been is- 
sued, a SNPSTB# which is asserted will be latched, 
with its address and attributes, but will not cause a 
snoop until after SWEND# for that memory cycle. 
After BGT# has been issued for a cycle, snoop 
write-backs are delayed until after the CRDY# for 
that cycle. Likewise, once a snoop is underway 
(SNPCYC# active) BGT# 
is ignored until snoop 
completion. 


SNPSTB# must not be deasserted and reasserted 
(specifically, cause a second falling edge) between 
its initial recognition and SNPCYC#-ie, 
SNPSTB# 
must not be asserted before the SNPCYC# of the 
previous SNPSTB#. In strobed and clocked modes, 
SNPSTB# can be reasserted after the falling edge 
of SNPCYC#; in synchronous mode, SNPSTB# can 
be reasserted in the ClK after SNPCYC# is active. 
This 
second 
.assertion 
of 
SNPSTB#, 
after 
SNPCYC#, can occur while the first snoop is still 
progressing (SNPBSY# is active), allowing one level 
of snoop pipelining. In this case, a third assertion of 
SNPSTB# must not occur until after the SNPCYC# 
for the second, piped snoop request. 
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SNPSTB# must not be asserted while the 82495XP 
is executing a locked sequence (lOCK# 
active). 


Specifically, SNPSTB# must not be asserted after 
the BGT# for the first locked access and before the 
BGT# of the last locked access. 


Systems which support Read-for-Ownership must 
not assert SNPSTB# between the BGT# of the 
write through and the BGT# of the allocation during 
a Read-for-Ownership operation. 


7.63 
SWEND# 


Snoop Window End 
Closes Snooping Window 
Input to 82495XP (pin Q1) Cycle Progress Signal 


Synchronous to ClK 


7.63.1 
SIGNAL 
DESCRIPTION 


SWEND# is an input to the 82495XP that, when 
asserted, closes the snooping window and causes 
sampling of 
MWB/WT # 
and 
DRCTM#. 
Once 
snooping 
of 
all 
other 
82495XP's 
is 
complete, 
DRCTM# and MWB/WT # can be determined. 


Snoop response is blocked by the 82495XP be- 
tween BGT# and SWEND# activation. Therefore, 
the faster SWEND# is closed, faster snoops can be 
determined. 


All CPU-generatedwrite cycles and cache read miss 
cycles must cause a snoop on the memory bus. 
SWEND# 
may be activated once snooping has 
completed for these cycles. SWEND# 
activation 
causes the 82495XP's internal tags to change state 
for the-current cycle (if necessary). DRCTM# and 
MWB/WT # influence the state change decision. 


SWEND# need only be activated for those cycles 
which 
require the 
sampling 
of 
DRCTM# 
and 
MWB/WT#. 


If a cycle does not specifically require SWEND#, 
and SWEND# is not returned, snooping is blocked 
from BGT# to CRDY#. For this reason, it may be 
more efficient to always return SWEND#. 


7.63.2 
WHEN 
SAMPLED 


SWEND# is sampled by the 82495XP on the clock 
or after KWEND# is sampled active for those cycles 
that sample KWEND#. For cycles that do not sarn- 


pie KWEND#, SWEND# is sampled with or after 
BGT#. Once SWEND# is sampled active, it is ig- 
nored until KWEND# of the next cycle. If SWEND# 
is not being sampled, it may violate setup and hold 
times. 


Snoop response is blocked between BGT# and 
SWEND#. 
If a snoop is initiated between BGT# 
and SWEND#, 
the 
MTHIT# 
and MHITM# 
re- 
sponse is given after SWEND# activation. Any sub- 
sequent 
snoop 
write 
back 'would 
begin 
after 
CRDY#. 


7.63.3 
RELATION 
TO OTHER 
SIGNALS 


SWEND# causes the sampling of MWB/WT# 
and 
DRCTM#. SWEND# is sampled once KWEND# is 
sampled active. BGT#, KWEND#, and SWEND# 
may be asserted in the same clock. 


SWEND# shares a pin with CFG1. 


7.64 
SYNC# 


Sync 
Synchronizes 82495XP TAG array with Main Memo- 
ry 
Input to 82495XP (04) Cache Synchronization Sig- 
nal 
Asynchronous 


7.64.1 
SIGNAL 
DESCRIPTION 


SYNC# activation will cause the synchronization of 
the 82495XP and i860 XP CPU tag arrays with main 
memory. The 82495XP will flush all modified entries 
to memory. All valid tag entries will be kept, with 
modified [M] state lines becoming non-modified [E] 
state lines. 


7.64.2 
WHEN 
SAMPLED 


SYNC# can be asserted at any time. The 82495XP 
will complete all outstanding cycles on the CPU and 
memory bus before beginning the SYNC process. 
The memory bus controller does not have to prevent 
SYNC# during locked cycles because the 82495XP 
will complete its locked cycle before the SYNC pro- 
cess will begin. 


Once a SYNC operation has begun, the SYNC# sig- 
nal is ignored until the operation completes. If 
RESET or FLUSH# is asserted while the SYNC op- 
eration is in progress, the SYNC operation will be 
aborted and the RESET or FLUSH immediately exe- 
cuted. 
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SYNC# 
is an asynchronous input. SYNC# 
must 
have a pulse width of 2 ClK's in order to guarantee 
82495XP recognition. 


7.64.3 
RELATION 
TO OTHER 
SIGNALS 


To initiate a SYNC, the 82495XP will complete all 
pending cycles and prohibit further ADS#'s to occur 
while a SYNC is in progress. The FSIOUT# output 
signal is used to indicate the start and end of the 
SYNC operation. It will become active when the 
SYNC# signal is internally recognized (all outstand- 
ing cycles have completed) and will de-activate 
when the SYNC operation has completed. 


The memory bus controller supplies BRDY# to the 
CPU once the SYNC has completed. Once SYNC 2 
has begun, and FSIOUT# active, all CADS#'s and 
CRDY# 's correspond to the write-backs caused by 
the SYNC operation. 


The 82495XP can be snooped during SYNC cycles 
and the snooping protocols will be the same as that 
for any memory bus cycle. 


7.65 
TCK 


Test Clock 
Clock for the JTAG boundary scan tests 
Input to the i860 XP CPU (pin 01) Test Signal 
Input to the 82495XP (pin P3) 
Input to the 82490XP (pin 3) 
Synchronous 


7.65.1 
SIGNAL 
DESCRIPTION 


TCK is an input to the i860 XP CPU, 82495XP and 
82490XP and provides the clocking function re- 
quired by the JTAG boundary scan feature.' TCK is 
used to clock state information and data into and out 
of the component. State select information and data 
are clocked into the component on the rising edge 
of TCK on TMS and TDI, respectively. Data is 
clocked out of the part on the falling edge of TCK on 
TOO. 


In addition to using TCK as a free running clock, it 
may be stopped in a low, logic 0, state, indefinitely 
as described in IEEE 1149.1. While TCK is stopped 
in the low state, the boundary scan latches retain 
their state. 


When boundary scan is not used, TCK should be 
tied low. 
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Input to the i860 XP CPU (pin S14) Test Signal 


Input to the 82495XP 
(pin N3) 


Input to the 82490XP 
(pin 2) 


Synchronous 
to TCK 


7.66.1 
SIGNAL 
DESCRIPTION 


TOI is the serial input used to shift JT AG instructions 
and data into the component. 
The shifting of instruc- 
tions 
and 
data 
occurs 
during 
the ,SHIFT-IR 
and 
SHIFT- 
OR 
TAP 
controller 
states, 
respectively. 
These 
states 
are selected 
using the TMS signal as 
described 
in chapter 
9. 


An internal 
pull up resistor 
is provided 
on TOI to en- 
sure a known 
logic state if an open circuit occurs on 
the TOI path. 
Note 
than when 
"1" 
is continuously 
shifted 
into the instruction 
register, 
the BYPASS 
in- 
struction 
is selected. 


7.66.2 
WHEN 
SAMPLED 


TOI is sampled 
on the rising edge of TCK, during the 
SHIFT-IR 
and the SHIFT-OR 
states. 
During all other 
TAP controller 
states, TOI is a "don't 
care". 


7.66.3 
RELATION 
TO OTHER 
SIGNALS 


TOI is only sampled 
when TMS and TCK have been 
used to select 
the SHIFT-IR 
or SHIFT-OR 
states 
in 
the TAP controller. 


For proper initialization 
of JT AG logic, TOI should be 
driven high, "1", 
for at least four TCK cycles follow- 
ing the rising edge of RESET. 
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and SHIFT- 
OR TAP controller 
states. 
respectively. 


These 
states 
are selected 
using the TMS signal as 
described 
in chapter 
9. 


When 
not in SHIFT-IR 
or SHIFT-OR 
state, 
TOO is 
driven to a high impedance 
state to allow connecting 
TOO of different 
devices 
in parallel. 


7.67.2 


TOO is driven on the falling edge of TCK during the 
SHIFT-IR 
and SHIFT- 
OR TAP controller 
states. 
At 
all other times TOO is driven to the high impedance 
state. 


7.67.3 


TOO is only driven when TMS and TCK have been 
used to select the SHIFT- 
IR or SHIFT-OR 
states in 
the TAP controller. 


7.68 
TMS 


Test Mode Select 


Controls 
testing 
by selecting 
mode of operation 


Input to the i860 XP CPU Test Signal 


Input to the 82495XP 
(pin P2) 


Input to the 82490XP 
(pin 1) 


Synchronous 
to TCK 


7.68.1 
SIGNAL 
DESCRIPTION 


TMS 
is decoded 
by the 
JTAG 
TAP 
(Tap 
Access 
Port) to select the operation 
of the test logic, as de- 
scribed 
in chapter 
9. 


intel~ 


To guarantee 
deterministic 
behavior 
of the TAP con- 
troller TMS is provided 
with an internal 
pull-up resis- 
tor. If boundary 
scan is not used, TMS may be tied 
high or left unconnected. 


7.68.2 
WHEN 
SAMPLED 


TMS is sampled 
on every rising edge of TCK. 


7.68.3 
RELATION 
TO OTHER 
SIGNALS 


TMS 
is used 
to select 
the 
internal 
TAP states 
re- 
quired to load boundary 
scan instructions 
to data on 
TDI. 


For 
proper 
initialization 
of 
the 
JT AG 
logic, 
TMS 
should 
be driven 
high, "1", 
for at least four TCK cy- 
cles following 
the rising edge of RESET. 


7.69 
Vcc and Vss 


Power and Ground 
Pins 


See Tables 
1.1 and 1.2 for locations. 


7.70 
WWOR# 


Weak Write Ordering 
Mode 


Enforces 
strong/weak 
write-ordering 
policy 


Input to 82495XP 
(pin Q2) Configuration 
Signal 


Synchronous 
to ClK 


7_70.1 
SIGNAL 
DESCRIPTION 


When asserted 
during reset, the 82495XP 
enforces 
a weak write ordering 
policy. If WWOR IF is deassert- 
ed 
during 
reset, 
the 
82495XP 
enforces 
a strong 
write-ordering 
policy. 


In a strong write-ordering 
mode, writes to the memo- 
ry bus are forced 
to occur in the order in which they 
were 
posted 
by the CPU. In a weak 
write-ordering 
mode it is possible 
for: 


1. A 
CPU 
posted 
write 
(A) 
to 
be 
waiting 
in 
a 
82495XP/82490XP 
memory 
buffer. 


2. A subsequent 
CPU write 
(B) to complete 
in the 
82495XP/82490XP 
because 
it was a hit to M or E 
state. 
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3. A snoop hit to B to cause a write back of B before 
A is written. 


In this scenario, 
B is written 
to memory 
before 
A is, 
and thus CPU writes 
have been reordered. 


7.70.2 
WHEN 
SAMPLED 


WWOR IF is sampled 
during reset like figure 7-1 with 
a setup 
time of 4 CPU clocks. 
WWOR 
IF 
becomes 
MALE 
once 
FSIOUT IF 
indicates 
that 
the 82495XP 
reset sequence 
has completed. 


7_70.3 
RELATION 
TO OTHER 
SIGNALS 


WWOR 11 shares 
a pin with MALE. 


8.0 
BUS FUNCTIONAL DESCRIPTION 
AND TIMING 


The 82495XP/82490XP 
cache core supports 
a wide 
variety 
of bus transfers 
to meet the needs 
of high 
performance 
systems. 
Bus transfers 
can be single 
cycle or multiple cycle, cacheable 
or non-cacheable, 
64- or 128-bit (memory 
bus), and locked. 
To support 
multiprocessing 
systems there are cache back-inval- 
idation, inquire, snooping, 
read for ownership, 
cache 
to cache transfers, 
and locked 
cycles. 


This section begins with read cycles, both cacheable 
and 
non-cacheable. 
It moves 
on 
to 
write 
cycles, 
cacheable 
and non-cacheable. 
Snooping 
cycles 
are 
discussed 
next with an example 
of each 
snooping 
mode. The remaining 
sections 
describe 
special 
cy- 
cles: read for ownership, 
I/O, and locked 
cycles. 


The cycles 
shown 
in this chapter 
are examples 
of 
various types of 82495XP/82490XP 
cycles. The pur- 
pose of these 
examples 
is to show 
signal 
relation- 
ships, and are not necessarily 
best case scenarios. 


8.1 
Read Cycles 


8.1.1 
READ 
HITS 


Read Hit cycles 
are executed 
completely 
within 
the 
CPU/Cache 
core, and will not be seen by the MBC. 
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Figure 8·1. Cacheable Read Miss with Clean Replacement 


2-356 
I 


cycle is performed on the CPU bus to guarantee that 
the replaced data is also removed from the CPU's 
first level cache, thus maintaining the inclusion prop- 
erty. 


CACHE CONTROL SIGNALS: 


The 
CPU 
initiates 
the 
read 
cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is looked up. Once the 82495XP determines 
the cycle to be a cache miss, it issues CADS# 
(clock 2) and the associated cycle control signals to 
the MBC (eg. CW/R#, CM/IO#, 
CD/C#, 
RDYSRC, 
MCACHE#) in order to schedule the cache line-fill 
operation. MCACHE# is active, indicating that the 
read miss is potentially cacheable by the 82495XP; 
RDYSRC is active, indicating that the MBC must 
supply BRDY#s to the CPU cache core. 


The 
memory 
bus 
address 
(MSET[10:01, 


MTAG[11:01, MCFA[6:0j) 
is valid with 
CADS# 
(clocks 2 and 13 for the two cycles in this example) 
and remain valid until after CNA# is sampled active 
by the 82495XP (clocks 5 and 16). MALE and MBA- 
LE may be used to hold the address as necessary. 


The MBC arbitrates for the memory bus and returns 
BGT# asserted (clock 3), indicating that the cycle is 
guaranteed to complete on the memory bus. Once 
the 82495XP samples BGT# asserted, it must finish 
that cycle on the memory bus. Prior to this point, the 
cycle can be aborted by a snoop hit in the cache. 


CNA# is asserted by the MBC (clock 4) to indicate 
that it is ready to schedule a new memory bus cycle. 
Note that after CNA# activation, cycle control sig- 
nals are not guaranteed to be valid. 


When the MBC has determined the cacheability at- 
tribute of the cycle, it drives the MKEN# signal ac- 
cordingly. The MBC also drives the KWEND# signal 


Dyme Mt:lt; 
(ClOCK 1U). It IS at this time that the data 2 


in the 82490XP's memory cycle buffers is loaded 
into the cache SRAM. 


The 82495XP issues a new CADS# in clock 13 
which also misses the 82495XP/82490XP cache: 
Note that once the cycle progress signals (BGT# , 
CNA#, KWEND#, SWEND#) of a cycle are sam- 
pled asserted, the 82495XP ignores them until the 
<?RDY# of that cycle. The 82495XP does not pipe- 
hne the cycle progress signals (ie. it will not sample 
them again until after CRDY# of the current memory 
bus cycle). 


MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output enables (MAOE# 
and MBAOE#) 
should be asserted by the MBC. MDOE# must be 
inactive to allow the data pins to be used as inputs. 


Some time after the address has been driven onto 
the memory bus, data will be supplied from the 
DRAM (main memory) to 
the 
82490XP cache 
SRAM. 


For Clocked Memory Bus Mode, MSEL# is driven 
active by the MBC (clock 4) to allow sampling of 
MBRDY# and to latch MZBT# for the transfer. 
MZBT# 
is sampled on all MCLK edges where 
MSEL# is inactive. Once MSEL# is sampled active 
by the 82495XP, the value of MZBT# sampled on 
the prior MCLK is used for the 
next transfer. 


MBRDY# is driven active by the MBC in clocks 4 to 
6 to cause the memory burst counter to be incre- 
mented and data to be placed into the 82490XP 
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cache 
memory cycle 
buffers. The 
MBC drives 
MEOC# asserted (clock 7) to end the current cycle 
on the memory bus and switch memory cycle buffers 
for the new cycle. MZBT# is latched at this time 
(when MEOC# is sampled asserted and MSEL# re- 
mains low) for the next transfer. 


MBRDY# is driven active by the MBC in clocks 15 
to 17 to read data into the 82490XP cache memory 
cycle buffers. The MBC asserts MEOC# (clock 18) 
to end the second read miss cycle on the memory 
bus and switch the memory cycle buffers for a new 
cycle. 


For Strobed Memory Bus Mode, MSEL# is driven 
active by the MBC (clock 4) to allow MISTB opera- 
tion and to latch MZBT# (on the falling edge of 
MSEL#) for the transfer. MISTB is toggled in clocks 
5 to 7 to cause the memory burst counter to be in- 
cremented, and data to be placed into the 82490XP 
cache memory cycle buffers. Note: MISTB latches 
the memory bus data on both the rising and falling 
edges. The MBC drives MEOC# asserted (clock 8) 
to end the current cycle on the memory bus and 
switch memory cycle buffers for the new cycle. 
MZBT# for the next cycle, is sampled at this time on 
the falling edge of MEOC#. 


MISTB is toggled by the MBC (clocks 15 to 17) to 
read data into the 82490XP memory cycle buffers. 
The MBC asserts MEOC# (clock 18) to end the sec- 
ond read miss cycle on the memory bus and switch 
the memory cycle buffers for a new cycle. 


8.1.2.2 Read Miss with Replacement of Dirty 
Line 


Figure 8.2 illustrates a CPU read cycle which misses 
the 82495XP cache, and requires the replacement 
of a modified line (eg. tag replacement, lines/sec- 
tor= 1 line ratio= 1). In such cycles, the 82495XP 
will instruct the MBC to perform a cache line-fill on 
the memory bus, instruct the 82490XP to fill its write- 
back buffer with the contents of the array location 
corresponding to the line which must be replaced, 
and perform a back invalidation to the CPU to main- 
tain the first and second level cache consistency. 
Once 
the 
cache 
line-fill 
has 
completed, 
the 
82495XP/82490XP will write back the contents of 
the replaced line to main memory from the 82490XP 
write-back buffer. 


CACHE CONTROL SIGNALS: 


The 
CPU 
initiates 
the 
read 
cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is rooked up. Once the 82495XP determines 
the cycle to be a cache miss, it issues CADS# 
(clock 1) and the associated cycle control signals to 
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the MBC (eg. CW/R#, CM/lOll', CD/C#, RDYSRC, 
MCACHE#) in order to schedule the cache line-fill 
operation. MCACHE# is active, indicating that the 
read miss is potentially cacheable by the 82495XP; 
RDYSRC is active, indicating that the MBC must 
supply BRDY#s to the CPU cache core. 


The 
memory 
bus 
address 
(MSET[10:01, 


MTAG[11:01, MCFA[6:0l) 
is valid with 
CADS# 
(clocks 1 and 5 for the two cycles in this example) 
and remain valid until after CNA# is sampled active 
by the 82495XP (clocks 4 and 10). MALE and MBA- 
LE may be used to hold the address as necessary. 


The MBC arbitrates for the memory bus and returns 
BGT# asserted (clock 2), indicating that the cycle is 
guaranteed to complete on the memory bus. At this 
point, the 82490XP's write-back buffer is prefilled 
with the line to be replaced. Once the 82495XP sam- 
ples BGT# asserted, it must finish that cycle on the 
memory bus. Prior to this point, the cycle can be 
aborted by a snoop hit from another cache. 


CNA# is asserted by the MBC (clock 3) to indicate 
that it is ready to schedule a new memory bus cycle. 
Note that after CNA# activation, cycle control sig- 
nals are not guaranteed to be valid. 


When the MBC has determined the cacheability at- 
tribute of the cycle, it drives the MKEN# signal ac- 
cordingly. The MBC also drives the KWEND# signal 
at this time, indicating the end of the cacheability 
window. The 82495XP samples MKEN# 
during 
KWEND# (clock 4) to determine that the cycle is 
indeed cacheable. 


The MBC asserts SWEND# 
(clock 6) when the 
snoop window ends on the memory bus. The clo- 
sure of the snoop window enables the MBC to start 
providing the CPU with data that has been stored in 
the 82490XP's memory cycle buffer. The MBC sup- 
plies BRDY#s to the CPU (clocks 6-9) to serve the 
read cycle. Note that data may be supplied to the 
82490XP's immediately after MSEL# activation, and 
need not wait for SWEND#. 


On the memory bus, the 82495XP issues a write- 
back (WB) cycle. CNA# is sampled active in clock 3 
causing the 82495XP to issue the CADS# 
(also 
CDTS#) 
of the write-back (clock 5). The MBC 
knows this is a write back cycle and not a CPU initia- 
ted write cycle by sampling MCACHE# asserted. 
This tells the MBC how many data transfers are nec- 
essary. 


BGT#, CNA#, and KWEND# of the write-back are 
sampled asserted by the MBC (clock 9) after the 
CRDY# of the read miss cycle (clock 8). At this 
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Figure 8·2. Cacheable Read Miss with Replacement of Dirty Line 
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point, the 82495XP may issue another CADS# for a 
new (unrelated) memory bus cycle. It is at this time 
that the data in the 82490XP's memory cycle buffers 
is loaded into the cache SRAM. The data to be writ- 
ten back to main memory is in the 82490XP's write 
back buffers. 


The snoop window for the write back cycle is closed 
by the MBC in clock 11, and the cycle is ended by 
CRDY# sampled asserted in clock 13. 


MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output enables 
(MAOE# 
and MBAOE#) 
should be asserted by the MBC. 


Some time after the address has been driven onto 
the memory bus, data will be supplied from the 
DRAM 
(main memory) to 
the 
82490XP cache 
SRAM. 


For Clocked Memory Bus Mode, MSEL# is driven 
active by the MBC (clock 3) to allow sampling of 
MBRDY# and to latch MZBT# for the transfer. 
MZBT# 
is sampled on all MCLK edges where 
MSEL# is inactive. Once MSEL# is sampled active 
by the 82495XP, the value of MZBT# sampled on 
the 
prior MCLK is used for the 
next transfer. 


MBRDY# is driven active by the MBC in clocks 3 to 
5 to cause the memory burst counter to be incre- 
mented and data to be placed into the 82490XP 
cache 
memory cycle 
buffers. The 
MBC drives 
MEOC# asserted (clock 6) to end the current cycle 
on the memory bus and switch memory cycle buffers 
for the new cycle. MZBT# is latched at this time 
(when MEOC# is sampled asserted) for the next 
transfer. 


The MBC asserts the memory data output enable 
signal (MDOE#, clock 8) to drive the memory data 
outputs. 


MBRDYif is driven active by the MBC in clocks 10 
to 12 to write data from the 82490XP cache memory 
cycle buffers onto the memory bus. The MBC as- 
serts MEOC# (clock13) to end the write back cycle 
on the memory bus and switch the memory cycle 
buffers for a new cycle. 


For Strobed Memory Bus Mode, MSEL# is driven 
active by the MBC (clock 4) to allow MISTB opera- 
tion and to 
latch MZBT# 
for the transfer (on 
MSEL# falling edge). MISTB is toggled in clocks 5 
to 7 to cause the memory burst counter to be incre- 
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mented, and data to be placed into the 82490XP 
cachememory cycle buffers. Note: MISTB latches 
the memory bus data on both the rising and falling 
edges. The MBC drives MEOC# asserted (clock 8) 
to end the current cycle on the memory bus and 
switch memory cycle buffers for the new cycle. 
MZBT# for the next cycle, is latched at this time on 
the falling edge of MEOC#. 


The MBC asserts MDOE#. (clock 9) to drive the 
memory data outputs. 


MOSTB is toggled by the MBC (clocks 10 to 12) to 
write data from the 82490XP memory cycle buffers 
onto the memory bus. The MBC asserts MEOC# 
(clock 13) to end the write back cycle on the memo- 
ry bus and switch the memory cycle buffers for a 
new cycle. 


8.1.3 NON-CACHEABLE READ MISSES 


8.1.3.1 Read Misses not Cacheable by CPUI 
Cache Core and Cacheable by Core, but 
not by Memory Bus 


Figure 8.3 illustrates two CPU read cycles which 
miss the 82495XP cache, and are non-cacheable. In 
the first cycle, the CPU/Cache core forces the read 
to 
be 
non-cacheable 
(as 
indicated 
by 
the 
MCACHE# output from the 82495XP). In the sec- 
ond cycle, non-cacheability of the data is forced by 
the memory bus (as indicated by the MKEN# input 
from the MBC). Since both cycles are not cache- 
able, there is no line-fill operation performed, the cy- 
cles are merely echoed to the memory bus. 


CACHE CONTROL SIGNALS: 


The CPU initiates the 
first 
read cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is looked up. Once the 82495XP determines 
the cycle to be a cache miss, it issues a cycle re- 
quest (CADS# in clock 1) and the associated cycle 
control signals to the MBC (eg. CW/R#, 
CM/IO#, 


CD/C#, RDYSRC, MCACHE#) in order to schedule 
the read operation. RDYSRC is active, indicating 
that the MBC must provide BRDY# to the CPU; 
MCACHE# is not active, indicating that the read 
miss in not cacheable by the CPU/Cache core. 


The 
memory 
bus 
address 
(MSET[10:01, 


MTAG[11:01, MCFA[6:0l) 
is valid with 
CADS# 
(clocks 1 and 5 for the two cycles in this example) 
and remain valid until after CNA# is sampled active 
by the 82495XP (clocks 4 and 10). MALE and MBA- 
LE may be used to hold the address as necessary. 
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Figure 8-3, Non-Cacheable Read Misses 


2-361 


_____ 
I •..• 
__ 
..,...,..,1... 
IIIQ 
IVlt.,IV 
~UtJ"'IICo); 
BRDY11 to the CPU to complete the cycle to the 
CPU. The MBC asserts CRDY (clock 8) to the 
82495XP/82490XP to complete the read miss cycle 
on the memory bus. 


The 82495XP issues a new (unrelated) cycle request 
(CADSII 
in 
clock 
5) 
which 
also 
misses 
the 
82495XP/82490XP cache. Since the 82495XP has 
already sampled CNA11 asserted, it issues a new 
CADSII prior to receiving CRDYII of the current cy- 
cle (ie. this cycle is pipelined within the MBC). Note 
that once the cycle progress signals of a cycle are . 
sampled asserted, the 82495XP ignores them until 
the CRDY! 
of that cycle. The 82495XP will not 
sample the cycle progress signals again until after 
the CRDY11 of the current memory bus cycle. The 
current read cycle is completed on the bus in clock 8 
with CRDY11 assertion. 


The cycle progress signals for the second read miss 
are also valid at this time (clock 5). RDYSRC is ac- 
tive, indicating that the MBC must provide BRDYlls 
to the CPU/Cache core; and MCACHEII is active, 
indicating that the read miss is potentially cacheable 
by the 82495XP/82490XP. 


The MBC issues BGTII and CNAII to the 82495XP 
in clock 9 to indicate that the cycle is guaranteed to 
complete on the memory bus, and that it is ready to 
schedule a new memory bus cycle. KWENDII is as- 
serted at this time to close the cacheability window. 
MKEN11 is not active, indicating to the 82495XP that 
the read miss cycle is not cacheable by the memory 
bus. KWENDII and MKENII must be returned to the 
82495XP at least two clocks prior to BRDY11 to in- 
form the CPU that a line fill will not follow. 


The MBC asserts SWEND11 (clock 11) to close the 
snoop window, and CRDY11 (clock 13) to complete 
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Some time after the address has been driven onto 
the memory bus, data will be supplied from the 
DRAM (main memory) to the 82490XP memory cy- 
cle buffers. 


For Clocked Memory Bus Mode, MEOCII is assert- 
ed by the MBC (clock 6) to latch MZBT11 for the 
next transfer, and end the current cycle on the mem- 
ory bus (MBRDYII and MSELII are not necessary 
since this example shows a single transfer cycle). 
MZBT 11 is driven high by the MBC in order to force 
the read cycle to begin with a non-zero burst ad- 
dress. 


For the second non-cacheable read cycle, MSELII 
is driven active by the MBC (clock 8) to allow sam- 
pling of MBRDY11 and to latch MZBT 11 for the trans- 
fer. MZBT11 is sampled on all MCLK edges where 
MSEL11 is inactive. Once MSEL11 is sampled active 
by the 82495XP, the value of MZBTII sampled on 
the prior MCLK is used for the next transfer. Again, 
MZBT11 is driven high by the MBC to force the trans- 
fer 
to 
begin 
with 
the 
correct 
burst 
address. 
MBRDY11 is driven active by the MBC in clock 10 to 
cause the memory burst counter to be incremented 
and data to be placed into the 82490XP cache mem- 
ory cycle buffers. The MBC drives MEOCII asserted 
(clock 12) to end the current cycle on the memory 
bus and switch memory cycle buffers for the new 
cycle. 


For Strobed Memory Bus Mode, MEOCII is driven 
active by the MBC (clock 5) to latch MZBT 11 for the 
transfer (on MEOCII falling edge), and end the cur- 
rent cycle on the memory bus (MISTB is not neces- 
sary since this example shows a single transfer cy- 
cle). MZBT11 is driven high by the MBC in order to 
force the read cycle to begin with the correct burst 
address. 
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Figure 8-4. Write Hit to [S] State Line (Write-Through) 
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For the second non-cacheable read cycle, MSEL# 
is driven active by the MBC (clock 8) to allow MISTB 
operation and to latch MZB,T# for the transfer (on 
MSEL# falling edge). Again, MZBT# is driven high 
by the MBC to force the transfer to begin with the 
correct burst address. MISTB is toggled in clock 9 to 
cause the memory burst counter to be incremented, 
and data to be placed into the 82490XP cache mem- 
ory cycle buffers. Note: MISTB latches the memory 
bus data on both the rising and falling edges. The 
MBC drives MEOC# asserted (clock 13) to end the 
current cycle on the memory bus and switch memo- 
ry cycle buffers for the new cycle. MZBT# for the 
next cycle (not shown), is sampled at this time on 
the falling edge of MEOC#. 


8.2 Write Cycles 


8.2.1 
WRITE HITS 


8.2.1.1 
Write Hit to [E) or [M) States 


CPU initiated write cycles which hit 82495XP entries 
tagged in the [E) or [M) states are executed com- 
pletely within the CPU/Cache core, and will not be 
seen by the MBC. 


8.2.1.2 
Write Hit to [S) State 


Figure 8.4 illustrates CPU initiated write cycles which 
hit lines in the 82495XP/82490XP cache array that 
are in the shared state. If the 82495XP/82490XP is 
used as a write through cache (not write back), the 
[S) state is the only state a cached line could be in. 
These cycles are posted as are all normal write cy- 
cles (as long as no other write miss is pending). 


CACHE CONTROL SIGNALS: 


The 
CPU 
initiates 
the 
write 
cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is looked up. Once the 82495XP determines 
the cycle to be a hit to shared state, it posts the write 
and returns BRDY# to the CPU. 


The 82495XP next issues a cycle request (CADS# 
in clock 1), and the associated cycle control signals 
to 
the 
MBC 
(eg. CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, MCACHE#, PALLC#) in order to sched- 
ule the write through operation. MCACHE# is not 
active since the write will be posted; RDYSRC is not 
active, 
indicating that 
the 
82495XP will 
supply 
BRDY# to the CPU; PALLC# is not active, indicat- 
ing that an allocation cycle will not be performed 
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(regardless of MKEN# state) since the line is al- 
ready available in the cache. The MBC must also 
latch PWT and PCD on BLE# falling edge in order 
to track hits and misses to the [S) state. This is how 
an external state tracker can track the [S) state. 


The 
memory 
bus 
address 
(MSET[10:0)' 


MTAG[11:0), 
MCFA[6:01) is valid with 
CADS# 
(clocks 1 and 6 for the two cycles in this example) 
and remains valid until after CNA# is sampled active 
by the 82495XP (clocks 4 and 9). MALE and MBALE 
may be used to hold the address as necessary. 


The MBC arbitrates for the memory bus and returns 
BGT# asserted (clock 2), indicating that the cycle is 
guaranteed to complete on the memory bus. Once 
the 82495XP samples BGT# asserted, it must finish 
that cycle on the memory bus: Prior to this point, the 
cycle can be aborted by a snoop hit from another 
cache. 


CNA# is asserted by the MBC (clock 3) to indicate 
that it is ready to schedule a new memory bus cycle. 
Note that after CNA# activation, cycle control sig- 
nals are not guaranteed to be valid. KWEND# is 
also driven at this time since the cacheability of this 
cycle is already known and MKEN# is a don't care. 
It is not necessary that KWEND# be asserted at this 
time. 


The 82495XP provides BRDY# to the CPU since 
the cycles are posted writes. The MBC completes 
the first write hit to [S) state in clock 5 when it as- 
serts CRDY# to the 82495XP/82490XP cache. The 
data is latched in to the 82490XP array from the 
memory cycle buffer at this time. 


In this example, the 82495XP issues a second write 
to [S) state in clock 6. For this cycle, the 82495XP 
issues the memory bus request (CADS#) as soon 
as it can after sampling CNA# 
asserted. The 
82495XP will not wait for KWEND# (if it does not 
get asserted immediately as in this example) to is- 
sue CADS# since this is not a potential allocate cy- 
cle (ie. PALLC# active). 


The MBC asserts BGT#, CNA#, and KWEND# to- 
gether in clock 8 to indicate that the current cycle is 
guaranteed to complete and the 82495XP is free to 
schedule a new memory bus cycle. 


Again, the 82495XP provides BRDY# to the CPU 
since the cycles are posted writes. The MBC com- 
pletes the second write hit to [S) state in clock 12 
when it asserts CRDY# to the 82495XP/82490XP 
cache. The data is latched in to the 82490XP array 
from the memory cycle buffer at this time. 


MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output 
enables (MAOE# 
and 
MBAOE#) 
should be asserted by the MBC. 


For Clocked Memory Bus Mode, the memory data 
output enable signal (MDOE#) is asserted by the 
MBC in clock 2 to drive the memory data outputs. 


MEOC# is asserted by the MBC (clock 4) to latch 
MZBT# for the transfer, and end the current cycle 
on the memory bus (MBRDY# is not necessary 
since this example shows a single transfer cycle). 
MZBT# is driven high by the MBC in order to force 
the write cycle to begin with the correct burst ad- 
dress. MFRZ# is sampled here (it need not be ac- 
tive since the cycle is not potentially allocatable). 


For the second write through cycle, MSEL# is driv- 
en active by the MBC (clock 7) to allow sampling of 
MBRDY# and to latch MZBT# for the transfer. 
MZBT# 
is sampled on all MCLK edges where 
MSEL# is inactive. Once MSEL# is sampled active 
by the 82495XP, the value of MZBT# sampled on 
the prior MCLK is used for the next transfer. Again, 
MZBT# is driven high by the MBC to force the trans- 
fer 
to 
begin 
with 
the 
correct 
burst 
address. 
MBRDY# is driven active by the MBC in clock 10 to 
cause the memory burst counter to be incremented 
and data to be placed into the 82490XP cache mem- 
ory cycle buffers. The MBC drives MEOC# asserted 
(clock 12) to end the current cycle on the memory 
bus and switch memory cycle buffers for the new 
cycle. 


For Strobed Memory Bus Mode, the memory data 
output enable (MDOE#) is asserted by the MBC in 
clock 2 to drive the memory data outputs. 


MEOC# is driven active by the MBC (clock 4) to 
latch MZBT# for the transfer (on MEOC# falling 
edge), and end the current cycle on the memory bus 
(MOSTB is not necessary since this example shows 
a single transfer cycle). MZBT# is driven high by the 
MBC in order to force the read cycle to begin with 
the correct burst address. 


For the second write through cycle, MSEL# is driv- 
en active by the MBC (clock 6) to allow MOSTB op- 
eration and to latch MZBT# for the transfer (on 
MSEL# falling edge). Again, MZBT# is driven high 
by the MBC to force the transfer to begin with the 
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correct burst address. MOSTB is toggled in clock 9 
to cause the memory burst counter to be increment- 
ed, and data to be placed into the 82490XP cache 
memory cycle buffers. Note: MOSTB latches the 
memory bus data on both the rising and falling edg- 
es. The MBC drives MEOC# asserted (clock 11) to 
end the current cycle on the memory bus and switch 
memory cycle buffers for the new cycle. MZBT# for 
the next cycle (not shown), is sampled at this time 
on the falling edge of MEOC#. 


8.2.2 
WRITE 
MISSES 


8.2.2.1 
Write 
Miss with no Allocation 


Figure 8.5 illustrates two CPU initiated write cycles • 
which miss the 82495XP/82490XP cache and are 
not allocatable. The first write cycle begins as a po- 
tentially allocatable cycle, but MKEN# sampled in- 
active indicates that the cycle is not cacheable by 
the memory bus. The second write miss cycle is not 
cacheable by the CPU/82495XP/82490XP as indi- 
cated by the PALLC# output from the 82495XP. 


CACHE CONTROL SIGNALS: 


The CPU initiates the first 
write cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is looked up. Once the 82495XP determines 
the cycle to be a cache miss. It issues a cycle re- 


o quest (CADS# in clock 1) and the associated cycle 
control signals to the MBC (eg. CW/R#, 
CM/IO#, 


CD/C#, 
RDYSRC, MCACHE#, PALLC#) in order 
to schedule the write miss operation. RDYSRC is not 
active, indicating that the 
82495XP will 
supply 
BRDY# 
to the CPU; MCACHE# 
is not active; 


PALLC# is active, indicating that the cycle is poten- 
tially allocatable. 


The write miss data is posted in the 82490XP's 
memory cycle buffer, and the cycle completes with 
no wait states to the CPU. The CPU is then free to 
issue another (non-related) cycle while the 82495XP 
completes the current write miss cycle and possible 
allocation. If this new cycle is a cache hit, it will be 
serviced by the 82495XP immediately; but if it is a 
cache miss, its service will wait until the CRDY# of 
the write cycle (and allocation cycle, if executed). 


The 
memory 
bus 
address 
(MSET[10:0], 
MTAG[11:0], 
MCFA[6:0j) 
is valid with 
CADS# 
(clocks 1 and 7 for the two cycles in this example) 
and remain valid until after CNA# is sampled active 
by the 82495XP (clocks 4 and 10). MALE and MBA- 
LE may be used to hold the address as necessary. 
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Figure 8-5. Write Miss with No Allocation 
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The MBC arbitrates for the memory bus and returns 
BGT11 asserted (clock 2), indicating that the write 
through cycle is guaranteed to complete on the 
memory bus. Once the 82495XP samples BGT11 as- 
serted, it must finish that cycle on the memory bus. 
Prior to this point, the cycle can be aborted by a 
snoop hit from another cache. 


CNAII is asserted by the MBC (clock 3) to indicate 
that it is ready to schedule a new memory bus cycle. 
Notice that the cycle control signals are not guaran- 
teed to be valid after CNA 11 activation. NOTE that 
CNAII has no effect before KWENDII. 


When the MBC has determined the cacheability at- 
tribute of the write through cycle, it drives the 
MKEN11 signal accordingly. The MBC also drives 
the KWENDII signal at this time (clock 4), indicating 
the end of the cacheability window. The 82495XP 
samples MKENII inactive during KWENDII, indicat- 
ing that the missed cycle is not cacheable and 
should not be allocated. 


The MBC asserts SWENDII 
(clock 6) when the 
snoop window of the write through cycle ends on the 
memory bus. The MBC may return CRDY11 to the 
82495XP/82490XP cache any time after the closure 
of the snoop window. In this example, CRDY11 is 
issued by the MBC in clock 8. 


The 82495XP issues a cycle request for the second 
write miss cycle in clock 7. The cycle control signals 
are valid at this time. Note that PALLCII is inactive, 
indicating that the 82495XP/82490XP has deter- 
mined the cycle to not be allocatable. 


The MBCII asserts BGTII, CNAII, and KWENDII in 
clock 9. MKEN 11 is a don't care during the cachea- 
bility window since the cycle is not allocatable. The 
snoop window is closed in clock 11, and the cycle is 
completed on the memory bus in clock 13 with the 
assertion of CRDY11 by the MBC. 


MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output 
enables 
(MAOE11 
and MBAOE11) 
should be asserted b~'the MBC. 


For Clocked Memory Bus Mode, the memory data 
output enable (MDOEII) is asserted by the MBC in 
clock 4 to drive the memory data outputs. 
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MEOCII is asserted by the MBC (clock 5) to latch 
MZBT 11 for the transfer, and end the current cycle 
on the' memory bus (MBRDY11 is not necessary 
since this example shows a single transfer cycle). 
MZBT 11 is driven high by the MBC in order to force 
the read cycle to begin with the correct burst ad- 
dress. MFRZ 11 is sampled here (it need not be ac- 
tive since the cycle is not potentially allocatable). 


For the second non allocatable write cycle, MSELII 
is driven active by the MBC (clock 8) to allow sam- 
pling of MBRDY11 and to latch MZBT 11 for the trans- 
fer. MZBT 11 is sampled on all MCLK edges where 
MSELII is inactive. Once MSELII is sampled active 
by the 82495XP, the value of MZBT 11 sampled on 
the prior MCLK is used for the next transfer. Again, 
MZBT 11 is driven high by the MBC to force the trans- 
fer 
to 
begin 
with 
the 
correct 
burst 
address. 
MBRDYII is driven active by the MBC in clock 10 to 
cause the memory burst counter to be incremented 
and data to be placed into the 82490XP cache mem- 
ory cycle buffers. 


The MBC drives MEOCII asserted (clock 13) to end 
the current cycle on the memory, bus and switch 
memory cycle buffers for the new cycle. MFRZII is 
sampled here (it need not be active since the cycle 
is not potentially allocatable). MZBT 11 is also sam- 
pled at this time. 


For Strobed Memory Bus Mode, the memory data 
output enable (MDOEII) is asserted by the MBC in 
clock 2 to drive the memory'data outputs. 


MEOCII is driven active by the MBC (clock 5) to 
latch MZBT 11 for the transfer, and end the current 
cycle on the memory bus (MOSTB is not necessary 
since this example shows a single transfer cycle). 
MZBT 11 is driven high by the MBC in order to force 
the read cycle to begin with the correct burst ad- 
dress. 


For the second write through cycle, MSELII is driv- 
en active by the MBC (clock 8) to allow MOSTB op- 
eration and to latch MZBT 11 for the transfer. Again, 
MZBT 11 is driven high by the MBC to force the trans- 
fer to begin with the correct burst address. MOSTB 
is toggled in clock 12 to cause the memory burst 
counter to be incremented, and data to be read from 
the 82490XP cache memory cycle buffers. Note: 
MOSTB latches the memory bus data on both the 
rising and falling edges. 


The MBC drives MEOCII asserted (clock 13) to end 
the current cycle on the memory bus and switch 
memory cycle buffers for the new cycle. MZBT 11 
and MFRZ 11 for the next cycle (not shown), is sam- 
pled at this time on the falling edge of MEOCII. 


• 
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Figure 8-6. Write Miss with Allocation 
to [M] Line 
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8.2.2.2 Write Miss with Allocation 


Figure 8.6 illustrates a CPU initiated write cycle 
which misses the 82495XP/82490XP cache and fol- 
lows the write to main memory with an allocation 
cycle. An allocation is when the cache follows a 
write miss cycle with a line fill. This example as- 
sumes that allocating the new line requires the re- 
placement of a modified line (ie. a write-back to main 
memory). 


CACHE CONTROL SIGNALS: 


The 
CPU 
initiates 
the 
write 
cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is looked up. Once the 82495XP determines 
the cycle to be a cache miss, it issues CADS# 
(clock 1) and the associated cycle control signals to 
the MBC (eg. CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, 
MCACHE#, PALLC#) in order to schedule the write 
operation. MCACHE# is not active; RDYSRC is not 
active, 
indicating that the 
82495XP will 
supply 
BRDY#s to the CPU; PALLC# is asserted, indicat- 
ing a potential allocate cycle after the write-through 
cycle. 


The write miss data is posted in the 82490XP's 
memory cycle buffer, and the cycle completes with 
no wait states to the CPU. The CPU is free to issue 
another (non-related) cycle while waiting for the 
82495XP to complete the allocation. If this new cy- 
cle is a cache hit, it will be serviced by the 82495XP 
immediately; but if it is a cache miss, its service will 
wait until the CRDY# of the allocation. 


The 
memory 
bus 
address 
(MSET[10:0)' 


MTAG[11:0), 
MCFA[6:0l) 
is valid with 
CADS# 
(clocks 1, 5 and 10 for the three cycles in this exam- 
ple) and remain valid until after CNA# is sampled 
active by the 82495XP (clocks 4, 10 and 19). MALE 
and MBALE may be used to hold the address as 
necessary. 


The MBC arbitrates for the memory bus and returns 
BGT# asserted (clock 2), indicating that the write 
through cycle is guaranteed to complete on the 
memory bus. Once the 82495XP samples BGT# as- 
serted, it must finish that cycle on the memory bus. 
Prior to this point, the cycle can be aborted by a 
snoop hit from another cache. 


CNA# is asserted by the MBC (clock 3) to indicate 
that it is ready to schedule a new memory bus cycle. 
Note that after CNA# activation, cycle control sig- 
nals are not guaranteed to be valid. 


When the MBC has determined the cacheability at- 
tribute of the write through cycle, it drives the 
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MKEN# signal accordingly. The MBC also drives 
the KWEND# signal at this time, indicating the end 
of the cacheability window. The 82495XP samples 
MKEN# active during KWEND# (clock 4), indicat- 
ing that the missed line should be allocated in the 
cache. 


At the first available time (clock 5), the 82495XP as- 
serts CADS# to request an allocation cycle. The cy- 
cle control Signalsare valid at this point: MCACHE# 
is active, indicating the cacheability of the line-fill cy- 
cle; RDYSRC is not active, indicating that the MBC 
need not supply BRDY#s to the CPU (no BRDY#s 
are necessary for an allocation cycle). 


The MBC asserts SWEND# 
(clock 6) when the 
snoop window of the write through cycle ends on the • 
memory bus.. I 


The MBC'may 
return CRDY# 
to the 82495XPI 
82490XP cache any time after the closure of the 
snoop window. In this example, CRDY# is issued by 
the MBC in clock 8. At this time, the cycle progress 
signals for the allocation cycle may be issued by the 
MBC to complete the line fill. 


Once again, the MBC arbitrates for the memory bus 
and returns BGT# asserted (clock 9) for the alloca- 
tion cycle. The MBC also asserts CNA# 
and 
KWEND# at this time. The 82495XP back-invali- 
dates the CPU to maintain first and second level 
cache consistency, 


In clock 10, the 82495XP asserts CADS# for the 
write back cycle (since the miss was to a dirty line). 
CDTS# is asserted by the 82495XP six clocks later 
(clock 16). Note that CDTS# of the write back cycle 
is not asserted with CADS# since the data is not yet 
available in the 82490XP's write-back buffer. 


The MBC asserts SWEND# (clock 11) when the 
snoop window of the allocation cycle ends on the 
memory bus. 


At this time, the MBC may assert CRDY# to the 
82495XP/82490XP cache for the allocation cycle. 
CRDY# assertion will cause the data stored in the 
82490XP's memory cycle buffers to be latched into 
the cache array. 


On the memory bus, BGT#, CNA#, and KWEND# 
are sampled active in clock 18 for the write back 
cycle. The snoop window is closed two clocks later 
(clock 20) by the MBC with SWEND#, and the write 
back cycle is completed with CRDY# asserted in 
clock 22. 
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MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBAlE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output enables 
(MAOE11 and MBAOE11) 
should be asserted by the MBC. 


For Clocked Memory Bus Mode, the memory data 
output enable (MOOEII) has been asserted by the 
MBC to drive the memory data outputs. 


MEOCII is asserted by the MBC (clock 4) to latch 
MZBT 11 for the transfer, and end the current cycle 
on the memory bus (MBROY11 is not necessary 
since this example shows a single transfer write 
miss cycle). MZBT 11 is driven high by the MBC in 
order to force the read cycle to begin with the cor- 
rect burst address. MFRZ11 is driven inactive by the 
MBC here, allowing the line to be placed into the 
exclusive ([El) state and requiring the data to be 
written to main memory. 


For the allocation (line fill) cycle, MSElII 
is driven 
active by the MBC (clock 6) to allow sampling of 
MBROYII 
and to latch MZBTII 
for the transfer. 


MZBT11 is sampled on all MClK 
edges where 
MSElII 
is inactive. Once MSElII 
is sampled active 
by the 82495XP, the value of MZBTII sampled on 
the 
prior MClK 
is used for the 
next transfer. 


MDOEII is also deasserted in clock 6 to allow the 
data pins to be used as inputs for the allocation cy- 
cle. 


MBROY11 is driven active by the MBC in clocks 7 to 
9 to cause the memory burst counter to be incre- 
mented and data to be placed into the 82490XP 
cache 
memory cycle 
buffers. The 
MBC drives 
MEOCII asserted (clock 10) to end the allocation 
cycle on the memory bus and switch memory cycle 
buffers for the new cycle. MZBT11 is sampled and 
latched at this time for the next data transfer. 


MOOEII is asserted by the MBC (clock 16) to drive 
the memory data outputs for the write back cycle. 


The MBC again asserts MBROY11 (clocks 17 to 19) 
for the write back cycle to increment the memory 
burst counter and cause data to be read from the 
82490XP memory cycle buffers. The write back cy- 
cle ends on the memory bus and switches memory 
cycle buffers with MEOCII 
assertion (clock 20). 


MZBTII and MFRZII for the next transfer are sam- 
pled at this time. MFRZ11 need not be active since 
the cycle is not potentially allocatable. 


For Strobed Memory Bus Mode, the memory data 
output enable (MOOEII) has been asserted by the 
MBC to drive the memory data outputs for the write 
miss cycle. 
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MEOCII is driven active by the MBC (clock 4) to 
latch MZBT11 for the transfer, and end the current 
cycle on the memory bus (MOSTB is not necessary 
since this example shows a single transfer cycle). 
MZBT11 is driven high by the MBC in order to force 
the read cycle to begin with the correct burst ad- 
dress. MFRZII 
is driven deasserted by the MBC 
here, allowing the line to be placed into the exclu- 
sive ([El) state. 


For the allocation (line fill) cycle, MSElII 
is driven 
active by the MBC (clock 6) to allow MISTB opera- 
tion and to latch MZBT11 for the transfer. MISTB is 
toggled in clocks 8 to 10 to cause the memory burst 
counter to be incremented, and data to be placed 
into the 82490XP cache memory cycle buffers. 
Note: MISTB latches the memory bus data on both 
the rising and falling edges. MDOE11 is also deas- 
serted in clock 6 to allow the data pins to be used as 
inputs for the allocation cycle. 


The MBC drives MEOCII asserted (clock 11) to end 
the allocation cycle on the memory bus and switch 
memory cycle buffers for the new cycle. MZBT11 for 
the next cycle, is latched at this time on the falling 
edge of MEOCII. 


MOOEII is asserted by the MBC (clock 18) to drive 
the memory data outputs for the write back cycle. 


The MBC toggles MOSTB (clocks 19 to 21) for the 
write back cycle to increment the memory burst 
counter and cause data to 
be read from 
the 
82490XP memory cycle buffers. 


The write back cycle ends on the memory bus and 
switches memory cycle buffers with MEOCII asser- 
tion (clock 22). MZBTII and MFRZII for the next 
transfer are sampled at this time. MFRZ11 need not 
be active since the cycle is not potentially allocata- 
ble. 


8.3 Snooping Cycles 


8.3.1 SYNCHRONOUS SNOOPING MODE 
(HIT TO [M] LINE) 


Figure 8.7 illustrates a snoop hit to a dirty line se- 
quence occurring simultaneously with a CPU initiat- 
ed read miss cycle. This example assumes synchro- 
nous snooping mode (ie. requests for snoops are 
done via SNPSTBII from the MBC, sampled on the 
82495XP's ClK). 


82495XP Cache Controller/82490XP 
Cache RAM 


, 
~~~-+!~~r+--r~--~~~~~+-~~--~~-J 


SNPCYC- ~~!--~--~~--~--~~--~~--~--~~--~--r--; 


MTHIT" ~I. 
MHIn••" 
~. 


SNPBSY" 
, 
: 
\ I 
r.--: 
I 
I 
I 
I 
I 
I 
I 
I 


SNPSTB# 
~: 
I 
: 
: 
I: 
: 
lacnl:~ 
SNPINV'(llll~ 


~AOE. 
I 
I 
I 
I 
I 
I: 
: 
I 
I 
I 
----4-+-+-~_~_~_~~-J-l_t_l_~~-~-~-~-~ 


I 
I 
I 
I 
I 
'I 
I 
I 
: 
: 
: 
: 
I 
I 
I 
: 
: 
: 
CLOCKED 
MEMORY 
BUS 
MODE: 
: 
: 
: 
' 
, 
, 
': 
:'" 


: 
I 
I 
I 
: 
: 
: 
: 
I 
'::: 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
----~-~-L-r-L-~-~~-~-.-~-~-~--~-----~ 
STROBED 
~EMo'RY 
BUS 
MODE: 
: 
: 
: 
: 
: 
: 
: 
: 
! 
: 
: 
: 
: 
I.-....J_....L_ 
...•.. 
: 
: 
I 
: 
: 
: 
: 
: 
: 
: 
I 
: 
: 
: 
: 


MSEL-: 
: 
: 140: : 
'i' 
'i' 'i' 'i' : 
! 1 'i' \ll3: 
: 
-.tEOC.' 
, 
, 
I 
' 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
1 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


~ 
'---' 
, 
' 
, 
, 
, 
'qxfXt" 
~--,-- 
.• 
M.STB 
,'_illiiiM:, __ .1__j' -,'--r' __'__.1; 
".1.• ' 
, 
, 
I 
I 
I 
I 
I 


~ZBT# 
dili&ii8*ii(E)O(E 
1111_' 
t.trRz.~ 
xx 


, 
, 
, 
I 
I 
I 
I 
'I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
•••OOf:- 
I 
I 
I 
I" 
, 
! 
! 
, 
, 
, 
, 
, 
I 
I 
, 
, 
, 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
MDATA~i~ 


CADS- 


ADDRESS 


CDTS" 


CW/R" 


RDYSRC 


MCACHE" 


CRDY" 


SNPADS- 


MSEL" 


MEDC" 


MBRDY" 


MZBT" 


MFRZ" 


MDOE- 


MDATA 


CLK 


2 
3 
4 
5 
6 
7 
8 
9 


ill 
I 
I 
I 
I 
I 
I 
I 


~~_....L_~_~_~_~....J~~_~:~ 
I 
I 
I 
I 
I 
I 
I 
I 
: 
: 
:W: 
\J; 
I 
I 
:~ 
MxxjX\' r: 
I 
: 
: 
: 
:~ 
, 
, 
, 
I ~? 
CA~S" A,bar!",! 
CADS" RallOuod',' 
, 
UXXXXXXXX/'I 
~ 
, 
, 
, 
; 
: 
; 
:~ 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


SGT" 


MCLK 
r--r-~~l 
: 
I~ 
~ 
: 
~~~~~~~~~~~ 
I::: 
II~ 
I 
I 
I 
I 
~~ 
I 
I 
I 
I 
I 
I 
I 
I 
I 
xxxxxxxxxxxAl 
' 
I 
i 
i 
'\ 
I 
I 
" 
i 
I 
I 
_111111'I'rlll 


_'ii'il'ii'ii~ 
x'WlVlm'f 
I 
I 
I 
I 
I'-"~ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
, 
I 
I 
I 


o 
3 


240956-39 


I 


Figure 8-7. Synchronous Snooping Mode 
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CACHE CONTROL SIGNALS: 


In clock 1 SNPSTB# is asserted by the MBC, indi- 
cating to the 82495XP a request for snooping. The 
82495XP samples MAOE# (it must be inactive) in 
order to recognize the snoop request. It is latched 
together 
with 
the 
snoop 
address (MSET[0:101. 


MTAG[0:111, MCFA[0:6l), SNPINV, MBAOE#, and 
SNPNCA on the 82495XP's CLK during SNPSTB# 
assertion. The tag look-up is done immediately after 
SNPSTB# 
is sampled active since snoop opera- 
tions have the highest priority in the cache tag state 
arbiter. The 82495XP issues SNPCYC# (clock 2), 
indicating that the snoop look-up is in progress. The 
results of the look-up are driven to the memory bus 
via MTHIT# and MHITM# in the next clock after 
SNPCYC#. Since the snoop hit a modified line, both 
signals are asserted (clock 3). SNPBSY# is also is- 
sued to indicate that the 82495XP is busy with CPU 
back-invalidations, the 82490XP's snoop buffer is 
full, or a write back is to follow. The 82495XP will 
accept snoops only when SNPBSY# is inactive. 


Simultaneously with the memory bus activity due to 
the snoop request, the CPU initiates a read miss cy- 
cle. The 82495XP issues a memory bus request 
(CADS#), CDTS#, and cycle control signals to the 
MBC in clock 3. The MBC must wait for the pending 
snoop cycle to complete on the memory bus prior to 
servicing this read miss cycle. 


The 
memory 
bus 
address 
(MSET[10:01. 


MTAG[11:01, MCFA[6:0l) is not valid until MAOE# 
goes active after CRDY# of the snoop write back 
cycle is sampled active by the 82495XP and the 
CADS# is reissued (clock 16). 


In clock 4 the 82495XP issues SNPADS# and cycle 
control signals to the MBC, indicating a request to 
flush a modified line out of the cache. SNPADS# 
activation causes the MBC to abort the pending read 
miss cycle. It is the 82495XP responsibility to re-is- 
sue the aborted cycle after the completion of the 
write back, since BGT# was not asserted by the 
MBC. 


Data is loaded into the 82490XP's snoop buffer. 
Since 
SNPINV 
was 
sampled 
asserted 
by 
the 
82495XP (clock. 1) during SNPSTB# assertion, it 
back-invalidated the CPUs first level cache. 


The 82495XP asserts CDTS# (clock 10) indicating 
to the MBC that data is available in the snoop buffer. 
When the MBC complete the write back cycle on the 
memory 
bus, 
it 
activates 
CRDY# 
to 
the 
82495XP/82490XP 
cache. 
At 
this 
time, 
the 
82495XP deasserts SNPBSY# (clock 16) and re-is- 
sues the aborted read miss cycle (clock 16) by as- 
serting CADS# and CDTS#. 
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MEMORY BUS SIGNALS: 


For Clocked Memory Bus Mode, the memory data 
output enable (MDOE#) is not activated by the MBC 
to allow the memory data pins to be used as inputs. 


MSEL# is driven active by the MBC (clock 4) to al- 
low sampling of MBRDY# and to latch MZBT# for 
the read miss transfer. MZBT# is sampled on all 
MCLK rising edges where MSEL# is inactive. Once 
MSEL# is sampled active by the 82495XP, the val- 
ue of MZBT# sampled on the prior MCLK is used 
for the next transfer. 


Since the read miss cycle is aborted due to the 
snoop hit to a modified line (requires a write back 
cycle), no MEOC# is given. MSEL# is deasserted 
by the MBC (clock 6) and reasserted (clock 8) to 
allow latching of MZBT# for the snoop write back 
cycle and sampling of MBRDY# 
for that cycle. 


MFRZ# is also sampled at this time. 


The memory data output enable (MDOE#) signal is 
driven active by the MBC (clock 7) to drive the mem- 
ory data outputs. 


MBRDY# is driven active by the MBC in clocks 12 
to 14 to cause the memory burst counter to be incre- 
mented and data to be written from the 82490XP 
cache snoop buffers. The MBC drives MEOC# as- 
serted (clock 15) to end the write back cycle on the 
memory bus and switch memory cycle buffers for 
the new cycle. MZBT# and MFRZ# are sampled 
and latched at this time for the next data transfer. 


MDOE# is deasserted by the MBC (clock 16) to al- 
low the memory data pins to be used as inputs for 
the reissued read cycle. 


For Strobed Memory Bus Mode, the memory data 
output enable (MDOE#) has not been asserted by 
the MBC to allow the memory data pins to be used 
as inputs for the read miss cycle. 


MSEL# is asserted by the MBC (clock 16) to allow 
sampling of MISTB and latch MZBT# (on the falling 
edge of MSEL#) for the read miss transfer. 


Since the read miss cycle is aborted due to the 
snoop hit to a modified line (requires a write back 
cycle), no MEOC# is given. MSEL# is deasserted 
by the MBC (clock 5) and reasserted (clock 6) to 
allow latching of MZBT# for the snoop write back 
cycle 
and sampling of 
MOSTB for 
that 
cycle. 


MFRZ# is also sampled at this time. 


MOSTB is toggled in clocks 13 to 15 to cause the 
memory burst counter to be incremented, and data 


to be read from the 82490XP cache memory cycle 
buffers. Note: MOSTB latches the memory bus data 
on both the rising and falling edges. The MBC drives 
MEOC# asserted (clock 16) to end the snoop write 
back cycle on the memory bus and switch memory 
cycle 
buffers for 
the 
new cycle. 
MZBT# 
and 
MFRZ# for the next cycle, are latched at this time 
on the falling edge of MEOC#. 


MDOE# is deasserted by the MBC (clock 16) to al- 
low the memory data pins to be used as inputs for 
the reissued read miss cycle. 


8.3.2 
CLOCKED SNOOPING 
MODE 


Figure 8.8 illustrates a CPU initiated Read cycle 
which misses the 82495XP/82490XP cache and the 
subsequent line fill replaces non dirty data (eg. clean 
or empty). Simultaneous with the read request to the 
MBC, that device initiates a snoop to the 82495XP 
which misses that line in the cache. The snoop is the 
result of a write cycle on the memory bus by some 
other cache core; therefore, asserting the snoop in- 
validation signal (SNPINV) to this 82495XP. This ex- 
ample assumes Clocked Snooping Mode (Le.the re- 
quests for snoops are done via SNPSTB# from the 
MBC, sampled on the MBC's SNPClK). 


CACHE CONTROL SIGNALS: 


The 
CPU 
initiates 
the 
read 
cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is looked up. Once the 82495XP determines 
the cycle to be a cache miss, it issues CADS# 
(clock 1) and the associated cycle control signals to 
the MBC (eg. CW/R#, CM/lOll', CD/C#, RDYSRC 
MCACHE#) in order to schedule the cache line-fili 
operation. MCACHE# is active, indicating that the 
read miss in potentially cacheable by the 82495XP; 
RDYSRC is active, indicating that the MBC must 
supply BRDY#s to the CPU cache core. 


In clock 3, SNPSTB# is asserted by the MBC at this 
~ime,indicatin.gto the 82495XP a request for snoop- 
Ing. MAOE# ISdeasserted to allow the forthcoming 
snoop (the 82495XP will not recognize the snoop if 
MAOE# is active). It is latched together with the 
snoop 
address 
(MSET[O:10]' 
MTAG[O:11l, 
MCFA[O:6l), SNPINV, MBAOE#, and SNPNCA on 
the MBC's SNPClK rising edge during SNPSTB# 
assertion. SNPINV is asserted from the MBC since 
the cache core which initiated the snoop issued a 
write cycle on the memory bus. If the response of 
the snoop to this 82495XP was a cache hit, the con- 
tents would no longer be valid due that write. 
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Following synchronization to the 82495XP ClK, 
it 
issues SNPCYC# 
(clock 5), indicating that the 
snoop look-up is in progress. The results of the look- 
up are driven to the memory bus via MTHIT# and 
MHITM# in the next clock after SNPCYC#. Since 
the snoop was a miss in the cache, both signals are 
inactive (clock 6). Note that SNPBSY# will not be 
asserted since the snoop was a miss to this cache. 
The snoop from another cache is complete at this 
point, and the read miss cycle will continue. 


The MBC asserts MAOE# to allow this 82495XP to 
drive its address on the memory bus in order to com- 
plete the read miss cycle. The memory bus address 
(MSET[10:0]' MTAG[11:0], MCFA[6:0l) is valid after 
MAOE# assertion # (clock 6 for the read cycle in 
this example) and remains valid until after CNA# is 2 
sampled active by the 82495XP (clock 8). MALE and 
MBAlE may be used to hold the address as neces- 
sary. 


The MBC arbitrates for the memory bus and returns 
BGT# asserted (clock 6), indicating that the cycle is 
guaranteed to complete on the memory bus. Once 
the 82495XP samples BGT# asserted, it must finish 
that cycle on the memory bus. Prior to this point, the 
cycle can be aborted by a snoop hit from another 
cache. 


CNA# is asserted by the MBC (clock 7) to indicate 
that it is ready to schedule a new memory bus cycle. 
Note that after CNA# activation, cycle control sig- 
nals are not guaranteed to be valid. 


When the MBC has determined the cacheability at- 
tribute of the cycle, it drives the MKEN# signal ac- 
cordingly. The MBC also drives the KWEND# signal 
at this time, indicating the end of the cacheability 
window. The 82495XP samples MKEN# 
during 
KWEND# (clock 7) to determine that the cycle is 
indeed cacheable. 


The MBC asserts SWEND# when the snoop win- 
dow ends on the memory bus. The 82495XP sam- 
ples MWB/WT# during SWEND# (clock 9) and up- 
dates the cache tag state according to the consist- 
ency protocol. The closure of the snoop window also 
enables the MBC to start providing the CPU with 
data that has been stored in the 82490XP's memory 
cycle buffer. The MBC supplies BRDY#s to the CPU 
(clocks 9-12). 


The read miss cycle ends when CRDY# is driven 
active by the MBC (clock 12). It is at this time that 
the data in the 82490XP's memory cycle buffers is 
loaded into the cache SRAM. 
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Figure 8·8. Clocked Snooping Mode 
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MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output 
enables 
(MAOE# 
and 
MBAOE#) 
should be asserted by the MBC. (Note the use of 
MAOE# for snooping at the beginning of the cache 
control signals section.) MDOE# must be inactive to 
allow the data pins to be used as inputs. 


Some time after the address has been driven onto 
the memory bus, data will be supplied from the 
DRAM 
(main memory) to 
the 
82490XP cache 
SRAM. 


For Clocked Memory Bus Mode, MSEL# is driven 
active by the MBC (clock 6) to allow sampling of 
MBRDY# and to latch MZBT# for the transfer. 
MZBT# 
is sampled on all MCLK edges where 
MSEL# is inactive. Once MSEL# is sampled active 
by the 82495XP, the value of MZBT# sampled on 
the 
prior MCLK is used for the 
next transfer. 


MBRDY# is driven active by the MBC in clocks 7 to 
9 to cause the memory burst counter to be incre- 
mented and data to be placed into the 82490XP 
cache 
memory cycle 
buffers. The 
MBC drives 
MEOC# asserted (clock 10)to end the current cycle 
on the memory bus and switch memory cycle buffers 
for the new cycle. MZBT# is sampled at this time 
(when MEOC# is sampled asserted and MSEL# re- 
mains low) for the next transfer. 


For Strobed Memory Bus Mode, MSEL# is driven 
active by the MBC (clock 6) to allow MISTB opera- 
tion and to latch MZBT# (on the falling edge of 
MSEL#) for the transfer. MISTB is toggled in clocks 
8 to 10 to cause the memory burst counter to be 
incremented, 
and 
data 
to 
be 
placed 
into 
the 
82490XP cache memory cycle buffers. Note: MISTB 
latches the memory bus data on both the rising and 
falling edges. The MBC drives MEOC# asserted 
(clock 11) to end the current cycle on the memory 
bus and switch memory cycle buffers for the new 
cycle. MZBT# for the next cycle, is sampled at this 
time on the falling edge of MEOC#. 


8.3.3 
STROBED 
SNOOPING 
MODE 
(HIT TO [M) LINE) 


Figure 8.9 illustrates a snoop hit to a dirty line se- 
quence occurring simultaneously with a CPU initiat- 
ed read miss cycle. This example assumes strobed 
snooping mode (ie. requests for snoops are done 
from the falling edge of SNPSTB#). 
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CACHE CONTROL SIGNALS: 


In clock 
1 (totally asynchronous to any clock) 
SNPSTB# is asserted by the MBC, indicating to the 
82495XP a request for snooping. The 82495XP 
samples MAOE# (it must be inactive) in order to 
recognize the snoop request. It is latched together 
with the snoop address (MSET[0:10), MTAG[0:11), 
MCFA[0:6)), SNPINV, MBAOE#, and SNPNCA on 
falling edge of SNPSTB#. 
The 82495XP issues 
SNPCYC# (clock 3), indicating that the snoop look- 
up is in progress. The results of the look-up are driv- 
en to the memory bus via MTHIT# and MHITM# in 
the next clock after SNPCYC#. Since the snoop hit 
a modified line, both signals are asserted (clock 4). 
SNPBSY# 
is also 
issued to 
indicate that 
the 
82495XP is busy with CPU back-invalidations, the 
82490XP's snoop buffer is full, or a write back is to 
follow. The 82495XP.will accept snoops only when 
SNPBSY# is inactive. 


Simultaneously with the memory bus activity due to 
the snoop request, the CPU initiates a read miss cy- 
cle. The 82495XP issues a memory bus request 
(CADS#), CDTS#, and cycle control signals to the 
MBC in clock 1. The MBC must wait for the pending 
snoop cycle to complete on the memory bus prior to 
servicing this read miss cycle. 


The 
memory 
bus 
address 
(MSET[10:0), 


MTAG[11:0), MCFA[6:0)) is not valid until MAOE# 
goes active after CRDY# of the snoop write back 
cycle is sampled active by the 82495XP and the 
CADS# is reissued (clock 17). 


In clock 5 the 82495XP issues SNPADS# and cycle 
control signals to the MBC, indicating a request to 
flush a modified line out of the cache. SNPADS# 
activation causes the MBC to abort the pending read 
miss cycle. It is the 82495XP responsibility to re-is- 
sue the aborted cycle after the completion of the 
write back, since BGT# was not asserted by the 
MBC. 


Data is loaded into the 82490XP's snoop buffer. 
Since 
SNPINV was 
sampled 
asserted 
by 
the 
82495XP (clock 1) during SNPSTB# assertion, it 
back-invalidated the CPUs first level cache. 


The 82495XP asserts COTS# (clock 11) indicating 
to the MBC that data is available in the snoop buffer. 
When the MBC complete the write back cycle on the 
memory 
bus, 
it 
activates 
CRDY# 
to 
the 
82495XP/82490XP 
cache. 
At 
this 
time, 
the 
82495XP deasserts SNPBSY# (clock 17) and re-is- 
sues the aborted read miss cycle by asserting 
CADS# and CDTS#. 
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MEMORY BUS SIGNALS: 


For Clocked Memory Bus Mode, the memory data 
output enable (MDOE#) is not activated by the MBC 
to allow the memory data pins to be used as inputs. 


MSEL# is driven active by the MBC (clock 2) to al- 
low sampling of MBRDY# and to latch MZBT# for 
the read miss transfer. MZBT# is sampled on all 
MCLK rising edges where MSEL# is inactive. Once 
MSEL# is sampled active by the 82495XP, the val- 
ue of MZBT# sampled on the prior MCLK is used 
for the next transfer. 


Since the read miss cycle is aborted due to the 
snoop hit to a modified line (requires a write back 
cycle), no MEOC# is given. MSEL# is deasserted 
by the MBC (clock 11) and reasserted (clock 13) to 
allow latching of MZBT# for the snoop write back 
cycle and sampling of MBRDY# 
for that cycle. 
MFRZ# is also sampled at this time. 


The memory data output enable (MDOE#) signal is 
driven active by the MBC (clock 11) to drive the 
memory data outputs. 


MBRDY# is driven active by the MBC in clocks 13 
to 15 to cause the memory burst counter to be incre- 
mented and data to be written from the 82490XP 
cache 
memory cycle 
buffers. The 
MBC drives 
MEOC# asserted (clock 16) to end the write back 
cycle on the memory bus and switch memory cycle 
buffers for the new cycle. MZBT# and MFRZ# are 
sampled and sampled at this time for the next data 
transfer. 


MDOE# is deasserted by the MBC (clock 18) to al- 
low the memory data pins to be used as inputs for 
the reissued read cycle. 


For Strobed Memory Bus Mode, the memory data 
output enable (MDOE#) has not been asserted by 
the MBC to allow the memory data pins to be used 
as inputs for the read miss cycle. 


MSEL# is asserted by the MBC (clock 2) to allow 
sampling of MISTB and latch MZBT# (on the falling 
edge of MSEL#) for the read miss transfer. 


Since the read miss cycle is aborted due to the 
snoop hit to a modified line (requires a write back 
cycle), no MEOC# is given. MSEL# is deasserted 
by the MBC (clock 11) and reasserted (clock 13) to 
allow latching of MZBT# for the snoop write back 
cycle 
and sampling of 
MOSTB for 
that 
cycle. 
MFRZ# is also sampled at this time. 


MOSTS is toggled in clocks 14 to 16 to cause the 
memory burst counter to be incremented, and data 
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to be read from the 82490XP cache memory cycle 
buffers. Note: MOSTB latches the memory bus data 
on both the rising and falling edges. 


The MBC drives MEOC# asserted (clock 17) to end 
the snoop write back cycle on the memory bus and 
switch memory cycle buffers for the new cycle. 
MZBT# and MFRZ# for the next cycle, are sam- 
pled at this time on the falling edge of MEOC#. 


MDOE# is deasserted by the MBC (clock 18) to al- 
low the memory data pins to be used as inputs for 
the reissued read miss cycle. 


8.3.4 
CACHE 
TO CACHE 
TRANSFER 


8.3.4.1 
Read Cycles Causing 
a Snoop 
Hit 
to [M] Line 


Figure 8.10 illustrates CPU initiated Read cycles that 
miss the 82495XP/82490XP cache and replace a 
non-dirty (eg. clean) line in the cache. During the 
snoop window, the memory bus attribute which indi- 
cates a direct to [M] state transfer is sampled active. 
In such cycles, the 82495XP will instruct the MBC to 
perform a cache line-fill cycle on the memory bus. 
The request for data will not go to main memory, but 
instead will go to the controller of the cache which 
contained the modified data. The line is then written 
into the 82490XP's array, and data transferred to the 
CPU as requested. If the line fetched from the sec- 
ond cache replaces a line which is in valid unmodi- 
fied state ([E] or [S)), then a back-invalidation cycle 
is performed on the CPU bus to guarantee that the 
replaced data is also removed from the CPU's first 
level cache, thus maintaining the inclusion property. 


CACHE CONTROL SIGNALS: 


The 
CPU 
initiates 
the 
read 
cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is looked up. Once the 82495XP determines 
the cycle to be a cache miss, it issues CADS# 
(clock 2) and the associated cycle control signals to 
the MBC (eg. CW/R#, 
CM/IO#, 
CD/C#, 
RDYSRC, 


MCACHE#) in order to schedule the cache line-fill 
operation. MCACHE# is active, indicating that the 
read miss is potentially cacheable by the 82495XP; 
RDYSRC is active, indicating that the MBC must 
supply BRDY#s to the CPU cache core. 


The 
memory 
bus 
address 
(MSET[10:0], 


MTAG[ll :0], 
MCFA[6:0)) 
is valid with 
CADS# 


(clocks 2 and 13 for the two read miss cycles in this 
example) and remain valid until after CNA# is sam- 
pled active by the 82495XP (clocks 5 and 16). MALE 
and MBALE may be used to hold the address as 
necessary. 
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Figure 8-10. Cache to Cache Transfer: Cacheable Read Miss 
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The MBC arbitrates for the memory bus and returns 
BGT 11 asserted (clock 3), indicating that the cycle is 
guaranteed to complete on the memory bus. Once 
the 82495XP samples BGT11 asserted, it must finish 
that cycle on the memory bus. Prior to this point, the 
cycle can be aborted by a snoop hit from another 
cache. 


CNAII is asserted by the MBC (clock 4) to indicate 
that it is ready to schedule a new memory bus cycle. 
Note that after CNA 11 activation, cycle control sig- 
nals are not guaranteed to be valid. 


When the MBC has determined the cacheability at- 
tribute of the cycle, it drives the MKENII signal ac- 
cordingly. The MBC also drives the KWENDII signal 
at this time, indicating the end of the cacheability 
window. 
The 
82495XP 
samples 
MKEN11 
and 
MROII during KWENDII (clock 5) to determine that 
the cycle is indeed cacheable. 


The MBC asserts SWENDII when the snoop win- 
dow ends on the memory bus. The 82495XP sam- 
ples MWB/WTII 
and DRCTMII during SWENDII 
(clock 7) and updates the cache tag state according 
to the consistency protocol. Since the result of the 
snoop was a hit to a modified line in another cache, 
the MBC asserts DRCTM11 at this time (this is an 
option to save time by skipping the main memory 
access, not a requirement of the memory bus) so 
that the tag state will go immediately to the [M) 
state, skipping the [E) state. MWB/WT 11 must be in 
write back mode (high) to assure this transition. The 
closure of the snoop window also enables the MBC 
to start providing the CPU with data that has been 
stored in the 82490XP's memory cycle buffer. The 
MBC supplies BRDYlls to the CPU (clocks 7-10). 


The 82495XP issues a new CADS11 in clock 13, 
which also misses the 82495XP/82490XP cache. 
Since the 82495XP has already sampled CNAII as- 
serted (clock 4), It issues a new CADSII prior to 
receiving CRDY11 of the current cycle (ie. this cycle 
is pipelined within the MBC). Note that once the cy- 
cle progress signals (BGTII, 
CNAII, 
KWENDII, 
SWENDII) of a cycle are sampled asserted, the 
82495XP ignores them until the CRDY11 of that cy- 
cle. The 82495XP does not pipeline the cycle prog- 
ress signals (ie. it will not sample them again until 
after CRDY11 of the current memory bus cycle). 


MEMORY BUS CYCLES: 


At the start of this cycle, the master 82495XP does 
not know that the data will be coming from a slave 
82495XP/82490XP and begins a read request to 
main memory to obtain the required data. Since the 
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snoop resulted in a hit to a modified line in the sec- 
ond cache, the memory request must be backed off 
so that the snooped 82495XP may supply the data. 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output enables 
(MAOE11 
and MBAOE11) 
should be asserted by the MBC. The memory data 
output enable signal (MDOEII) must remain inactive 
to allow the data pins to be used as inputs. 


For Clocked Memory Bus Mode, MSELII is driven 
active by the MBC (clock 4) to allow sampling of 
MBRDY11 and to latch MZBT 11 for the transfer. 
MZBT11 is sampled on all MCLK edges where 
MSELII is inactive. Once MSELII is sampled active 
by the 82495XP, the value of MZBT11 sampled on 
the prior MCLK is used for the next transfer. • 
MBRDY11 is driven active in clocks 4 to 10 to read 
data into the 82490XP cache memory cycle buffers. 
The MBC asserts MEOC11 (clock 11) to end the 
read miss cycle on the memory bus and switch the 
memory cycle buffers for a new cycle. MZBT 11 is 
latched at this time for the next transfer, Note that 
there 
are 
8 
transfers 
needed 
to 
fill 
the 
82495XP/82490XP cache line and only 4 needed 
for the CPU line fill. 


MBRDY11 is again driven active by the MBC in 
clocks 11 to 21 to cause the memory burst counter 
to be incremented and data to be placed into the 
82490XP cache memory cycle buffers for the sec- 
ond read miss cycle. 


For Strobed Memory Bus Mode, MSEL11 is driven 
active by the MBC (clock 4) to allow MISTB opera- 
tion. and to latch MZBTII for the transfer (on the 
falling edge of MSELII). MISTB is toggled in clocks 
5 to 11 to cause the memory burst counter to be 
incremented, and 
data 
to 
be 
placed 
into 
the 
82490XP cache memory cycle buffers. Note: MISTB 
latches the memory bus data on both the rising and 
falling edges. The MBC drives MEOCII asserted 
(clock 12) to end the current cycle on the memory 
bus and switch memory cycle buffers for the new 
cycle. MZBT11 for the next cycle is latched at this 
time on the falling edge of MEOCII. 


The MBC toggles MISTB (clocks 16 to 21) for the 
second read miss cycle to increment the memory 
burst counter and cause data to be written into the 
82490XP memory cycle buffers. 
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Figure 
8-11. Read For Ownership 
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8.4 
Read for Ownership 


8.4.1 WRITE MISS WITH MFRZ# ASSERTED, 
FOLLOWED BY READ TO SAME LINE 


Figure 8·11 illustrates a Read For O~ners~ip cycle. 
First a CPU initiates a write cycle which misses the 
82495XP/82490XP cache. The MBC issues a "dum- 
my" write to main memory (the write does not actu- 
ally go out to main memory - to save valuable bus 
time). The 82490XP MFRZ iF input is used by the 
MBC to indicate that the following line-fill (allocation) 
data (from either main memory or another cache) 
should be merged with the data of the write miss. 
The entire line is then placed into the internal ta- 
gram. 


CACHE CONTROL SIGNALS: 


The 
CPU 
initiates 
a 
write 
cycle 
to 
the 
82495XP/82490XP 
cache where the cache tag 
state is looked up. Once the 82495XP determines 
the cycle to be a cache miss, it issues CADSiF 
(clock 1) and the associated cycle control signals to 
the MBC (eg. CW/RiF, CMIIOiF, CD/CiF, RDYSRC, 
MCACHEiF, PALLCiF) in order to schedule the write 
operation. MCACHEiF is not active; RDYS.RCis not 
active, indicating that 
the 
82495XP Will supply 
BRDYiFs to the CPU; PALLCiF is active, indicating a 
potential allocate cycle after the write through cycle. 


The write miss data is posted in the 82490XP's 
memory cycle buffer, and the cycle completes with 
no wait states to the CPU. The CPU is free to issue 
another (non-related) cycle while the 82495XP is 
processing the allocation. If this new cy~le is ~ 
cache hit it will be serviced by the 82495XP Immedi- 
ately; but if it is a cache miss, its service will wait 
until the CRDYiF of the allocation. 


The 
memory 
bus 
address 
(MSET[10:0], 


MTAG[11:0], 
MCFA[6:0l) 
is valid with 
CADSiF 
(clocks 1 and 5 for the write miss and allocation cy- 
cle in this example) and remain valid until after 
CNAiF is sampled active by the 82495XP (clocks 4 
and 10). MALE and MBALE may be used to hold the 
address as necessary. 


The MBC arbitrates for the memory bus and returns 
BGT iF asserted (clock 2), indicating that the write 
through cycle is guaranteed to complete on the 
memory bus. Once the 82495XP samples BGT iF as- 
serted, it must finish that cycle on the memory bus. 
Prior to this point, the cycle can be aborted by a 
snoop hit from another cache. 


CNAiF is asserted by the MBC (clock 3) to indicate 
that it is ready to schedule a new memory bus cycle. 
Note that after CNAiF activation, cycle control sig- 
nals are not guaranteed to be valid. 


82495XP 
Cache Controller/82490XP 
Cache RAM 


When the MBC has determined the cacheability at- 
tribute of the write through cycle, it drives the 
MKENiF signal accordingly. The MBC also drives 
the KWENDiF signal at this time, indicating the end 
of the cacheability window. The 82495XP samples 
MKENiF active during KWENDiF (clock 3), indicat- 
ing that the missed line should be allocated in the 
cache. 


The MBC asserts SWENDiF (clock 5) when the 
snoop window of the write through cycle ends on t~~ 
memory bus. Note that the direct to [M] state qualifi- 
er signal (DRCTMiF) is sampled during SW~NDiF 
and is inactive for the write . The MBC also Issued 
CRDYiF to the 82495XP at this time so that the 
82495XP thinks the write cycle completed on the 
memory bus when, in fact, it did not. 


In this example, the 82495XP requests the allocation 
cycle by issuing CADSiF in clock 5. The cycle.con- 
trol signals are valid at this point: MCACHEiF ISac- 
tive, indicating the cacheability of the line-fill cycle; 
RDYSRC is not active, indicating that the MBC need 
not supply BRDYiFs to the CPU (no BRDYiFs are 
necessary for an allocation cycle). 


Once again, the MBC arbitrates for the memory bus 
and returns BGTiF asserted (clock 7) for the alloca- 
tion cycle. The MBC asserts CNAiF, KWENDiF, and 
SWENDiF (clock 9) to pipeline the memory bus and 
close the cacheability and snoop windows. Note that 
(for this example) DRCTMiF 
is asserted during 
SWENDiF to place the line in the modified state. 
Since this is done, all other caches must invalidate 
their copies. 


CRDYiF for the allocation (line-fill) cycle is issued by 
the MBC in clock 11 to complete the read cycle on 
the 
memory bus and place the data 
into the 
82490XP cache array. 


MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in the flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output enables 
(MAOEiF and 
MBAOEiF) 
should be asserted by the MBC. 


For Clocked Memory Bus Mode, the memory data 
output enable (MDOEiF) has been asserted by the 
MBC to drive the memory data outputs. 


The MBC asserts MSELiF (clock 2) to allow sam- 
pling of MBRDYiF and to latch MZBTiF and MFRZiF 
for the write. MBRDYiF and MEOCiF are asserted 


• 
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the prior MCLK is used for the 
next transfer. 
MDOE# is also deasserted in clock 6 to allow the 
data pins to be used as inputs for the allocation cy- 
cle. 


MBRDY# is driven active by the MBC in clocks 7 to 
9 to cause the memory burst counter to be incre- 
mented and data to be placed into the 82490XP 
cache memory cycle buffers. During the line fill, the 
82490XP will merge the data from the write through 
buffer with the incoming data from either main mem- 
ory or another cache (if that line was a write hit to 
[M] in another cache). 


The MBC drives MEOC# asserted (clock 10) to end 
the allocation cycle on the memory bus and switch 
memory cycle buffers for the new cycle. MZBT# is 
sampled at this time for the next data transfer. 


For Strobed Memory Bus Mode, the memory data 
output enable (MDOE#) has been asserted by the 
MBC to drive the memory data outputs. 


The MBC asserts MSEL# (clock 2) to allow toggling 
of MISTB and to latch MZBT# and MFRZ# for the 
write (on MSEL# falling edge). MISTB is toggled 
and MEOC# asserted by the MBC (clock 2) to place 
the write data into the memory cycle buffers, sample 
MZBT# and MFRZ# for the next transfer (on the 
falling edge of MEOC# while MSEL# is active), and 
end the current cycle on the memory bus. MFRZ# is 
driven active by the MBC here, indicating to the 
82495XP that the data of the write through will be 
merged with the following allocation data. 


For the allocation (line fill) cycle, MSEL# is driven 
active again by the MBC (clock7) to allow sampling 
of MOSTB and to latch MZBT# for the transfer. 
MDOE# is also deasserted in clock 7 to allow the 
data pins to be used as inputs for the allocation cy- 
cle. 


MOSTB is toggled by the MBC in clocks 8 to 10 to 
cause the memory burst counter to be incremented 


2-382 


Figure 8-12 illustrates CPU initiated 1/0 cycles, both 
read and write. I/O writes are the only write cycles 
not posted by the 82495XP/82490XP cache (ie. the 
cycle is not fully acknowledged to the CPU until it 
has completed on the memory bus). 


CACHE CONTROL SIGNALS: 


The 
CPU initiates 
an 
1/0 
write 
cycle 
to 
the 
82495XP/82490XP. 
The 
82495XP 
then 
issues 
CADS# and CDTS# (clock 1) and the associated 
cycle control signals to the MBC (eg. CW/R#, CMI 
10#, CD/C#, RDYSRC, MCACHE#). MCACHE# in 
not active, indicating that the cycle is not cacheable; 
RDYSRC is active, indicating that the MBC must 
supply BRDY#s to the CPU/Cache core. 


The 
memory 
bus 
address 
(MSET[10:01, 


MTAG[11:01, MCFA[6:0l) 
is valid with 
CADS# 
(clocks 1 and 10 for the two read s in this example) 
and remain valid until after CNA# is sampled active 
by the 82495XP (clocks 6 and 17). MALE and MBA- 
LE may be used to hold the address as necessary. 


The MBC arbitrates for the memory bus and returns 
BGT# asserted (clock 2) for the 1/0 write cycle, in- 
dicating that the cycle is guaranteed to complete on 
the memory bus. Once the 82495XP samples BGT# 
asserted, it must finish that cycle on the memory 
bus. Prior to this point, the cycle can be aborted by a 
snoop hit from another cache. 


CNA# for the write cycle is asserted by the MBC 
(clock 5) to indicate that it is ready to schedule a 
new memory bus cycle. Note that SWEND# and 
KWEND# are not needed for 1/0 cycles since they 
are not cacheable. 


The MBC asserts BRDY# in clock 7 to complete the 
1/0 write cycle from the CPU, and CRDY# in clock 8 
to complete the cycle on the memory bus from the 
82495XP/82490XP cache. 


intel~ 
82495XP Cache Controller/82490XP 
Cache RAM 


2 
3 
<456 
, 
I 
9 , 10 , 11 
, 
I 
' 
, 
,17,18,, 
, 


240956-44 


ClK 


7 
8 


CADS" 
~ 
, 
, 
, 
,\f1 
}) ! 
)O!!Q.XlXXXXX 
I 
.., 
I 
I 
I 
I 
I 


'pM 1 
'**'*'*'*' I 


tJDU 
, 
,'= 


ADDRESS 


CW/R" 
CW/IO" 


RDYSRC IiII 
'~I 
, 
, 


, 
, 
, 
, 
1 
~ 
WCACHE" 
SGT" 
KWEND" 
SWEND" 


CNA" 


I 
I 
I 
I 
I 
I 
! 
i&' I 
I 
I 
I W'ld!.dl.bl' I 


~ •• 
~ 
XDiv 
\ i{lOnOnOnOn'V 
I 
I 
I 
I 
I 
BRDY" 
: 
i 
i \:J 
i 
i 
i 
i \..!.Ii 
, 
' 
\~r-! 


CRDY" 
, 
1 
'+' 
' '+' : 
----4---T-~-~-L-~--~-~-1-~-~-~~-~-~---~ 
! 
I 
I 
I 
I 
I 
CLOCKED hlEt.lORY BUS hlODE: 
1 
I, 


~ 
: :\'tmm 
I 
I 
I 
I 
, 


WClK 


WSEl" 
, 
, 


I 
I 
I 
I W 
I 
I 
I 
I 
I 
I 
lii 
!!O!!~ 
I 
I' 
I 
I 
I 
I 
I 
I 
xl'Xxl'XXI'XXV 
WXI'XXl'XxI'Xxl'XXlII' 


:u: , : 


~ 
Xl'Xx_XI'XXJ: 


MEOC" 
,, 
~, 
WBRDY" 


WZBT" 


MFRZ" 
, 
, 
, 


WDOE", 
,j\ 
I 
I:', 
, 
I 
, 
, 


1111~1111~ 
~ 
WDATA '!!O!!d!! 
.....: 
J'~ 
: 
I 
: 
: 
: 
I 
: 
: 
: 
: 
I 
: 
: 
I 
I 
I 
I 
I 
I 
- - - 
- +r - - 
- T - .• - ,- -I- -r -,- ;- 1- ~- r - 
-:- 
---{- -{ - ~ - -+ - ~ 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
STROBED hlEt.lORY BUS hlODE: 
1 
' 
, 
, 
'I 
::: 
1 
: 
: 
! 
I 
: 
: 
: 
: 
I 
I 
I 
I 
I 


WSEl" 
I 
,I 
, 
\ 
1 
'" 
J 
,I 
, 
I 
I 
i 
I 
MEOC" 
, 
I 
' 
, 
'\!..J' 
, 
, 
,I 
'" 
~ 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


WxSTB 
,OX J__l 
l __ ~,,~ 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


MZBT- ~!a!!v 
\fIXXXXXn1XXXPX 


MFRZ" 


I 
I 
I 
I 
I 
I 
I 
I 
, 
I 
' 
'I 
MOOE" 
, 
I 
' 
! 
I 
I 
I 
, 
I 
! 'I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
WDATA 
~ 
: 
m,d!!8'8.mgnmgr 
_ 
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A new CADS# is issued from the 82495XP in clock 
10 for an 1/0 read cycle, along with the associated 
cycle control signals. MCACHE# is again not active, 
and RDYSRC is again active. 


The MBC returns BGT# asserted right away (clock 
11). The 82495XP can pipeline 1/0 cycles, but does 
not for the 1/0 read in this example. 


Upon completing the access on the memory bus, 
the MBC activates BRDY# (clock 17) and CRDY# 
(clock 16). Note that BRDY# of a cycle may come 
before (as in the 1/0 write cycle of this example), 
with or after the CRDY# of the same cycle. 


MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output 
enables (MAOE# 
and 
MBAOE#) 
should be asserted by the MBC. 


For Clocked Memory Bus Mode, The memory data 
output enable signal (MDOE#) is asserted by the 
MBC in clock 3 to drive the memory data outputs. 


MEOC# is asserted by the MBC (clock 5) to latch 
MZBT# for the I/O write transfer, and end that cycle 
on the memory bus (MBRDY# is not necessary 
since this example shows a single transfer cycle). 
MZBT# is driven high by the MBC in order to force 
the write cycle to begin with the correct burst ad- 
dress. MFRZ# is also sampled here (it need not be 
active since the cycle is not potentially allocatable). 


For the 1/0 read cycle, MDOE# is deasserted (clock 
12) by the MBC to allow the data pins to be used as 
inputs. 


MSEL# is driven active by the MBC (clock 12) to 
allow sampling of MBRDY# and to latch MZBT# for 
the transfer. MZBT# is sampled on all MCLK edges 
where MSEL# is inactive. Once MSEL# is sampled 
active by the 82495XP, the value of MZBT# sam- 
pled on the prior MCLK is used for the next transfer. 
Again, MZBT# is driven high by the MBC to force 
the transfer to begin with the correct burst address. 


The MBC asserts MBRDY# (clock 14) to cause the 
memory burst counter to be incremented and data to 
be placed into the 82490XP cache memory cycle 
buffers. The MBC drives MEOC# asserted (clock 
15) to end the read cycle on the memory bus and 
switch memory cycle buffers for 
a new cycle. 
MZBT# for the next transfer is latched at this time. 
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For Strobed Memory Bus Mode, The memory data 
output enable signal (MDOE#) has been asserted 
by the MBC to drive the memory data outputs. 


MEOC# is asserted by the MBC (clock 5) to latch 
MZBT# for the 1/0 write transfer (on MEOC# falling 
edge), and end that cycle on the memory bus 
(MOSTB is not necessary since this example shows 
a single transfer cycle). MZBT# is driven high by the 
MBC in order to force the write cycle to begin with 
the correct burst address. MFRZ# is also sampled 
here (it need not be active since the cycle is not 
potentially allocatable). 


For the 1/0 read cycle, MDOE# is deasserted (clock 
10) by the MBC to allow the data pins to be used as 
inputs. 


MSEL# is driven active by the MBC (clock 10) to 
allow operation of MISTB and to latch MZBT# for 
the 
transfer 
(on 
MSEL# 
falling 
edge). 
Again, 


MZBT# is driven high by the MBC to force the trans- 
fer to begin with the correct burst address. 


The MBC toggles MISTB (clock 15) to cause the 
memory burst counter to be incremented and data to 
be placed into the 82490XP cache memory cycle 
buffers for the 1/0 read cycle. Note: MISTB latches 
the memory bus data on both the rising and falling 
edges. The MBC drives MEOC# asserted (clock 16) 
to end the read cycle on the memory bus and switch 
memory cycle buffers for a new cycle. MZBT# for 
the next transfer is latched at this time (on the falling 
edge of MEOC#). 


8.6 
LOCKed Cycles 


8.6.1 
CPU READ 
MODIFY 
WRITE 
CYCLES 


The 82495XP provides a facility to allow atomic ac- 
cesses requested by the CPU (via CPU LOCK# acti- 
vation) through the 82495XP KLOCK# signal. Fig- 
ure 8-13 illustrates two back-to-back CPU initiated 
Locked read-modify-write cycles. KLOCK# activa- 
tion indicates to the MBC that the memory bus 
should not be released between the KLOCKed cy- 
cles. KLOCK# will remain asserted from the begin- 
ning of the first cycle (with CADS#) until one clock 
after the CADS of the last cycle. The 82495XP does 
not distinguish between back-to-back locked opera- 
tions and will not open an arbitration window (deas- 
sert KLOCK#) between them. It is the responsibility 
of the MBCto distinguish between the multiple RMW 
sequences, if it is so desired. 
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Figure 8-13. LOCKed Read-Modify-Write 
Cycles 
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issues CADS# to the MBC (clock 1) along with the 
associated cycle control signals (eg. CW/R#, 
CM/ 
10#, CD/C#, RDYSRC, MCACHE#). MCACHE# is 
never asserted for LOCKed cycles; RDYSRC is ac- 
tive, indicating that the MBC must supply BRDY# to 
the CPU/Cache core. 


The 
memory 
bus 
address 
(MSET[10:01. 


MTAG[11:0), 
MCFA[6:0)) 
is valid with 
CADS# 
(clocks 1 and 5, then 7 and 11 for the two locked 
RMW sequences in this example) and remain valid 
until after CNA# is sampled active by the 82495XP 
(clocks 3 and 7, then 9 and 13). MALE and MBALE 
may be used to hold the address as necessary. 


The MBC arbitrates for the memory bus and returns 
BGT# asserted (clock 2), indicating that the cycle is 
guaranteed to complete on the memory bus. Once 
the 82495XP samples BGT# asserted, it must finish 
that cycle on the memory bus. Prior to this point, the 
cycle can be aborted by a snoop hit from another 
cache. 


CNA# for the read cycle is also asserted by the 
MBC (clock 2) to indicate that it may schedule a new 
memory bus cycle. Note that the cycle control sig- 
nals are not guaranteed to be valid after CNA# acti- 
vation. 


The MBC asserts BRDY# to the CPU/Cache core 
in clock 4. CRDY# for the locked read cycle is as- 
serted to the 82495XP/82490XP from the MBC 
(clock 5) to load the data stored in the 82490XP's 
memory cycle buffers into the cache array. If the 
read was to a dirty line, the 82495XP is intelligent 
enough to ignore the data in the memory cycle buff- 
ers and use the data in the cache array. 


Locked sequences always end in a write cycle, no 
new CPU initiated cycles may be inserted between 
the Locked read and Locked write cycles. Therefore, 
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low snooping. The 82495XP issues a memory bus 
request (CADS#) in clock 7 for the second locked 
read cycle, along with the new cycle control signals. 


The second locked RMW sequence repeats the ac- 
tions of the first. It's purpose in this example is to 
demonstrate that an arbitration window may not 
open between locked sequences if they follow one 
another with no idle or non-locked cycles between 
them. 


MEMORY BUS SIGNALS: 


The memory address latch enables (MALE and 
MBALE) may remain asserted by the MBC to place 
the address latches in flow through mode. If the 
82495XP is the current bus master, the memory ad- 
dress output enables (MAOE# 
and 
MBAOE#) 
should be asserted by the MBC. 


For Clocked Memory Bus Mode, MSEL# is driven 
active by the MBC (clock 3) to allow sampling of 
MBRDY# and to latch MZBT# for the transfer. 
MZBT# 
is sampled on all MCLK edges where 
MSEL# is inactive. Once MSEL# is sampled active 
by the 82495XP, the value of MZBT# sampled on 
the prior MCLK is used for the next transfer. 


The memory data output enable signal (MDOE#) 
must be inactive to allow the data pins to be used as 
inputs for the first locked read cycle. The MBC as- 
serts MEOC# (clock 4) to latch MZBT# for the next 
transfer, and end the current locked read cycle on 
the memory bus (MBRDY# is not necessary since 
this example shows a single transfer cycle). MZBT# 
is driven high by the MBC in order to force the read 
cycle to begin with the correct burst address. 


For the locked write cycle, MDOE# is asserted by 
the MBC (clock 5) to drive the memory data outputs. 


MEOC# is again asserted (clock 6) to latch MZBT# 
for the next transfer, and end the current locked 
write cycle on the memory bus (MBROY# is not 
necessary since this is a single transfer cycle). 
MZBT# is again driven high. MFRZ# is also sam- 
pled during write cycles when MEOC# is sampled 
active by the 82495XP. 


MOOE# is deasserted by the MBC (clock 7) to allow 
the data pins to be used as inputs for the second 
locked read cycle. MEOC# is again asserted (clock 
8) to latch MZBT# for the next transfer, and end the 
locked read cycle on the memory bus. MZBT# is 
again driven high. 


MOOE# is asserted by the MBC (clock 9) to drive 
the memory data outputs for the second locked write 
cycle. MBROY# is asserted (clock 13) to cause the 
memory burst counter to be incremented and data to 
be placed into the 82490XP cache memory cycle 
buffers. 
The 
MBC drives 
MEOC# 
active 
and 
MSEL# inactive (clock 14) to end the locked write 
cycle on the memory bus and switch memory cycle 
buffers for a new cycle. MZBT# and MFRZ# for the 
next transfer are sampled at this time. 


For Strobed Memory Bus Mode, MSEL# is driven 
active by the MBC (clock 1) to allow sampling of 
MxSTB and to latch MZBT# for the first locked read 
transfer (on the falling edge of MSEL#). 


The memory data output enable signal (MOOE#) 
must be inactive to allow the data pins to be used as 
inputs for the first locked read cycle. The MBC as- 
serts MEOC# (clock 3) to latch MZBT# for the next 
transfer (on MEOC# falling edge while MSEL# is 
active), and end the current locked read cycle on the 
memory bus (MISTB is not necessary since this ex- 
ample shows a single transfer cycle). MZBT# is 
driven high by the MBC in order to force the read 
cycle to begin with the correct burst address. 


For the locked write cycle, MOOE# is asserted by 
the MBC (clock 4) to drive the memory data outputs. 
MEOC# is again asserted (clock 6) to latch MZBT# 
for the next transfer, and end the current locked 
write cycle on the memory bus (MOSTB is not nec- 
essary since this is a single transfer cycle). MZBT# 
is again driven high. MFRZ# is also sampled on the 
falling edge of MEOC#. 


MOOE# is deasserted by the MBC (clock 7) to allow 
the data pins to be used as inputs for the second 
locked read cycle. MEOC# is again asserted (clock 
8) to latch MZBT# for the next transfer, and end the 
locked read cycle on the memory bus. MZBT# is 
again driven high. 
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MOOE# is asserted by the MBC (clock 9) to drive 
the memory data outputs for the second locked write 
cycle. MOSTB is toggled (clock 12) to cause the 
memory burst counter to be incremented and data to 
be placed into the 82490XP cache memory cycle 
buffers. 
The 
MBC 
drives 
MEOC# 
active 
and 
MSEL# inactive (clock 13) to end the locked write 
cycle on the memory bus and switch memory cycle 
buffers for a new cycle. MZBT# and MFRZ# for the 
next transfer are sampled at this time. 


9.0 
TESTABILITY 


Testing the 82495XP/82490XP chipset can be divid- 
ed into three categories: Built-In Self Test (BIST), 
Boundary Scan, and external testing. BIST performs 
basic device testing on the 82495XP. Boundary • 
Scan provides additional test hooks that conform to 
the IEEE Standard Test Access Port and Boundary 
Scan Architecture 
(IEEE Std.1149.1). Additional 
testing can be performed by using software written 
to test the 82490XP cache SRAM. 


9.1 
Built-In Self Test (BIST) 


BIST tests the internal functionality of the 82495XP. 
The 82495XP's BIST tests approximately 90% of 
the cache controller. It tests the tag RAM and com- 
parators. 


The 
82495XP 
BIST 
is 
initiated 
by 
driving 
SLFTST#(CROY#) low and HIGHZ#(MBALE) high 
at least 10 clocks before RESET goes inactive. The 
82495XP Cache Controller reports the result of BIST 
on the CAHOLO signal. When the self test com- 
pletes, the 82495XP drives FSIOUT# inactive and 
the BIST result on CAHOLO. If CAHOLO is driven 
active the BIST successfully passed. If CAHOLO is 
driven inactive, BIST detected a flaw in the cache 
controller. CAHOLO is valid for one clock after 
FSIOUT# deactivation and should be sampled on 
the rising edge of FSIOUT#. 


On the 82495XP, BIST only informs the system that 
a failure did or did not occur. BIST is not able to 
indicate where a failure occurred. After completing 
BIST the cache controller perform reset and begin 
normal operation. 


9.2 
Boundary Scan 


The 82495XP/82490XP chipset provides additional 
test ability features compatible with the IEEE Stan- 
dard Test Access Port and Boundary Scan Architec- 
ture (IEEE Std.1149.1). The test logic provided al- 
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lows for testing 
to insure that components 
function 
correctly, 
that 
interconnections 
between 
various 
components 
are correct, 
and that 
various 
compo- 
nents interact 
correctly 
on the printed 
circuit board. 


The boundary 
scan test logic consists 
of a boundary 
scan 
register 
and support 
logic that 
are accessed 
through 
a test access 
port (TAP). The TAP provides 
a simple 
serial 
interface 
that 
makes 
it possible 
to 
test all signal traces with only a few probes. 


The TAP can be controlled 
via a bus master. 
The 
bus master 
can be either 
automatic 
test equipment 
or a component 
(PLO) that interfaces 
to the four-pin 
test bus. 


9.2.1 
BOUNDARY 
SCAN 
ARCHITECTURE 


The boundary 
scan test logic contains 
the following 
elements: 


- 
Test access 
port (TAP), consisting 
of input pins 
TMS, TCK, and TOI; and ouput pin TOO. 


- 
TAP controller, 
which interprets 
the inputs on the 
test 
mode 
select 
(TMS) 
line and 
performs 
the 
corresponding 
operation. 
The 
operations 
per- 
formed 
by the 
TAP 
include 
controlling 
the 
in- 
struction 
and 
data 
registers 
within 
the 
compo- 
nent. 


- 
Instruction 
register 
(IR), which 
accepts 
instruc- 
tion codes 
shifted 
into the test logic on the test 
data 
input 
(TOI) pin. The 
instruction 
codes 
are 
used to select 
the specific 
test operation 
to be 
performed 
or the 
test 
data 
register 
to 
be ac- 
cesse'd. 


- 
Test 
data 
registers: 
The 
82495XP/82490XP 
chipset 
components 
each contain three test data 
registers: 
Bypass 
register 
(BPR), Oevice Identifi- 
cation 
register 
(010), and Boundary 
Scan regis- 
ter (BSR). 


The instruction 
and test data registers 
are separate 
shift-register 
paths connected 
in parallel 
and have a 
common 
serial data input and a common 
serial data 
output connected 
to the TAP signals, TOI and TOO, 
respectively. 


9.2.2 
DATA 
REGISTERS 


The 
82495XP 
and 
82490XP 
both 
contain 
the two 
required 
test 
data 
registers; 
bypass 
register 
and 
boundary 
scan register. 
In addition, 
they also have a 
device 
identification 
register. 
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Each test data register 
is serially 
connected 
to TOI 
and TOO, with TOI connected 
to the most significant 
bit and TOO connected 
to the least significant 
bit of 
the test data register. 
Oata is shifted 
one stage (bit 
position 
within 
the register) 
on each 
rising edge of 
the test clock 
(TCK). 


9.2.2.1 
Bypass 
Register 


The Bypass 
Register 
is a one-bit 
shift register 
that 
provides 
the minimal 
length 
path between 
TOI and 
TOO. This path can be selected 
when no test opera- 
tion is being 
performed 
by the component 
to allow 
rapid movement 
of test data to and from other com- 
ponents 
on the board. While the bypass 
register 
is 
selected 
data is transferred 
from TOI to TOO without 
inversion. 


9.2.2.2 
Boundary 
Scan 
Register 


The Boundary 
Scan Register 
is a single shift register 
path 
containing 
the 
boundary 
scan 
cells' that 
are 
connected 
to 
all 
input 
and 
output 
pins 
of 
the 
82495XP 182490XP 
chipset. 
Figure 
9.1 shows 
the 
logical structure 
of the boundary 
scan register. While 
output cells determine 
the value of the signal driven 
on the 
corresponding 
pin, input 
cells 
only 
capture 
data; they do not affect 
the normal operation 
of the 
device. 
Oata is transferred 
without 
inversion 
from 
TOI to TOO through 
the boundary 
scan register 
dur- 


ing scanning. 
The boundary 
scan register can be op- 
erated 
by the 
EXTEST 
and 
SAMPLE 
instructions. 
The 
boundary 
scan 
register 
order 
is described 
in 
section 
9.2.5. 


9.2.2.3 
Device 
Identification 
Register 


The Oevice Identification 
Register 
contains 
the man- 
ufacturer's 
identification 
code, 
part 
number 
code, 
and version 
code in the format 
shown 
in Figure 9.2. 


Table 
9.1 
lists 
the 
codes 
corresponding 
to 
the 
82495XP 
and 82490XP. 


Table 9-1_ Device 
ID Register 
Values 


Component 
Version 
Part 
Manufacturer 
Number 
Code 
Code 
Code 
Identity 


82495XP 
0495h 
0495h 
09h 
(AO or A1) OAh 


82495XP 
(BO) 
OBh 
0495h 
09h 


82490XP 
OOh 
49AOh 
09h 
(AO or A1) 
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r----------------------------------, 
I 
I 
I 
I 
I 
I 
I 
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BOUNDARY 
SCAN 
REGISTER 


p------------- 


SYSTEM 


LOGIC 


TCK 


SYSTEW 
LOGIC 1--~-4 
INPUT 


"'----- 
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240956-46 


SYSTEW 
BIDIRECTIONAL 
PIN 


SYSTEW 
3-STATE 
OUTPUT 


TOO 


Figure 9-1. Boundary Scan Register Structure 
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Figure 9-2. Device ID Register 


9.2.2.4 Runbist Register 


The Aunbist Aegister is a one bit register used to 
report the results of the 82495XP SIST when it is 
initiated by the AUNSIST instruction. This register is 
loaded with a "1" prior to invoking the SIST and is 
loaded with "1" 
upon successfullco.mpletion. "0" 
indicates a failure occurred during SIST. 


NOTE: 
82495XP AUNSIST is not available in the A-step- 
ping. 


9.2.3 INSTRUCTION REGISTER 


The Instruction Aegister (lA) allows instructions to 
be serially shifted into the device. The instruction 
selects the particular test to be performed, the test 
data register to be accessed, or both. The instruc- 
tion register is four (4) bits wide. The most significant 
bit is connected to TOI and the least significant bit is 
connected to TOO. There are no parity bits associat- 
ed with the Instruction register. Upon entering the 
Capture-lA TAP controller state, the Instruction reg- 
ister is loaded with the default instruction "0001", 
SAMPLE/PAELOAO. Instructions are shifted into 
the instruction register on the rising edge of TCK 
while the TAP controller is in the Shift-lA state. 
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tion codes "Iabeled "RESERVEO". These instruc- 
tions can put the component in an undeterminant 
state which can only be cleared by power on reset. 


Table 9-2. 82495XP 
Boundary 
Scan 
Instruction 
Codes 


Instruction 
Code 
Instruction 
Name 


0000 
EXTEST 


0001 
SAMPLE 


0010 
10COOE 


0011 
RESERVED 


0100 
RESERVED 


0101 
RESERVED 


0110 
RESERVED 


0111 
'RUNBIST 


1000 
'TRISTATE 


1001 
RESERVED 


1010 
PRIVATE 


1011 
PRIVATE 


1100 
PRIVATE 


1101 
PRIVATE 


1110 
PRIVATE 


1111 
BYPASS 


• RUNBIST 
and TRISTATE 
are boundary 
scan Instructions 
that will be implemented 
in the B-stepping 
of the 82495XP. 
They are not available 
on the A-stepping. 


EXTEST 
The instruction code is "0000". The EX- 
TEST instruction allows testing of cir- 
cuitry external to the component pack- 
age, typically board interconnects. 
It 
does so by driving the values loaded 
into the 82495XP boundary scan regis- 
ter out on the output pins corresponding 
to each boundary scan cell and cap- 
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r-Ht:LUf\U ~f\Mt-'Lt:/t-'Ht:LUAU nas two functions 
that it performs. When the TAP control- 
ler is in the Capture-OR state, the SAM- 
PLE/PRELOAO 
instruction 
allows 
a 
"snap-shot" of the normal operation of 
the component without interfering with 
that normal operation. The instruction 
causes boundary scan register cells as- 
sociated with outputs to sample the val- 
ue being driven by the 82495XP. It caus- 
es the cells associated with inputs to 
sample the value being driven into the 
82495XP. On both outputs and inputs 
the sampling occurs on the rising edge 
of TCK. When the TAP controller is in 
the 
Update-OR state, the 
SAMPLEI 
PRELOAO instruction preloads data to 
the device pins to be driven to the board 
by executing the EXTEST instruction. 
Oata is preloaded to the pins from the 
boundary scan register on the falling 
edge of TCK. 


10COOE 
The instruction code is "0010". The 10- 
COOE instruction selects the device 
identification register to be connected to 
TOI and TOO, allowing the devices iden- 
tification code to be shifted out of the 
device on TOO. Note that the device 
identification register is not altered by 
data being shifted in on TO!. 


BYPASS 
The instruction code is "1111 ". The BY- 
PASS instruction selects the bypass 
register to be connected to TOI and 
TOO, effectively bypassing the test logic 
on the 82495XP by reducing the shift 
length of the device to one bit. Note that 
an open circuit fault in the board level 
test data path will cause the bypass reg- 
ister to be selected following an instruc- 
tion scan cycle due to the pull-up resis- 
tor on the TOI input. This has been done 
to prevent any unwanted interference 
with the proper operation of the system 
logic. 


RUNBIST 
The instruction code is "0111". 
The 
RUNBIST instruction selects the one (1) 
bit runbist register, loads a value of "0" 
into the runbist register, and connects it 
to TDO. It also initiates the built-in self 
test 
(BIST) feature of the 82495XP, 
which is able to detect approximately 
90% 
of 
the 
stuck-at 
faults 
on 
the 
82495XP. The 82495XP ac/dc specifi- 
cations for VCC and ClK must be met 
and reset must have been asserted at 
least 
once 
prior 
to 
executing 
the 
RUNBIST boundary scan instruction. Af- 
ter 
loading the 
RUNBIST instruction 
code in the instruction register, the TAP 
controller must be placed in the Run- 
Test/Idle state. BIST begins on the first 
rising edge of TCK after entering the 
Run-Test/Idle state. The TAP controller 
must remain in the Run-Test/Idle state 
until BIST is completed. It requires 100K 
clock (ClK) 
cycles to complete BIST 
and report the result to the runbist regis- 
ter. After completing the 100K clock 
(ClK) cycles, the value in the runbist 
register should be shifted out on TDO 
during the Shift-DR state. A value of "1" 
being shifted out on TDO indicates BIST 
successfully completed. A value of "0" 
indicates a failure occurred. After exe- 
cuting the 
RUNBIST instruction, the 
82495XP must be reset prior to normal 
operation. NOTE: This instruction is not 
available 
on 
the 
A-stepping 
of 
the 
82495XP. It will be implemented in the 
B-stepping. 
TRISTATE The instruction code is "1000". 
The 
TRISTATE instruction initiates the tri- 
state output test mode. After loading the 
TRISTATE boundary scan instruction 
into the instruction register, the TAP 
controller must be placed in the Run- 
Test/Idle state. To terminate the tristate 
output test mode, the 82495XP must be 
reset. NOTE: This instruction is not 
available 
on 
the 
A-stepping 
of 
the 
82495XP. It will be implemented in the 
B-stepping. 


9.2.3.2 
82490XP 
Boundary 
Scan Instruction 
Set 


The 82490XP cache controller supports all three 
mandatory boundary scan instructions (BYPASS, 
SAMPlE/PRELOAD, 
and EXTEST) along with one 
optional instruction (IDCODE). Table 9.4 lists the 
82490XP boundary sc:an instruction codes. The in- 
structions listed as PRIVATE cause TDO to become 
enabled in the Shift-DR state and cause "0" to be 
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shifted out of TDO on the rising edge of TCK. Execu- 
tion of the PRIVATE instructions will not cause haz- 
ardous operation of the 82490XP. Note that system 
tests should not execute instruction codes labeled 
"INTEl 
RESERVED". These instructions can put 
the component in an undeterminant state which can 
only be cleared by power on reset. 


Table 9-3. 82490XP 
Boundary 
Scan 
Instruction 
Codes 


Instruction 
Code 
InstructIon 
Name 


0000 
EXTEST 


0001 
SAMPLE 


0010 
IDCODE 


0011 
INTEL RESERVED 


0100 
INTEl RESERVED 


0101 
INTEl RESERVED 


0110 
INTEl RESERVED 


0111 
INTEl RESERVED 


1000 
INTEl RESERVED 


1001 
INTERl RESERVED 


1010 
PRIVATE 


1011 
PRIVATE 


1100 
PRIVATE 


1101 
PRIVATE 


1110 
PRIVATE 


1111 
BYPASS 


EXTEST 
The instruction code is "0000". The EX- 
TEST instruction allows testing of cir- 
cuitry external to the component pack- 
age, typically board interconnects. 
It 
does so by driving the values loaded 
into the 82490XP boundary scan regis- 
ter out on the output pins corresponding 
to each boundary scan cell and captur- 
ing the values on 82490XP input pins to 
be 
loaded 
into 
their 
corresponding 
boundary scan register locations. I/O 
pins are selected as input or output, de- 
pending on the value loaded into their 
control setting locations in the boundary 
scan register. Values shifted into input 
latches in the boundary scan register 
are never used by the internal logic of 
the 82490XP. Note: after using the EX- 
TEST instruction, the 82490XP must be 
reset 
before 
normal 
(non-boundary 
scan) use. 
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SAMPlEI 
The instruction code is "0001". 
The 
PRElOAO 
SAMPlE/PRElOAO 
has two functions 
that it performs. When the TAP control- 
ler is in the Capture-OR state, the SAM- 
PlE/PRElOAO 
instruction 
allows 
a 
"snap-shot" of the normal operation of 
the component without interfering with 
that normal operation. The instruction 
causes boundary scan register cells as- 
sociated with outputs to sample the val- 
ue being driven by the 82490XP. It caus- 
es the cells associated with inputs to 
sample the value being driven into the 
82490XP. On both outputs and inputs 
the sampling occurs on the rising edge 
of TCK. When the TAP controller is in 
the 
Update-OR state, the SAMPlEI 
PRElOAO instruction preloads data to 
the device pins to be driven to the board 
by executing the EXTEST instruction. 
Data is preloaded to the pins from the 
boundary scan register on the falling 
edge of TCK. 
10COOE 
The instruction code is "0010". The ID- 
CODE instruction selects the device 
identification register to be connected to 
TOI and TOO, allowing the devices iden- 
tification code to be shifted out of the 
device on TOO. Note that the device 
identification register is not altered by 
data being shifted in on TOI. 


BYPASS 
The instruction code is "1111 ". The BY- 
PASS instruction selects the bypass 
register to be connected to TOI and 
TOO, effectively bypassing the test logic 
on the 82490XP byreducing the shift 
length of the device to one bit. Note that 
an open circuit fault in the .board level 
test data path will cause the bypass reg- 
ister to be selected following an instruc- 
tion scan cycle due to the pull-up resis- 
tor on the TOI input. This has been done 
to prevent any unwanted interference 
with the proper operation of the system 
logic. 


9_2_4 
TEST 
ACCESS 
PORT (TAP) 
CONTROLLER 


The TAP controller is a synchronous, finite state ma- 
chine. It controls the sequence of operations of the 
test logic. The TAP controller changes state only in 
response to the following events: 
1. A rising edge of TCK 
2. Power-up. 
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The value of the test mode state (TMS) input signal 
at a rising edge of TCK controls the sequence of the 
state changes. The state diagram for the TAP con- 
toiler is shown in figure 9.3. Test designers must 
consider the operation of the state machine in order 
to design the correct sequence of values to drive on 
TMS. 


9.2.4.1 
Test-Loglc-Reset 
State 


In this state, the test logic is disabled so that normal 
operation of the device can continue unhindered. 
This is achieved by initializing the instruction register 
such taht the 10COOEinstruction is loaded. No mat- 
ter what the original state of the controller, the con- 
troller enters Test-logic-Reset state when the TMS 
input is held high (1) for at least five rising edges of 
TCK. The controller remains in this state while TMS 
is high. The TAp· controller is also forced to enter 
this state at power-up. 


9.2_4.2 
Run-Test/ldle 
State 


A controller state between scan operations. Once in 
this state, the controller remains in this state as 
long as TMS is held low. In devices supporting the 
RUNBIST instruction, the BIST is performed during 
this state and the result is reported in the runbist 
register. For instructions not causing functions to ex- 
ecute during this state, no activity occurs in the test 
logic. The instruction register and all test data regis- 
ters retain their previous state. When TMS is high 
and a rising edge is applied to TCK, the controller 
moves to the Select-OR state. 


9.2.4.3 
Select-DR-Scan 
State 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-OR state, and a scan se- 
quence for the selected test data register is initiated. 
If TMS is held high and a rising edge is applied to 
TCK, the controller moves to the Select-IR-Scan 
state. 
. 


The instruction does not change in this state. 
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Figure 9-3. Tap Controller 
State Diagram 


9.2.4.4 
Capture-OR 
State 


In this state, the boundary scan register captures 
input pin data if the current instruction is EXTEST or 
SAMPLE/PRELOAO. The other test data registers, 
which do not have parallel input, are not changed. 


The instruction does not change in this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-0R state if TMS is high or the Shift-OR state if 
TMS is low. 


9.2.4.5 
Shift-OR 
State 


In this controller state, the test data register con- 
nected between TOI and'TOO as a result of the cur- 
rent instruction, shifts data one stage toward its seri- 
al output on each rising edge of TCK. 


The instruction does not change in this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-0R state if TMS is high or remains in the Shift- 
OR state if TMS is low. 
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9.2.4.6 Exlt1-0R State 


This is a temporary 
state. While in this state, if TMS 


is held high, a rising edge applied to TCK causes the 
controller 
to enter the Update-OR 
state, which termi- 
nates the scanning 
process. 
If TMS is held low and a 
rising edge is applied 
to TCK, the controller 
enters 
the Pause-OR 
state. 


The test data register 
selected 
by the current 
instru- 
citon retains its previous 
value during this state. The 
instruction 
does not change 
in this state. 


9.2.4.7 Pause-OR State 


The pause state allows the test controller 
to tempo-. 


rarily halt the shifting 
of data through 
the test data 
register 
in the serial path between 
TOI and TOO. An 


example 
of using this state could be to allow a tester 
to reload its pin memory from disk during application 
of a long test sequence. 


The test data register 
selected 
by the current 
instru- 


citon retains its previous 
value during this state. The 
instruction 
does not change 
in this state. 


The controller 
remains 
in this state as long as TMS 
is low. Whne 
TMS 
goes 
high and a rising edge 
is 
applied to TCK, the controller 
moves to the Exit2-0R 
state. 


9.2.4.8 Exlt2-0R State 


This is a temporary 
state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller 
to enter the Update-OR 
state, which termi- 
nates the scanning 
process. 
If TMS is held low and a 
rising edge is applied 
to TCK, the controller 
enters 
the Shift-OR 
state. 


The test data register selected 
by the current instru- 
citon retains its previous 
value during this state. The 


instruction 
does not change 
in this state. 


9.2.4.9 Update-OR State 


The 
boundary 
scan 
register 
is 
provided 
with 
a 
latched 
parallel 
output 
to prevent 
changes 
at the 
parallel 
output 
while 
data is shifted 
in response 
to 
the 
EXTEST 
and SAMPLE/PRELOAO 
instructions. 


When 
the 
TAP 
controller 
is in this 
state 
and 
the 
boundary 
scan 
register 
is selected, 
data is latched 
onto the parallel output of this register from the shift- 
register 
path on the falling 
edge of TCK. The data 
held at the latched 
parallel 
output 
does not change 
other than in this state. 
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All Shift-register 
stages in test data register 
selected 


by the current 
instruciton 
retains 
its previous 
val~e 
during this state. The instruction 
does not change 
In 
this state. 


9.2.4.10 Select-IR-Scan State 


This 
is a temporary 
controller 
state. 
The test 
data 
register selected 
by the. current instruction. r~tains its 


previous 
state. If TMS IS held low and a nSlng edge 


is applied 
to TCK when 
in this state, 
the controller 
moves 
into 
the 
Capture-IR 
state, 
and 
a scan 
se- 


quence for the instruction 
register is initiated. 
If TMS 


is held high and a rising edge is applied 
to TCK, the 
controller 
moves to the Test-Logic-Reset 
state. 


The instruction 
does not change 
in this state. 


9.2.4.11 Cllpture-IR State 


In this controller 
state the shift register 
contained 
in 


the instruction 
register 
loads the fixed value "0001" 


on the rising edge of TCK. 


The test data register 
selected 
by the current 
instru- 


citon retains its previous 
value during this state. The 


instruction 
does not change 
in this state. 


When the controller 
is in this state and a rising edge 
is applied 
to TCK, the controller 
enters 
the Exit1-IR 
state if TMS is held high, or the Shift-IR 
state if TMS 


is held low. 


9.2.4.12 Shlft-IR State 


In this 
state 
the 
shift 
register 
contained 
in the 
in- 
struction 
register 
is connected 
between 
TOI 
and 
TOO and shifts data one stage towards 
its serial out- 


put on each rising edge of TCK. 


The test data register 
selected 
by the current 
instru- 


citon retains its previous 
value during this state. The 


instruction 
does not change 
in this state. 


When the controller 
is in this state and a rising edge 
is applied 
to TCK, the controller 
enters 
the Exit1-IR 
state if TMS is held high, or remains 
in the Shift-IR 
state if TMS is held low. 


9.2.4.13 Exit1-IR State 


This is a temporary 
state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller 
to enter the Update-IR 
state, which termi- 


nates the scanning 
process. 
If TMS is held low and a 


rising edge is applied 
to TCK, the controller 
enters 


the Pause-IR 
state. 


The test data register selected by the current instru- 
citon retains its previous value during this state. The 
instruction does not change in this state. 


9.2.4.14 
Pause-IR 
State 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the instruction 
register. 


The test data register selected by the current instru- 
citon retains its previous value during this state. The 
instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising' edge is 
applied to TCK, the controller moves to the Exit2-IR 
state. 


9.2.4.15 
Exlt2-IR 
State 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Shift-IR state. 


The test data register selected by the current instru- 
citon retains its previous value during this state. The 
instruction does not change in this state. 


9.2.4.16 
Update-IR 
State 


The instruction shifted into the instruction register is 
latched onto the parallel output from the shift-regis- 
ter path on the falling edge of TCK. Once the new 
instruction has been latched, it becomes the current 
instruction. 


Test data registers selected by the current instruc- 
tion retain the previous value. 


9.2.5 
BOUNDARY 
SCAN 
REGISTER 
CELL 


The boundary scan register for each component 
contains a cell for each.pin, as well as cells for con- 
trol of I/O and tristate pins. 


9.2.5.1 
82495XP 
Boundary 
Scan Register 
Cell 


The following is the bit order of the 82495XP bound- 
ary scan register: (from left to right and top to bot- 
tom) 


82495XP 
Cache Controller/82490XP 
Cache RAM 


TDI- 
MKEN# 
KWEND# 
SWEND# 
BGT# 
CNA# 
BRDY# 
RESERVED CRDY# 
MWBWT# 
DRCTM# 
MRO# 
CWAY# 
FPFLD# 
SNPCYC# 
SNPBSY# MHITM# MTHIT# CAHOLD FSIOUT# 
PALLC# 
SNPADS# 
CADS# 
CDTS# 
CWR# 
CDC# 
CMIO# 
RDYSRC MCACHE# 
KLOCK# 
SMLN# NENE# CFA3 CFA2 TAG11 TAG10 TAG9 
T-AG8TAG7 TAG6 TAG5 TAG4 TAG3 TAG2 TAG1 
TAGO SET10 SET9 SET8 SET7 CLK SET6 SET5 
SET4 SET3 SET2 SET1 SETO CFA6 CFA5 CFA4 
CFA1 
CFAO ADS# 
LEN 
BLAST# 
BRDYC1# 
BRDYCU 
CACHE# LOCK# BLE# BOFF# KEN# 
AHOLD WR# MIO# DC# PWT PCD HITM# PCYC 
EADS# 
NA# 
INV 
WBWT# 
WAY 
WRARR# 
MCYC# BUS# MAWEA# WBWE# WBA WBTYP 
MCFAO MCFA1 MCFA4 MCFA5 MCFA6 MSETO 
MSET1 MSET2 MSET3 MSET4 MSET5 MSET6 2 
MSET7 MSET8 MSET9 MSET10 MTAGO MTAG1 
MTAG2 MTAG3 MTAG4 MTAG5 MTAG6 MTAG7 
MTAG8 MTAG9 MTAG10 MTAG11 MCFA2 MCFA3 
RESET MAOE# 
MBAOE# 
SNPCLK SNPSTB# 
EWBE# 
MPIC# 
SNPINV 
FLUSH# 
SNYC# 
SNPNCA MBALE MALE MACTL OCTL CFA4CTL 
CFA5CTL CACTL FPFLDCTL WBWTCTL NAC- 
TL-TDO 


"RESERVED" signals correspond to no connect 
"NC" signals on the 82495XP. 


EWBE# and MPIC# will be implemented in the 
82495XP B-stepping, omit from boundary scan reg- 
ister for A-stepping 82495XPs. 


All the *CTL cells are control cells that are used to 
select the direction of bidirectional pins or tristate 
output 
pins. If "1" 
is loaded 
into the 
control 
cell(*CTL), the associated pin(s) are tristated or se- 
lected as input. The following lists the control cells 
and their corresponding pins. 
1. MACTL controls the MSETO-10, MTAGO-11, 


and MCFAO-6 pins. 


2. OCTL controls the WAY, WRARR#, 
MCYC#, 


MAWEA#, BUS#, WBWE#, WBA, WBTYP,INV, 
EADS#, 
AHOLD, 
KEN#, 
BOFF#, 
BLE#, 
BRDYC2#, 
BRDYC1#, 
BLAST# , 
NENE#, 


SMLN#, 
KLOCK#, 
MCACHE#, 
RDYSRC, 
CMIO#, 
CDC#, 
CWR#, 
CDTS#, 
CADS#, 


SNPADS#, 
PALLC#, 
FSIOUT#, 
CAHOLD, 


MTHIT#, 
MHITM#, 
SNPBSY#, 
SNPCYC#, 


CWAY, EWBE#, and MPIC# output pins. 


3. CFA4CTL controls the CFA4 pin. 
4. CFA5CTL controls the CFA5 pin. 
5. CACTL 
controls 
the 
SETO-10, 
TAGO-11, 
CFAO-3, and CFA6 pins. 
6. FPFLDCTL controls the FPFLD# pin. 
7. WBWTCTL controls the WB/WT# 
pin. 


8. NACTL controls the NA# pin. 
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"RESERVED" 
signals correspond to no connect 
"NC" signals on the 82490XP. 


All the *CTL cells are control cells that are used to 
select the direction of bidirectional pins or tristate 
output 
pins. If "1" 
is loaded into the 
control 
cell(*CTL), the associated pin(s) are tristated or se- 
lected as input. The following lists the control cells 
and their corresponding pins. 
1. CDCTL controls the CDATAO-7 pins. 
2. MDCTL controls the MDATAO-7 pins. 


9.2.6 
TAP CONTROLLER 
INITIALIZATION 


The TAP controller is automatically intialized when a 
device is powered up. In addition, the TAP controller 
can be initialized by applying a high signal level on 
the TMS input for five TCK periods. 


9.2.7 
BOUNDARY 
SCAN SIGNAL 
DESCRIPTION 
AND TIMINGS 


The functionality of TDI, TMS, TOO, and TCK are 
described in Chapter 7. The A.C. timing specifica- 
tions for the boundary scan signals are located in 
Chapter 10. 


9.3 
Tri-State Output Test Mode 


The 82495XP has the ability to tri-state all of its out- 
puts and bidirectional pins and to disable all pull-ups 
and pull-downs. During tri-state output test mode all 
pins floated during bus hold as well as those which 
are 
never floated 
during normal operation 
are 
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9.4 
82490XP Cache SRAM Testing 


The 82490XP cache SRAM can be tested using 
standard cache memory testing techniques. Code 
must be written to: 
1. Flush and reset the 82495XP/82490XP/CPU 
cache 
2. Write 1's to every bit of a block of memory equal 
to the cache size 
3. Read the block of memory to fill the cache, tag- 
ging the data as read-only using the MRO# sig- 
nal 
4. Write O's to every bit in the block of memory 
5. Read the block, the cache hits should be all 1's 
6. Repeat the process, exchanging 0 for 1 and 1 for 
o 


In this example, the code to test the cache must be 
non-cacheable to the 82495XP. Also, the CPU 
cache must be on so that the 82495XP will perform 
line-fills. 


10.0 
AC/DC 
SPECIFICATIONS 


10.1 Background 


The 82495XP has four main interfaces: CPU Bus, 
memory bus controller, memory bus, and 82490XP. 
The memory bus controller is typically implemented 
with PLO devices. The MBC interface signal timings 
are, therefore, generated based on available, off- 
the-shelf PLOspecs. The memory bus interface was 
specified to suit a generic memory interface which 
works up to CPU frequency. 


intel~ 
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Cache RAM 


10.2 D.e.Specifications 


Table 
10-1. D.C. Specifications 


Vcc = 5V ± 5%, Tcase 
= 0 to + 85°C 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


Vu, 
Input Low Voltage 
-0.3 
+0.8 
V 
TTL Level 


VIH 
Input High Voltage 
2.0 
2.0 
Vcc+0.3 
V 
TTL Level 


VOl 
Output Low Voltage 
0.45 
V 
TTL Level (1) 


VOH 
Output High Voltage 
2.4 
V 
TTL Level (2) 


Icc 
Power Supply Current 
600 
mA 
82495XP 
@ 50 MHz, (3) 


300 
82490XP 
@ 50 MHz, (3) 


Power 
Power Dissipation 
3.00 
W 
82495XP 
@ 50 MHz, (3) 
1.50 
82490XP 
@ 50 MHz, (3) 


III 
Input Leakage Current 
±15 
uA 
0< 
VIN < vcc 


ILO 
Output Leakage Current 
±15 
uA 
o :<;; VOUT :<;; Vcc Tristate 


III 
Input Leakage Current 
200 
uA 
VIN = 0.45V, (4) 


CIN 
Input Capacitance 
14 
pF 
for 82495XP 
5 
for 82490XP 


Co 
Output Capacitance 
18 
pF 
for 82495XP 
15 
for 82490XP 


CI/O 
110 Capacitance 
18 
pF 
for 82495XP 
15 
for 82490XP 


CClK 
CLK Input Capacitance 
11 
pF 
for 82495XP 
5 
for 82490XP 


CTIN 
Test Input Capacitance 
15 
pF 
for 82495XP 
10 
for 82490XP 


CTOUT 
Test Output Capacitance 
15 
pF 
for 82495XP 
10 
for 82490XP 


CTCK 
Test Clock Capacitance 
15 
pF 
for 82495XP 
10 
for 82490XP 


NOTES: 
(1) Parameter 
measured 
at 4mA lIoad. 
For MCFA6-FCFAO, 
MSET10-MSETO, 
and MTAG11-MTAGO, 
this parameter 
is measured 
at 12 mA lIoad. 
(2) Parameter 
measured 
at 1mA lIoad. 
For MCFA6-MCFAO, 
MSET10-MSETO, 
and MTAG11-MTAGO, 
this parameter 
is measured 
at 2 mA lIoad. 
(3) Represents 
maximum 
power 
consumption 
given a typical 
cache 
cycle 
mix. 


(4) This parameter 
is for input with pullup. 
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10.3 A.C.Specifications 


NOTE: 
Please contact your local Intel Sales Office for the latest timing Information. 


All TTl timing specs are measured at 1.5V for both "0" and "1" logic level. 


Table 10-2. Clock, Reset, and Configuration 


Vcc = 5V ± 5%, Tease = Oto +85°C 
CL = 0 pF unless otherwise specified. 
All Inputs and Outputs are TTL Level. 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


to 
ClK, MClK, MOClK Frequency 
25 
50 
MHz 
1x clock 


t1 
ClK, MClK, MOClK Stability 
0.1 
% 


t2 
ClK, MClK, MOClK Period 
20 
40 
ns 
10·1 


t3 
ClK, MClK, MOClK High Time 
7 
ns 
10·1 
(1) 


t4 
ClK, MClK, MOClK low Time 
7 
ns 
10·1 
(1) 


t5 
ClK, MClK, MOClK Rise Time 
2 
ns 
(1) 


t6 
ClK, MClK, MOClK Fall Time 
2 
ns 
(1) 


t7 
RESET Setup Time 
6 
ns 
10-4 


t8 
RESET Hold Time 
2 
ns 
10-4 


t9 
RESET Duration 
8xt2 
ns 
10-4 
for 82495XP, (2) 


15xt2 
for 82490XP 


t10 
All Configurations CFG3-CFGO, 
10xt2 
ns 
10-4 
(3),'(4) 
SNPMD, MEMlDRV, C490lDRV, 
HIGHZ11, SlFTST 11 Setup Time 


t11 
All Configurations CFG3-CFGO, 
0 
ns 
10-4 
(3), (5) 
SNPMD, MEMlDRV, C490lDRV, 
HIGHZ11, SlFTST 11 Hold Time 


t12 
FLUSH11, SYNCII Setup Time 
8 
ns 
10-3 
for 82495XP, (6) 


t13 
FLUSH11 , SYNCII Hold Time 
1 
ns 
10-3 
for 82495XP, (7) 


t14 
FLUSH11 , SYNCII Duration 
2x12· 
ns 
(8) 


t15 
MOClK falling edge to MClK rising edge 
2 
ns 


NOTE: 
(1) Rise/Fall, 
High/low 
times 
measured 
between 
O.8V and 2.0V. 


(2) Power 
up reset duration 
should 
be 1 ms after Vcc and Cl.K are stable. 
If configuration 
inputs with pullups 
are left floated, 


10 us RESET 
duration 
is required. 


(3) Timing 
is referenced 
to reset falling 
edge. 
(4) 8ns setup time is required 
to guarantee 
recognition 
on next clock. 


(5) 1ns hold time is required 
to guarantee 
recognition 
on next clock. 


(6) To guarantee 
recognition 
on next clock. 
(7) Synchronous 
mode only. 
(8) Asynchronous 
mode only. To guarantee 
recognition. 
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Table 10-3. Memory Bus Controller 8249SXP/82490XP Interface 


Vcc = SV ± S%, Tease = Oto +8SoC 
CL = 0 pF unless otherwise specified. 
All Inputs and Outputs are TTL Level. 


Symbol 
Parameter 
Mln 
Mu 
Unit 
Figure 
Notes 


t30 
BRDY#, CRDY#, KWEND#, SWEND#, 
8 
ns 
10·3 
82495XP Only 
BGT#, CNA#, Setup Time 


t30a 
BRDY#, CRDY# Setup Time 
7 
ns 
10·3 
82490XP Only 


t31 
BRDY#, CRDY#, KWEND#, SWEND#, 
1 
ns 
10-3 
82495XP Only 


BGT#, CNA#, Hold Time 


t32 
CW/R#, CD/C#, CMI/O#, RDYSRC, 
2 
10 
ns 
10-2 
MCACHE#, KlOCK#, 
BlE#, 
PAllC#, 


CAHOlD, CWAY, FSIOUT#, CADS#, 
CDTS#, SNPADS# FPFlD# 
Valid Delay 


t33 
NENE#, SMlN# 
Valid Delay 
2 
14 
ns 
10-2 


t34 
MDATA Setup to ClK (clock before 
6 
ns 
10-3 
(1) 
BRDY11active) 


t35 
MDATA Valid Delay from ClK (ClK from 
2 
13 
ns 
10-2 
CDTS# valid, MDOE# active) 


t36 
MDATA Valid Delay from MDOE# active 
8 
ns 
10-2 


t37 
MDATA Float Delay from MDOE# inactive 
0 
14 
ns 


NOTE: 
(1) Even if MBRDY# 
or MISTB 
is not used for this cycle, 
the data must still be held according 
to t43. 


Table 10-4. 8249SXP Memory Interface 


Vcc = 5V ± S%, Tcase = Oto +8SoC 
CL = 0 pF unless otherwise specified. 
All Inputs and Outputs are TTL Level. 


Symbol 
Parameter 
Mln 
Mu 
Unit 
Figure 
Notes 


t50 
SNPClK Frequency 
8 
50 
MHz 
1x clock (10) 


t51 
SNPClK Period 
20 
125 
ns 
10-1 
(11) 


t52 
SNPClK High TIme 
7 
ns 
10-1 


t53 
SNPClK low Time 
7 
ns 
10-1 


t54 
SNPClK Rise Time 
2 
ns 
(1) 


t55 
SNPClK Fall Time 
2 
ns 
(1) 


t56 
MCFA6-MCFAO, MSET10-MSETO, 
2 
11 
ns 
10-5 
(2), (3) 
MTAG11-MTAGO Valid Delay 


t57 
MCFA6-MCFAO, MSET10-MSETO, 
2 
15 
ns 
10-5 
(4) 
MTAG11-MTAGO Float Delay 


t58 
MCFA6-MCFAO, MSET10-MSETO, 
2 
13 
ns 
10-5 
(5) 
MTAG11-MTAGOValid Delay 
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t62c 
MCFA6-MCFAO, 
MSET10-MSETO, 
MTAG11- 
8 
ns 
10-3 
(7c) 
MTAGO, SNPINV, SNPNCA, 
MAOE#, 
MBAOE#, 
SNPSTB# 
Setup Time 


t63a 
MCFA6-MCFAO, 
MSET10-MSETO, 
MTAG11- 
1 
ns 
10-3 
(7a) 
MTAGO, SNPINV, SNPNCA, 
MAOE#, 
MBAOE#, 
SNPSTB # Hold Time 


t63b 
MCFA6-MCFAO, 
MSET10-MSETO, 
MTAG11- 
8 
ns 
10-3 
(7b) 
MTAGO, SNPINV, SNPNCA, 
MAOE#, 
MBAOE# 
Hold Time 


t63c 
MCFA6-MCFAO, 
MSET10-MSETo, 
MTAG11- 
1 
ns 
10-3 
(7c) 
MTAGO, SNPINV, SNPNCA, 
MAOE#, 
MBAOE#, 


SNPSTB# 
Hold Time 


t64 
SNPSTB# 
Setup Time 
8 
ns 
10-3 
(8) 


t65 
SNPSTB # Hold Time 
1 
ns 
10-3 
(8) 


t66 
SNPSTB# 
Active/Inactive 
Time 
8 
ns 
10-3 
(9) 


t67 
MRO#, 
MKEN#, 
DRCTM#, 
MWB/WT# 
Setup 
8 
ns 
10-3 
Time 


t68 
MRO#, 
MKEN#, 
DRCTM#, 
MWB/WT# 
Hold 
1 
ns 
10-3 
Time 


t69 
MTHIT#, 
MHITM#, 
SNPBSY#, 
Valid Delay 
2 
12 
ns 
10-2 


t69a 
SNPCYC# 
Valid Delay 
2 
10 
ns 
10-2 


NOTES: 
(1) Rise/fall 
times 
measured 
between 
0.45V and 2.4V 
(2) See capacitive 
derating 
curves 
for additional 
loading 
delay. 


(3) Valid delay from 
MAOE#, 
MBAOE# 
going active 
(Iow) 


(4) Float delay from 
MAOE#, 
MBAOE# 
going inactive 
(high) 
(5) Valid delay from 
MALE or MBAlE 
if both MAOE#, 
MBAOE# 
are active 
(6) Valid delay from ClK 
only if MALE or MBAlE, 
MAOE# 
and MBAOE# 
are active 
(7) a. In clocked 
mode 
referenced 
to SNPClK 
rising edge 
b. In strobed 
mode referenced 
to SNPSTB# 
falling 
edge 
c. In synchronous 
mode, 
refer to ClK 


(8) Asynchronous 
clocked 
mode only. Timings 
referenced 
to SNPClK 


(9) Asynchronous 
signal. Time to guarantee 
recognition 
on next clock 


(10) SNPClK 
is only used for the clocked 
memory 
bus mode. 
SNPClK 
Min frequency 
not tested. 


(11) t51 > t2 
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Table 
10-5. 82490XP 
Clocked 
Mode 


Vcc = 5V ± 5%, Tease = Oto +85°C 
CL = 0 pF unless otherwise 
specified. 
All Inputs and Outputs are TTL Level. 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


t38 
MBRDY#, 
MSEL#, 
MEOC# 
Setup to MCLK 
5 
ns 
10-3 


t39 
MBRDY#, 
MSEL#, 
MEOC# 
Hold from MCLK 
2 
ns 
10·3 


t40 
MZBT #, MFRZ # Setup to MCLK 
5 
ns 
10-3 


t41 
MZBT#, 
MFRZ# 
Hold from MCLK 
2 
ns 
10-3 


t42 
MDATA Setup to MCLK 
5 
ns 
10-3 


t43 
MDATA Hold from MCLK 
2 
ns 
10-3 


t44 
MDATA Valid Delay from MCLK'MBRDY# 
2 
14 
ns 
10-2 


t45 
MDATA Valid Delay from MCLK'MEOC#, 
MCLK'MSEL# 
2 
20 
ns 
10-2 


t46 
MDATA Valid Delay from MOCLK 
2 
10 
ns 
10-2 


Table 10-6. 82490XP 
Strobed 
Mode 


Vcc = 5V ± 5%, Tease = Oto +85°C 
CL = 0 pF unless otherwise 
specified. 
All Inputs and Outputs are TTL Level. 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t85 
MISTB, MOSTB High Time 
12 
ns 
10-6 


t86 
MISTB, MOSTB Low time 
12 
ns 
10-6 


t87 
MEOC# 
High time 
8 
ns 
10-6 


t88 
MEOC# 
Low time 
8 
ns 
10-6 


t89 
MxSTB, MEOC# 
Rise time 
2 
ns 
(1) 


t90 
MxSTB, MEOC# 
Fall time 
2 
ns 
(1) 


t91 
MSEL # High time for restart 
8 
ns 
10-6 


t92 
MSEL# 
Setup before transition 
on MxSTB 
5 
ns 
10-8 


t93 
MSEL # Hold after transition 
on MxSTB 
10 
ns 
10-8 


t92 
MSEL # Hold after transition 
on MEOC # 
2 
ns 
10-8 


t95 
MxSTB transition 
tolfrom 
MEOC# 
falling transition 
10 
ns 


t96 
MZBT# 
Setup to MSEL# 
or MEOC# 
falling edge 
5 
ns 
10-7 


t97 
MZBT# 
Hold from MSEL# 
or MEOC# 
falling edge 
2 
ns 
10-7 


t98 
MFRZ# 
Setup to MEOC# 
falling edge 
5 
ns 
10-7 


t99 
MFRZ# 
Hold from MEOC# 
falling edge 
2 
ns 
10·7 


t100 
MDATA Setup to MxSTB or MEOC# 
falling transition 
5 
ns 
10·7 


t101 
MDATA Hold from MxSTB or MEOC# 
falling transition 
2 
ns 
10·7 


t102 
MDATA Valid Delay from MxSTB transition 
2 
14 
ns 
10-9 


t103 
MDATA Valid Delay from MEOC# 
falling transition 
or 
2 
20 
ns 
10-9 
MSEL # deactivation 


NOTE: 
(1) Rise/Fall 
times 
are measured 
between 
O.8V and 2.0V 
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Table 10-7. Test Mode 


Vcc = 5V ± 5%, Tcase = Oto +85°C 
Cl = 0 pF unless otherwise 
specified. 
All Inputs and Outputs are TTL Level. 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t120 
TCK Frequency 
25 
MHz 
1x clock 


t121 
TCK Period 
40 
ns 
(2) 


t122 
TCK High Time 
10 
ns 
@2.0V 


t123 
TCKLowTime 
10 
ns 
@0.8V 


t124 
TCK Rise Time 
4 
ns 
(1) 


t125 
TCKFallTime 
4 
ns 
(1) 


t126 
TDI, TMS Setup Time 
8 
ns 
10-10 


t127 
TDI, TMS Hold Time 
7 
ns 
10-10 


t128 
TOO Valid Delay 
3 
25 
ns 
10-10 


t129 
TOO Float Delay 


t130 
All Outputs Valid Delay 
3 
25 
ns 
10-10 
(3) 


t131 
All Outputs 
Float Delay 
36 
ns 
10-10 
(3) 


NOTES: 
(1) Rise/Fall 
times 
are measured 
between o.av and 2.0V Rise/Fall 
times 
can be relaxed 
by 1ns per 10ns increase 
in TCK 


period 
(2) TCK period 
;" ClK 
period 


(3) Parameter 
measured 
from TCK 
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t2,51.71 


Figure 10-1. Clock Waveform 
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Signal 


~ = t16,32,33, 
35, 36, 44, 45, 60,69 


Figure 10-2. Valid Delay Timings 
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ClK 


Slgnol 


Signol 
240956-51 


Ix = 130, 62a, 62c, 64, 67, 76, 85 
~ = 131, 63a, 63c, 65,68, 77, 86 
Figure 10-3a. Setup and Hold Timings In 
Strobed 
Snooping 
Mode 
Figure 10-3. Setup and Hold Timings 


ClK 


RESET 
t9 


t10 
t11 


Conlig 
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Figure 10-4. Reset and Configuration 
Timings 


CFAO 


WAOE*, 
WBAOE* 
t57 


WCFAO 
VALID 


WALE, 
WBAlE 
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Figure 10-5. Memory 
Interface 
Signals 


Slgnol 
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Figure 10-6. Actlvellnactlve 
Timing 


2-403 


Signal 


t92 
t93,94 
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Figure 10-8, Setup and Hold Timing 


STB 


tl02,103 


XXXXXX 
XXX 
VALID 
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Signal 
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Figure 10-9. Valid Delay Timing 


TCK 


TOl 


Tt.4S 


TOO 


Output 
Signals 
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Figure 10-10. Test Timings 


10.4 Optimized Interface 
Specifications 


The optimized interface is the high performance in- 
terconnect between the i860 XP microprocessor, 
82495XP cache controller, and 82490XP cache 
RAM. This interface is tuned for the known configu- 
ration options of the chip set and includes non-stan- 
dard input and output buffers optimized for the de- 
fined electrical environment of each signal path. The 
specification of this interface is also non-standard. 
The output delay of a signal driver and the input set- 
up time of the corresponding receiver are summed 
into a single value. The time remaining in a clock 


82495XP 
Cache Controller/82490XP 
Cache RAM 


period must account for clock skew and signal trace 
delay. In designing the layout between these compo- 
nents, the designer must ensure that each signal in 
the optimized interface is carefully simulated using 
buffer models, and that signal timing achieves the 
specified limits. 


Tables 10-8 through 10-11 define the clock skew 
and combined setup + valid delay spec for each 
path in the optimized interface. 


NOTE: 
Please 
contact 
your 
local 
Intel 
Sales 
office 
for 
the latest timing Information. 


Table 
10-8. Signal Group: 1860TM XP CPU to Cache 
256 Kbyte Version 


Combined 
Maximum 
Driver 
Receiver 
Setup + 
ClK 
Valid Delay 
Skew 


CPUA3 
495 CFAO 
16.7 
0.5 


CPUA4-A5 
495 CFA1, CFA6 
16.7 
0.5 


CPU A6-A16 
495 SETO-SET10 
16.7 
0.5 


CPU A17-A21 
495 TAGO-TAG4 
17.9 
0.5 


CPU A22-A28 
495 TAG5- TAG11 
17.9 
0.5 


CPU A29-A31 
495 CFA2-CFA4 
17.9 
0.5 


495 CFAO* 
CPUA3 
31.9 
0.5 


495 CFA1, CFA6* 
CPUA4,A5 
31.9 
0.5 


495 CFA2-CFA4* 
CPUA29-A31 
37.4 
0.5 


495 SETO-SET10* 
CPUA16-A16 
31.9 
0.5 


495 TAGO- TAG4* 
CPUA17-A21 
37.4 
0.5 


495 TAG5- TAG11* 
CPUA22-A28 
37.4 
0.5 


CPU A3-A16 
490 A2-A15 
16.2 
1.0 


CPUADS# 
495 ADS# 
16.8 
0.5 


CPUADS# 
490 ADS# 
16.3 
1.0 


CPU HITM# 
495 HITM# 
16.8 
0.5 


CPU HITM# 
490 HITM# 
16.3 
1.0 


CPUW/R# 
495 
W/R# 
16.8 
0.5 


CPUW/R# 
490 
W/R# 
16.3 
1.0 
.Total time for these 
signals 
IS 40 ns. they are driven one clock 
before 
EADS# 
. 
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Combined 
Maximum 
Driver 
Receiver 
Setup + 
CLK 
Valid Delay 
Skew 


CPU CACHE# 
495 CACHE# 
17.9 
0.5 


CPUCTYP 
495 CFA5 
16.8 
0.5 


CPU D/C# 
495 
D/C# 
17.9 
0.5 


CPU LEN 
495 LEN 
18.0 
0.5 


CPU LOCK# 
495 LOCK# 
18.0 
0.5 


CPU M/IO# 
495 M/IO# 
17.9 
0.5 


CPU PCD 
495 PCD 
17.9 
0.5 


CPU PCYC 
495 PCYC 
17.9 
0.5 


CPUPWT 
495 PWT 
17.9 
0.5 


495 AHOLD 
CPUAHOLD 
17.6 
0.5 


495 BOFF# 
CPUBOFF# 
17.6 
0.5 


495 BRDYC1# 
CPUBRDY1# (RSRVD) 
18.1 
0.5 


495 EADS# 
CPUEADS 
17.6 
0.5 


495 EWBE# 
CPUEWBE 
17.6 
0.5 


495 INV 
CPUINV 
17.6 
0.5 


495 KEN# 
CPUKEN# 
17.6 
0.5 


495 NA# 
CPUNA# 
17.6 
0.5 


495 WB/WT# 
CPUWB/WT# 
17.6 
0.5 


Table 10-9. Signal Group: 1860™ XP CPU to 82495XP 
256 Kbyte Version 


Combined 
Maximum 
Driver 
Receiver 
Setup + 
CLK 
Valid Delay 
Skew 


CPU BE7#-BEO# 
490 BE# 
16.3 
1.0 


CPUBE7#-BEO# 
490 CDATA7-4 
16.3 
1.0 


CPUD63-DO 
490 CDATA7-0 
16.9 
1.0 


490 CDATA7-0 
CPUD63-DO 
16.8 
1.0 


CPUDP7-DPO 
490 CDATA3-0 
16.9 
1.0 


490 CDATA3-0 
CPUDP7-DPO 
16.8 
1.0 


Table 10-10. Signal Group: 1860TMXP CPU to 82490XP 
256 Kbyte Version 
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Table 
10-11. Signal Group: 82495XP 
to 82490XP 
, 
256 Kbyte Version 


Combined 
Maximum 
Driver 
Receiver 
Setup + 
ClK 
Valid Delay 
Skew 


495 BLAST# 
490 BLAST# 
17.2 
1.0 


495 BOFF# 
490 BOFF# 
17.5 
1.0 


495BRDYC2# 
490 BRDYC# 
17.3 
1.0 


495 BUS# 
490 BUS# 
16.9 
1.0 


495 MAWEA# 
490 MAWEA# 
16.9 
1.0 


495 MCYC# 
490 MCYC# 
16.9 
1.0 


495 WAY 
490 WAY 
17.0 
1.0 


495WBA 
490WBA 
16.8 
1.0 


495WBTYP 
490WBTYP 
16.9 
1.0 


495WBWE# 
490WBWE# 
16.9 
1.0 


495WRARR# 
490WRARR# 
17.1 
1.0 


10.5 
The First Order Electrical Buffer 
Model 
' 


The first order electrical buffer model provides an 
accurate and simple representation of the buffers 
used in the inputs and outputs of the 82495XP/ 
82490XP. The model output consists of four compo- 
nents: 


1. Linear voltage waveform (dV/dt) 
2. Intrinsic buffer delay due to CL (to) 
3. Buffer output impedance (Ra) 
3. Buffer output capacitance (Co) 


as shown in Figure 10-11. 


A fitting algorithm has been used to arrive at values 
for dV/dt. Co. and Ra such that Ra matches the 
actual buffer impedance and Co. the intrinsic buffer 
output capacitance whether the output is on or off. 
remains constant across the operating range while 
minimizing the difference between the full buffer cir- 
cuit and its simplified electrical model for a set of 
different loads (lumped capacitance. and short and 
long transmission lines). dV/dt is the slope of the 
voltage ramp. while to is the intrinsic buffer delay 
associated with a given CL.to accounts for the intrin- 
sic delay by offsetting the excitation of the model by 
the amount of the delay. 
. 


NOTE: 
to is zero for CL = 0 and when the load is repre- 
sented by a transmission line. 


•• 


Figure 10-11. Output Model 


The input model consists of one component. buffer 
capacitance (CIN).as shown in Figure 10-12. 
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Figure 10-12. Input Model 


10.5.1 
FIRST 
ORDER 
ELECTRICAL 
MODEL 
PARAMETER 
VALUES 


The parameters that make up the first order electri- 
cal model vary with the buffer design. In addition. 
these parameters also vary with the operating condi- 
tion (Le.temperature and VcC>of the buffer and pro- 
cess. The typical process corner is being modeled. 
Three sizes of buffer are used on these compo- 
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nents, labelled 
here as small, large, and extra large. 
The parameter 
values found in tables 
10-12 through 
10-14 list dV/dt, 
to, Ra, and Co. These 
parameters 
are 
provided 
for 
both 
low-to-high 
and 
high-to-Iow 
transitions 
at the typical process 
corner for three op- 
erating 
conditions 
(Vcc = 5.5V and TJ = 
-10·C, 
vcc = 5.0V and TJ = 80·C, and vcc = 4.5 
and 
TJ = 125·C. 


10.5.2 
PACKAGE 
PARAMETERS 


In addition to buffer characteristics, 
package 
charac- 


teristics 
are also 
included 
to complete 
the 
model. 


Package 
inductance, 
capacitance 
and 
resistance 
values vary with design geometry 
and material prop- 


erties 
of the package. 
Figure 
10-13 shows 
a model 
of 
the 
package 
including 
these 
parameters 
and 
should 
be placed 
between 
the first order 
electrical 
buffer 
model 
as shown 
in Figure 
10-14 
and 
the 
board interconnects. 
Notice the package 
model only 
includes 
the 
package 
inductance 
(Lp) and capaci- 
tance 
(Cp). This is sufficient 
since the package 
re- 
sistance 
is so small it is negligible. 


Tables 
10-15 and 10-16 list the buffer model param- 
eters 
for 
each 
pin of the 
82495XP 
and 
82490XP 
cache 
components. 
The 
tables 
give 
the 
package 
model parameters 
for each pin, followed 
by the input 
capacitance 
(input and 1/0 pins) andlor output buff- 
er size (outputs 
and 1/0). In those cases where 
the 
buffer 
used by a pin is an option 
selected 
at reset, 


the output 
buffer column 
lists the sizes available. 
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Figure 
10-13. Package 
Model 
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Figure 
10-14a. Output 
Buffer 
and Package 
Model 


4> 


~ 
q, T 
T GM 


240956-66 


Figure 
10-14b.lnput 
Buffer 
and Package 
Model 


10.5.3 
BOARD 
INTERCONNECTS 


The 
board 
interconnect 
can 
be 
considered 
as a 


lumped parameter 
(capacitive 
load) or as a transmis- 


sion line. As a rule of thumb, an unterminated 
board 
interconnect 
may be considered 
as a capacitive 
load 
if the round 
trip time (time for signal 
to travel 
from 
one end of the interconnect 
to the other and back) is 
short compared 
to the transition 
time of the signal. 


At frequencies 
of 50 MHz and above 
most intercon- 
nects 
behave 
as transmission 
lines (Figure 
10-15). 


For 
accurate 
results 
at 
high 
frequencies, 
these 
transmission 
line effects 
must be taken into account 
and modeled. 


~ 
T 
T 
•• • 
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Figure 
10-15. Transmission 
Line Model 
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Table 1()"12.Small Buffer First Order Electrical 
Model Parameter 
Values 


Transition 
Vee (V) 
TJ(C) 
Ro (.0.) 
Co (pF) 
dV/dt 
to (ns) CL (pF) = 


0 
5 
25 
50 
100 
150 


low-to- high 
5.5 
-10 
28.0 
4.3 
5.5/1.2 
0 
0.0 
0.1 
0.3 
0.7 
1.1 


low-to-high 
5.5 
80 
36.4 
4.3 
5.5/1.4 
0 
0.0 
0.1 
0.8 
0.8 
1.2 


low-to-high 
5.5 
125 
40.4 
4.3 
5.5/1.5 
0 
0.0 
0.1 
0.4 
0.8 
1.2 


low-to-high 
5.0 
-10 
30.2 
4.3 
5.0/1.2 
0 
0.0 
0.1 
0.4 
0.8 
1.2 


low-to-high 
5.0 
80 
39.2 
4.3 
5.0/1.4 
0 
0.0 
0.2 
0.4 
0.9 
1.3 


low-to-high 
5.0 
125 
43.5 
4.3 
5.0/1.6 
0 
0.0 
0.2 
0.4 
0.9 
1.3 


low-to-high 
4.5 
-10 
33.0 
4.3 
4.5/1.2 
0 
0.0 
0.2 
0.5 
1.0 
1.4 


low-to-high 
4.5 
80 
42.8 
4.3 
4.5/1.6 
0 
0.0 
0.2 
0.5 
1.0 
1.5 


low-to-high 
4.5 
125 
47.4 
4.3 
4.5/1.6 
0 
0.0 
0.3 
0.6 
1.1 
1.6 


high-to-Iow 
5.5 
-10 
23.2 
4.3 
5.5/1.0 
0 
0.0 
0.4 
0.7 
1.2 
1.6 


high-to-Iow 
5.5 
80 
31.4 
4.3 
5.5/1.4 
0 
0.0 
0.4 
0.9 
1.3 
1.8 


high-to-Iow 
5.5 
125 
36.1 
4.3 
5.5/1.6 
0 
0.0 
0.5 
0.8 
1.3 
1.8 


high-to-Iow 
5.0 
-10 
24.0 
4.3 
5.0/1.1 
0 
0.0 
0.5 
0.9 
1.2 
1.7 


high-to-Iow 
5.0 
80 
-32.8 
4.3 
5.0/1.4 
0 
0.0 
0.5 
0.9 
1.5 
1.9 


high-to-Iow 
5.0 
125 
37.8 
4.3 
5.0/1.7 
0 
0.0 
0.5 
0.9 
1.4 
1.8 


high-to-Iow 
4.5 
-10 
25.1 
4.3 
4.5/1.2 
0 
0.0 
0.4 
0.7 
1.2 
1.7 


high-to-Iow 
4.5 
80 
34.5 
4.3 
4.5/1.6 
0 
0.0 
0.4 
0.8 
1.3 
1.8 


high-to-Iow 
4.5 
125 
39.9 
4.3 
4.5/1.8 
0 
0.0 
0.5 
0.9 
1.4 
1.9 
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low-to-high 
4.5 
-10 
14.1 
4.3 
4.5/0.9 
0 
0.0 
0.2 
0.4 
0.7 
1.1 
1.4 
1.7 
2.0 


low-to-high 
4.5 
80 
18.0 
4.3 
4.5/1.2 
0 
0.0 
0.2 
0.4 
0.9 
1.2 
1.5 
1.9 
2.2 


low-to-high 
4.5 
125 
19.9 
4.3 
4.5/1.3 
0 
0.0 
0.2 
0.5 
0.8 
1.2 
1.5 
1.9 
2.2 


high-to-Iow 
5.5 
-10 
10.6 
4.3 
5.5/0.7 
0 
0.0 
0.3 
0.6 
0.9 
1.2 
1.5 
1.8 
2.0 


high-to-Iow 
5.5 
80 
13.9 
4.3 
5.5/1.0 
0 
0.0 
0.4 
0.7 
1.2 
1.5 
1.9 
2.2 
2.5 


high-to-Iow 
5.5 
125 
15.8 
4.3 
5.5/1.1 
0 
0.0 
0.4 
0.8 
1.3 
1.7 
2.0 
2.4 
2.8 


high-to-Iow 
5.0 
-10 
11.0 
4.3 
5.0/0.8 
0 
0.0 
0.4 
0.7 
1.0 
1.3 
1.6 
1.9 
2.1 


high-to-Iow 
5.0 
80 
14.5 
4.3 
5.0/1.0 
0 
0.0 
0.4 
0.8 
1.2 
1.6 
2.0 
2.3 
2.6 


high-to-Iow 
5.0 
125 
16.5 
4.3 
5.0/1.2 
0 
0.0 
0.4 
0.8 
1.3 
1.7 
2.1 
2.5 
2.8 


high-to-Iow 
4.5 
-10 
11.3 
4.3 
4.5/0.9 
0 
0.0 
0.4 
0.7 
1.1 
1.4 
1.7 
2.0 
2.4 


high-to-Iow 
4.5 
80 
15.2 
4.3 
4.5/1.2 
0 
0.0 
0.4 
0.8 
1.3 
1.6 
2.0 
2.3 
2.7 


high-to-Iow 
4.5 
125 
17.4 
4.3 
4.5/1.3 
0 
0.0 
0.4 
0.8 
1.3 
1.7 
2.1 
2.5 
2.8 
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Transition 
Vcc(V) 
TJ(C) 
Ro(U) 
Co(pF) 
dV/dt 
to (ns) CL (pF) = 


0 
5 
25 
50 
100 
150 
200 
250 
300 


low-to-high 
5.5 
-10 
7.7 
8.9 
5.5/0.8 
0 
0.0 
0.1 
0.2 
0.4 
0.6 
0.8 
0.9 
1.1 


low-to-high 
5.5 
80 
9.8 
8.9 
5.5/1.1 
0 
0.0 
0.2 
0.3 
0.5 
0.8 
1.0 
1.2 
1.4 


low-to-high 
5.5 
125 
10.8 
8.9 
5.5/1.2 
0 
0.0 
0.2 
0.4 
0.6 
0.8 
1.0 
1.2 
1.4 


low-to-high 
5.0 
-10 
8.2 
8.9 
5.0/0.9 
0 
0.0 
0.1 
0.2 
0.5 
0.7 
0.9 
1.1 
1.3 


low-to-high 
5.0 
80 
10.5 
8.9 
5.0/1.1 
0 
0.0 
0.2 
0.3 
0.7 
0.9 
1.2 
1.4 
1.6 


low-to-high 
5.0 
125 
11.5 
8.9 
5.0/1.2 
0 
0.0 
0.2 
0.4 
0.8 
1.0 
1.3 
1.5 
1.7 


low-to-high 
4.5 
-10 
8.9 
8.9 
4.5/1.0 
0 
0.0 
0.2 
0.3 
0.6 
0.8 
1.0 
1.2 
1.5 


low-to-high 
4.5 
80 
11.3 
8.9 
4.5/1.2 
0 
0.0 
0.1 
0.3 
0.7 
0.9 
1.2 
1.4 
1.6 


low-to-high 
4.5 
125 
12.4 
8.9 
4.5/1.2 
0 
0.0 
0.2 
0.4 
0.8 
1.1 
1.4 
1.6 
1.9 


high-to-Iow 
5.5 
-10 
8.5 
8.9 
5.5/0.8 
0 
0.0 
0.2 
0.4 
0.7 
0.9 
1.0 
1.2 
1.4 


high-to-Iow 
5.5 
80 
10.5 
8.9 
5.5/1.1 
0 
0.0 
0.3 
0.5 
0.9 
1.2 
1.4 
1.6 
1.8 


high-to-Iow 
5.5 
125 
11.7 
8.9 
5.5/1.2 
0 
0.0 
0.3 
0.6 
1.0 
1.3 
1.5 
1.8 
2.0 


high-to-Iow 
5.0 
-10 
8.7 
8.9 
5.0/0.9 
0 
0.0 
0.2 
0.4 
0.7 
0.9 
1.1 
1.2 
1.4 


high-to-Iow 
5.0 
80 
10.9 
8.9 
5.0/1.1 
0 
0.0 
0.3 
0.6 
0.9 
1.2 
1.5 
1.7 
1.8 


high-to-Iow 
5.0 
125 
12.1 
8.9 
5.0/1.3 
0 
0.0 
0.4 
0.6 
1.0 
1.3 
1.6 
1.8 
2.1 


high-to-Iow 
4.5 
-10 
9.0 
8.9 
4.5/1.0 
0 
0.0 
0.2 
0.5 
0.8 
1.0 
1.2 
1.3 
1.5 


high-to-Iow 
4.5 
80 
11.3 
8.9 
4.5/1.2 
0 
0.0 
0.3 
0.6 
1.0 
1.3 
1.5 
1.7 
1.9 


high-to-Iow 
4.5 
125 
12.6 
8.9 
4.5/1.3 
0 
0.0 
0.4 
0.7 
1.1 
1.4 
1.7 
2.0 
2.2 


Table 
10-14. Extra Large Buffer First Order Electrical 
Model Parameter 
Values 
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Table 10-15. 82495XP 
Cache Controller 
Buffer Models 


Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typ 
Typ 
Cln (pF) 
Size 
Typ 


A05# 
815 
7.7 
12.0 
3.8 


AHOLO 
A17 
9.5 
16.0 
5 


BGT# 
M03 
5.9 
10.1 
1.9 


BLA5T# 
C15 
7.7 
11.2 
LlX 


BLE# 
C16 
8.0 
14.1 
5 


BOFF# 
G15 
7.7 
9.4 
LlX 


BROY# 
P01 
7.3 
12.9 
1.7 


BROYC1# 
015 
8.0 
10.5 
5 


BROYC2# 
F14 
8.1 
8.7 
X 


BU5# 
P16 
5.9 
8.4 
X 


CACHE# 
G14 
8.1 
8.4 
2.3 


CA05# 
E03 
6.4 
10.4 
5 


CAHOLO 
G04 
6.2 
8.6 
5 


CO/C# 
003 
6.7 
10.2 
5 


COT5# 
F04 
6.5 
8.8 
, 
L 


CFAO 
E15 
8.3 
10.2 
4.3 


CFA1 
B14 
7.2 
11.3 
4.3 


CFA2 
006 
7.9 
8.8 
4.3 


CFA3 
B02 
8.8 
13.5 
4.3 


CFA4 
A16 
7.5 
14.5 
4.3 


CFA5 
E14 
7.2 
9.3 
4.3 


CFA6 
014 
7.1 
10.2 
5.5 


ClK 
011 
6.5 
8.8 
3.1 


CM/IO# 
004 
7.3 
10.2 
5 


CNA# 
l04 
5.8 
8.7 
1.7 


CROY# 
M02 
6.5 
10.8 
1.8 


CWAY 
J03 
5.9 
8.4 
0.0 
5 


CW/R# 
E04 
7.1 
9.4 
5 


O/C# 
H14 
5.8 
8.4 
2.3 


ORCTM# 
M01 
6.4 
11.8 
3.9 


EA05# 
J15 
5.9 
8.4 
5 


EWBE# 
502 
7.4 
12.9 
5 


FlU5H# 
N04 
8.1 
8.2 
1.6 


FPFlO# 
J04 
5.3 
8.1 
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Table 10-15. 82495XP 
Cache Controller 
Buffer Models 
(Continued) 


Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typ 
Typ 
Cln (pF) 
Size 
Typ 


F810UT# 
001 
6.9 
12.3 
8 
8 


HITM# 
017 
8.7 
15.4 
2.5 


INV 
K15 
5.6 
9.2 
8 


KEN# 
016 
7.9 
13.5 
8 


KLOCK# 
C03 
8.1 
11.5 
8 


KWENO# 
M04 
6.3 
9.1 
2.0 


LEN 
F15 
8.1 
9.7 
2.3 


LOCK # 
B16 
8.5 
13.8 
2.3 


MALE 
002 
7.5 
11.8 
1.6 


MAOE# 
804 
6.8 
11.7 
1.6 


MAWEA# 
017 
7.2 
12.6 
X 


MBALE 
P04 
7.2 
10.5 
1.7 


MBAOE# 
P06 
7.1 
8.6 
1.6 


MCACHE# 
CO2 
7.6 
12.1 
8 


MCFAO 
016 
7.6 
11.5 
6.6 
8/L 


MCFA1 
N14 
6.9 
9.7 
6.6 
8/L 


MCFA2 
R04 
7.0 
11.9 
6.6 
8/L 


MCFA3 
006 
6.6 
8.9 
6.6 
8/L 


MCFA4 
P15 
7.2 
11.2 
6.6 
8/L 


MCFA5 
P14 
7.3 
10.4 
6.6 
8/L 


MCFA6 
P13 
7.5 
9.6 
6.6 
8/L 


MCYC# 
P17 
6.9 
12.0 
X 


MHITM# 
H04 
6.1 
8.2 
L 


M/IO# 
F16 
8.5 
11.0 
2.3 


MKEN# 
R01 
7.7 
13.8 
1.7 


MRO# 
J01 
5.7 
10.6 
4.0 


M8ETO 
015 
8.0 
11.3 
6.6 
8/L 


M8ET1 
P12 
7.7 
8.9 
6.6 
8/L 


M8ET10 
011 
9.7 
9.5 
6.6 
8/L 


M8ET2 
P11 
8.8 
8.2 
6.6 
8/L 


M8ET3 


, 


014 
8.4 
10.9 
6.6 
8/L 


M8ET4 
R16 
8.7 
14.4 
6.6 
8/L 


M8ET5 
013 
8.8 
10.2 
6.6 
8/L 


M8ET6 
R17 
8.8 
16.2 
6.6 
8/L 


M8ET7 
817 
9.7 
16.8 
6.6 
8/L 
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Table 
10-15. 82495XP 
Cache 
Controller 
Buffer 
Models 
(Continued) 


Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typ 
Typ 
Cln (pF) 
Size 
Typ 


MSET8 
P10 
5.9 
8.1 
6.6 
S/L 


MSET9 
012 
9.3 
10.1 
6.6 
S/L 


MTAGO 
010 
6.1 
9.4 
6.6 
S/L 
I: 


MTAG1 
P09 
5.8 
7.9 
6.6 
S/L 


MTAG10 
007 
6.0 
9.3 
6.6 
S/L 


MTAG11 
P07 
6.0 
8.2 
6.6 
S/L 


I' 


MTAG2 
009 
5.9 
9.1 
6.6 
S/L 


MTAG3 
R14 
10.9 
14.9 
6.6 
S/L 


MTAG4 
008 
5.9 
9.1 
6.6 
S/L 


MTAG5 
R15 
11.0 
16.3 
6.6 
S/L 


MTAG6 
S14 
9.4 
16.5 
6.6 
S/L 


MTAG7 
S15 
10.4 
18.1 
6.6 
S/L 


MTAG8 
P08 
5.9 
8.0 
6.6 
S/L 


MTAG9 
S16 
10.3 
19.6 
6.6 
S/L 


MTHIT# 
G03 
6.0 
9.5 
L 


MWB/WT# 
K03 
5.4 
9.2 
4.0 


NA# 
J17 
5.9 
10.6 
S 


NENE# 
005 
7.7 
9.2 
L 


PALLC# 
002 
7.1 
11.3 
S 


PCO 
H15 
5.8 
9.4 
2.3 


PCYC 
J14 
5.6 
8.0 
2.3 


PWT 
C17 
9.1 
15.9 
2.3 


ROYSRC 
C01 
7.2 
12.7 
S 


RESET 
005 
7.1 
10.3 
1.6 


SETO 
013 
6.8 
8.4 
4.9 


SET1 
C13 
6.7 
9.3 
4.9 


SET10 
A09 
6.1 
10.8· 
4.9 


SET2 
C14 
7.1 
11.0 
4.9 


SET3 
B12 
6.6 
10.0 
4.9 


SET4 
C12 
6.4 
8.7 
4.9 


SET5 
C11 
6.2 
8.5 
4.9 


SET6 
012 
6.6 
9.2 
4.9 


SET7 
009 
6.2 
7.3 
4.9 


SET8 
010 
6.0 
8.2 
4.9 
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Table 
10-15. 82495XP 
Cache 
Controller 
Buffer 
Models 
(Continued) 


Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typ 
Typ 
Cln (pF) 
Size 
Typ 


8ET9 
B09 
6.1 
9.6 
4.9 


8MLN# 
C06 
8.5 
9.8 
L 


8NPA08# 
F03 
6.1 
9.8 
8 


8NPB8Y# 
F01 
5.5 
9.2 
L 


8NPCLK 
803 
6.8 
12.3 
1.7 


8NPCYC# 
H03 
5.8 
11.3 
8 


8NPINV 
P05 
7.9 
9.8 
1.7 


8NPNCA 
003 
7.6 
11.2 
1.6 


8NP8TB# 
R03 
7.1 
12.3 
1.7 


SWENO# 
001 
7.4 
13.1 
1.7 


SYNC# 
004 
7.7 
11.3 
1.7 


TAGO 
C08 
6.0 
9.1 
4.2 


TAG 1 
A04 
8.3 
14.2 
4.2 


TAG 10 
B01 
8.1 
14.3 
4.2 


TAG11 
C05 
8.2 
10.4 
4.2 


TAG2 
008 
5.8 
8.1 
4.2 


TAG3 
A03 
8.7 
15.0 
4.2 


TAG4 
B04 
8.3 
14.2 
4.2 


TAG5 
B03 
8.5 
14.7 
4.2 


TAG6 
C07 
8.5 
9.4 
4.2 


TAG7 
A02 
9.0 
15.7 
4.2 


TAG8 
D07 
8.5 
8.4 
4.2 


TAG9 
A01 
9.4 
16.1 
4.2 


TCK 
P03 
7.0 
11.2 
1.7 


TOI 
N03 
8.1 
9.6 
1.6 


TOO 
C04 
7.7 
11.0 
8 


TM8 
P02 
6.9 
12.1 
1.7 


WAY 
L15 
5.7 
9.4 
X 


WBA 
M14 
6.0 
8.7 
X 


WBTYP 
N15 
6.3 
10.1 
X 


WBWE# 
M15 
5.9 
9.6 
X 


WB/WT# 
K14 
5.6 
8.1 
8 


W/R# 
B17 
8.6 
15.2 
2.3 


WRARR# 
y 
L14 
5.8 
8.3 
X 
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Table 
10-16. 82490XP 
Cache 
RAM Buffer 
Models 


Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typ 
Typ 
Cln (pF) 
Size 
Typ 


AO 
65 
1.0 
8.9 
1.7 


A1 
66 
1.0 
8.8 
1.7 


A2 
67 
1.0 
8.6 
1.7 


A3 
68 
1.0 
8.4 
1.7 


A4 
69 
1.0 
8.3 
1.7 


A5 
70 
1.0 
8.2 
1.7 


A6 
71 
1.0 
8.2 
1.7 


A7 
73 
1.0 
8.1 
1.7 


A8 
75 
1.0 
8.1 
1.7 


A9 
76 
1.0 
8.1 
1.7 


A10 
77 
1.0 
8.2 
1.7 


A11 
78 
1.0 
8.2 
1.7 


A12 
79 
1.0 
8.3 
1.7 


A13 
80 
1.0 
8.4 
1.7 


A14 
81 
1.0 
8.6 
1.7 


A15 
82 
1.0 
8.7 
1.7 


ADS# 
63 
1.0 
9.6 
2.0 


BE# 
64 
1.0 
9.1 
1.7 


BLAST # 
59 
1.0 
8.9 
1.7 


BOFF# 
36 
1.0 
9.6 
2.0 


BRDY# 
60 
1.0 
9.1 
2.0 


BRDYC# 
61 
1.0 
9.2 
2.0 


BUS# 
40 
1.0 
8.7 
1.7 


CDATAO 
48 
1.0 
8.8 
3.9 
S 


CDATA1 
54 
1.0 
8.6 
3.9 
S 


CDATA2 
49 
1.0 
8.7 
3.9 
S 


CDATA3 
55 
1.0 
8.6 
3.9 
S 


CDATA4 
46 
1.0 
9.1 


/ 
3.9 
S 


CDATA5 
51 
1.0 
8.6 
3.9 
S 


CDATA6 
52 
1.0 
8.6 
3.9 
S 


CDATA7 
57 
1.0 
8.7 
3.9 
S 


ClK 
30 
1.0 
8.1 
3.4 


CRDY# 
43 
1.0 
9.6 
1.7 


HITM# 
62 
1.0 
9.6 
2.0 
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Table'1o-16. 
82490XP 
Cache 
RAM Buffer 
Models 
(Continued) 


Input 
Output 


Pin Name 
Location 
Cp (pF) 
Lp (nH) 
Buffer 
Buffer 
Typ 
Typ 
Cln (pF) 
Size 
Typ 


MAWEA# 
41 
1,0 
8.9 
1.7 


MBRDY# 
22 
1.0 
9.1 
1.7 


MCLK 
26 
1.0 
8.4 
2.0 


MCYC# 
42 
1.0 
9.1 
1.7 


MDATAO 
18 
1.0 
9.1 
6.6 
S/L 


MDATA1 
14 
1.0 
8.6 
6.6 
S/L 


MDATA2 
10 
1.0 
8.6 
6.6 
S/L 


MDATA3 
6 
1.0 
8.8 
6.6 
S/L 


MDATA4 
16 
1.0 
8.8 
6.6 
S/L 


MDATA5 
12 
1.0 
8.6 
6.6 
S/L 


MDATA6 
8 
1.0 
8.6 
6.6 
S/L 


MDATA7 
4 
1.0 
9.1 
6.6 
S/L 


MDOE# 
20 
1.0 
9.4 
1.7 


MEOC# 
23 
1.0 
8.9 
1.7 


MFRZ# 
24 
1.0 
8.7 
1.7 


MOCLK 
27 
1.0 
8.3 
2.0 


MSEL# 
25 
1.0 
8.6 
2.0 


MZBT# 
21 
1.0 
9.6 
1.7 


PAR# 
32 
STRAPPING 
OPTION 


RESET 
28 
1.0 
8.2 
1.7 


TCK 
3 
1.0 
9.2 
1.7 


TDI 
2 
1.0 
9.4 
1.7 


TOO 
84 
1.0 
9.1 
S 


TMS 
1 
1.0 
9.6 
1.7 


WAY 
45 
1.0 
9.2 
1.7 


WBA 
38 
1.0 
8.4 
1.7 


WBTYP 
37 
1.0 
8.3 
1.7 


WBWE# 
39 
1.0 
8.6 
1.7 


W/R# 
58 
1.0 
9.2 
1.7 


WRARR# 
44 
1.0 
9.4 
1.7 
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10.6 
Capacitive 
Derating 


The 82495XP/82490XP Cache chip set AC Timing 
Specifications are all given at a 0 pF capacitive load. 
For this reason, each output must be derated ac- 
cording to the load being driven. Capacitive derating 
is not a precise method of determining a signal's 
delay, and must only be applied to signals that inter- 
face to the Memory Bus (t32 to t130).A more accu- 
rate determination of delay and signal quality may be 
made by modeling the buffer using the first order 
buffer models. 


NENE,*" 
SMLN,*" 
MTHIT,*,, 
MHITM,*" 
and 
SNPBSY'*', and when MEMLORV is configured low 
for the signals MCFA, MSET, MTAG and MOATA. 
The small (normal) buffers are used with all other 
buffers and with MCFA, MSET, MTAG, and MOATA 
when MEMLORV is configured high. Note that for 
both buffers, the capacitive derating for a low-to- 
high transition is different from a high-to-Iow tran- 
sition. 


The following graphs represent the lumped-load ca- 
pacitive 
derating 
curves 
for 
the 
82495XP and 
82490XP buffers used to drive the memory bus. The 
large buffers are used for the signals COTS,*" 


Small buffers provide the best signal quality for ca- 
pacitive loads of about 50 pF or less. Large buffers 
provide the best quality for loads between 100 pF 
and 150 pF. The 82495XP/82490XP chip set com- 
ponent buffer model provides detailed information 
about buffer performance in specific environments. 
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Loading 
Delay vs Lumped 
Capacitive 
Load 
(Small Buffer, High to Low Transition) 
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Loading 
Delay vs Lumped Capacitive 
Load 
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11.0 
THERMAL DATA 


The 82495XP and 82490XP are specified 
for opera- 
tion 
when 
TC (case 
temperature) 
are 
within 
the 
range of 0·C-85·C. TC may be measured 
in any en- 
vironment 
to determine 
whether the components 
are 
within the specified 
operating 
range. The case tem- 
perature 
should 
be measured 
at the center 
of the 
top surface, 
opposite 
the pins. 


The ambient 
temperature 
(TN is guaranteed 
as long 
as Tc is not violated. 
The ambient 
temperature 
can 
be calculated 
from /1JC and /1JA using the following 
equations: 


TJ = Tc + pe 
8JC 


TA = TJ + pe 
8JA 


Tc = TA + pe 
(8JC - 
8Jcl 


This is true where 
TJ, TA and Tc = junction, 
ambi- 


ent and case temperature, 
respectively, 
and where 
/1JC and /1JA = junction-to-case 
and junction-to-am- 


bient 
thermal 
resistance, 
respectively. 
P = maxi- 


mum power consumption. 


The heat sink referenced 
for all parts is a unidirec- 


tional 
heat sink, 0.350· 
high, 40 MIL fin width, 
and 
155 MIL center-to-c~nter 
fin spacing. 
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82495XP 


8JC (C/W) = 1.5 without 
heat sink 


2.0 with heat sink 


intel~ 


Case to Ambient 
Thermal 
Resistance 
with and without 
Heat Sink for 82495XP 


111 


16 


14 
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Theta CA 
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800 
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•• 
With Heat Sink 
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8JC (C/W) = 7.5 without 
heat sink 


8.0 with heat sink 


Case to Ambient 
Thermal 
Resistance 
with and without 
Heat Sink for 82490XP 
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12.0 
MECHANICAL DATA 


Letter or 
Description 
of Dimensions 
Symbol 


A 
Distance 
fr im seating plane to highest point of body 


A1 
Distance 
butween 
seating plane and base plane (lid) 


A2 
Distance 
from base plane to highest point of body 


A3 
Distance from seating plane to bottom of body 
8 
Diameter 0 terminal 
lead pin 


0 
Largest 
OVI >railpackage 
dimension 
of length 


01 
A body len! th dimension, 
outer lead center to outer lead center 


e1 
Linear spac ing between 
true lead position centerlines 


L 
Distance from seating plane to end of lead 


51 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. Controlling dimension: millimeter. 
2. Dimension "e1" ("e") is non-cumulative. 
3. Seating plane (standoff) is defined by P.C. board hule size: 0.0415 inch-0.0430 inch 
4. Dimensions "S", "S,' 
and "C" are nominal. 


5. Details of Pin 1 identifier are optional. 
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82495XP 


1.95" 
Sq Ceramic 
Pin Grid Array 
Package 
SEATING 
PlANE 
~-----------D------------~ 
I+-----Dj------t 


&!l1.65 
REF. 
o@@@@@@@@@@@@@@@@@o 
@@@@@@@@@@@@@@@@@@@ 
@@@@@@@@@@@@@@@@@@@ 
@@@O@@@@@@@@@@@O@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
@@@@ 
o@@@ 
@@@@ 
o@@@ 
@@@@ 
@ 
@@ 
@@@@ 
@@ 
@ 
@@@@ 
@@@ 
@@@@@@@@@@@ 
@@@ 
@@@@@@@@@@@@@@@ 
o@@ 
@@@@@@@@@@@@@@@@ 
@@ 
@@@@@@@@@@@@@@@@@ 
@ 


mm(lnch) 


240956-74 


PIN C3 


2.29 
REF ' 
1.52 
• 
450 
CHAMFER 
(INDEX CORNER) 


SWAGGED 
PIN 
(4 PL) 


SEATING1 
PLANE 


l!JB (ALL PINS) 
..L,-----I11-1 
T 


D 


Family: 
Ceramic 
Pin Grid Array 
Package 


Mlllimeters 
Inches 
Symbol 
Mln 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
0.140 
0.180 


A, 
0.64 
1.14 
Solid Lid 
0.025 
0.045 
Solid Lid 


A2 
0.23 
0.30 
Solid Lid 
0.110 
0.140 
Solid Lid 


A3 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 
D 
49.53 
50.17 
1.950 
1.975 
D, 
45.59 
45.85 
1.795 
1.805 
9, 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
240 
280 
240 
280 


S, 
1.52 
2.54 
0.060 
0.100 


ISSUE 
IWS 
9/90 


SWAGGED 
PIN 
DETAIL 


BASE 
PLANE 


Figure 
12-1. 82495XP 
Mechanical 
Specifications 
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82495XP Cache Controller/82490XP 
Cache RAM 


82490XP 


mm (inch) 
240956-75 


Plastic 
Quad Flatpack 
(PQFP) 0.025 In. (0.635 mm) Pitch 


Symbol 
Description 
Mln 
Max 
Mln 
Max 
(mm) 
(mm) 
(In.) 
(In.) 


N 
Leadcount 
84 
84 


A 
Package 
Height 
4.06 
4.57 
0.160 
0.180 


A1 
Standoff 
0.51 
1.02 
0.020 
0.040 


D,E 
Terminal 
Dimension 
19.56 
20.07 
0.770 
0.790 


D1, E1 
Package 
Body 
16.43 
16.59 
0.647 
0.653 


D2,E2 
Bumper Distance 
20.24 
20.39 
0.797 
0.803 


D3,E3 
Lead Dimension 
12.70 REF 
0.500 REF 


D4,E4 
Foot Radius Location 
18.36 
18.71 
0.723 
0.737 


L1 
Foot Length 
0.51 
0.76 
0.020 
0.030 


Issue 


Figure 
12·2. 82490XP 
Mechanical 
Specifications 
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Cache Controller/82490XP 
Cache RAM 
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J, 


r-, 
t 


Jl 


-~J 
3.81 (.151l) MAXTYP 
, 


y-SEE DETAILM 
,~) 
- 
-'1.91 
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; 


-$lll.25 
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JLI .1l1l2MM/MM(IN/IN)lol 
240956-76 


Molded 
Details 


240956-77 


Terminal 
Details 
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SEEDETAILL 
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DETAILJ 
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Typical 
Lead 


~0.13 
(.005)@II]A®-B®~ 


0.41 
(.01b) 
.IL20 (.098) 
• 


(.00S) 
( .005) 


0.31 
(.012) -f I- 
0. 20 (. 0BS) 
• • 
D4/E4 ----<-t 


[IJB.20 
(.00S)@ 
IIJA®-B®~ 
S DEG . 
o DEG. 


mm (inch) 
240956-78 


Detail J 
Detail L 


Detail M 


t 
1.32 (.Il52) 
1.22 (.Il48) 
~ 
a. ~Il ('Il35) 
MIN. 
2.Il3 (.Il81l) 
1.~3 ('Illb) 


---D2---l 
240956-79 
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82495XP 
Cache Controller/82490XP 
Cache RAM 


13.0 
REVISION HISTORY 


The following list represents the major differences between version -002 and version -001 of the i860 XP 
Microprocessor Data Sheet. 


Section 2.2.4 
AI bit has been changed to TAl in Figure 2.5. 
The explanation for PI bit has been expanded. 


Section 4.2.33 
PCHK 11 signal description has been expanded. 


Section 4.2.35 
Output buffer configuration has been added in PEN 11 signal description. 
Section 5.1.3 
Table 5.2 has been corrected. 
Section 5.2.2.4-5 
The explanation of late back-off mode has been expanded. 
Section 5.2.4 
Figure 5.27 has been corrected. 


Section 9.2 
D.C. Characteristics are corrected. 
Section 9.3 
A.C. Characteristics are replaced with nominal timings based on CL = 0 pF. 
Figure 9.3 and Figure 9.4 have been replaced with nominal A.C. Timings based on CL = 0 
pF. 


Figure 9.5 has been corrected for normal and high-current output buffers. 
Component mbuffer model has been added. 
Programming restrictions on flush instruction has been added. 
Section 9.4 
Section 10.4 
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Introduction 


The i860™ 
64-bit microprocessor 
is a general-purpose 
CPU with on-chip 
integer 
unit, floating 
point, 
memory 
management, 
caches, 
and 
graphics. 
The 
i860 
micro- 
processor 
supports 
3-D graphics 
software 
with the fol- 
lowing functions: 
I. Hidden 
surface 
elimination 


2. Distance 
interpolation 


3. Intensity 
ipterpolation 
for 3-D shading 


The 
fzchks 
(Z-buffer 
Check) 
and pst (Pixel 
Store) 
in- 
structions 
expedite 
hidden 
surface 
elimination. 
Dis- 
tance 
interpolation 
is accomplished 
with 
faddz 
(Add 
with Z merge), 
and intensity 
interpolation 
occurs 
with 
faddp (Add 
with Pixel Merge). 
The purpose 
of this ap- 
plication 
note is to illustrate 
the intended 
use of these 
instructions 
in a manner 
independent 
of any graphics 
environment 
in which the instructions 
might be used. It 
is not the purpose 
of this application 
note to present 
the 
most 
efficient 
instruction 
sequences. 
While 
the 
inner 
loop of Example 
7 has as few instructions 
as logically 
possible, 
the 
other 
examples 
are 
intended 
to present 
general 
concepts, 
not optimum 
implementations. 
Tun- 
ing for maximum 
performance 
depends 
on the specific 
environment. 


This 
application 
note 
assumes 
familiarity 
with 
the 
i860™ 64-bit Microprocessor Programmer's Reference 
Manual (Intel 
order 
number 
240329); 
the i860 micro- 
processor 
instructions 
for graphics 
are detailed 
in sec- 
tion 6.6. 


1.0 3-D RENDERING 


This series of examples 
are routines 
that might be used 
at the lowest level of a graphics 
software 
system to con- 
vert a machine-independent 
description 
of a 3-D image 
into values for the frame buffer of a color video display. 
Typically, 
higher-level 
graphics 
routines 
represent 
an 
object 
as a set of polygons 
that 
together 
roughly 
de- 
scribe 
the surfaces 
of the objects 
to be displayed. 
The 
graphics 
system 
maintains 
a database 
that 
describes 


intel~ 


these 
polygons 
in terms 
of their 
colors, 
properties 
of 
reflectance 
or translucence, 
and 
the 
locations 
in 3-D 
space 
of their 
vertices. 
Due 
to the 
roughness 
of the 
representation, 
the amount 
of information 
in the data- 
base is considerably 
less than that which must be deliv- 
ered to the video display. 
A rendering 
procedure, 
such 
as Example 
7, uses interpolation 
to derive 
the detailed 
information 
needed 
for each pixel in the graphics 
frame 
buffer. The rendering 
procedure 
also performs 
pixel-by- 
pixel hidden-surface 
elimination. 


The 
focus 
of this 
series 
of examples 
is Example 
7, 
which 
operates 
on a segment 
of a scan line. The 
seg- 
ment 
is bounded 
by two points 
of given location 
and 
color: 
from 
point 
(Xl, 
YO, Zl) 
with 
color 
intensities 
Redl, e-a, Blul 
to point (X2, YO,Z2) with color 
iit- 
tensities Red2, Gm2, Blu2. The points 
and color inten- 
sities are determined 
by higher-level 
graphics 
software. 


The 
points 
represent 
the intersection 
of the 
scan 
line 
with two edges of the projected 
image of a polygon. 
For 
a given scan line, the rendering 
procedure 
is executed 
once for each polygon 
that projects 
onto that scan line. 
The 
higher-level 
graphics 
software 
is responsible 
for 
orienting 
the 
objects 
with 
respect 
to the 
viewer, 
for 
making 
perspective 
calculations, 
for scaling, and for de- 
termining 
the amount 
of light that 
falls on each poly- 
gon vertex. 


The 16-bit pixel format 
is used, giving ample resolution 
for color shading: 
26 intensity 
values for red, 26 intensi- 
ty values 
for green, 
and 
24 intensity 
values 
for blue. 


Example I shows how to set the pixel size. For hidden- 
surface 
elimination, 
the Z-buffer 
(or depth buffer) 
tech- 
nique is employed, 
each Z value having 
a resolution 
of 
16-bits. 


Because 
the examples 
presented 
here use almost 
all of 
the registers 
of the i860 microprocessor, 
the 
registers 
are given symbolic 
names, 
as defined 
by Example 
2. In 
a real application, 
it is likely that some of the inputs 
to 
the rendering 
procedure 
would 
be passed 
in floating- 
point registers 
instead 
of the integer 
registers 
employed 
here. The register 
allocation 
shown 
in Example 
2 sim- 
plifies 
the 
examples 
by avoiding 
the 
need 
to use any 
register 
for multiple 
purposes. 


II SET PIXEL 
Id.e 
andnoth 
orh 
st.e 


SIZE TO 16 
psr, 
OxOOCO, 
Ox0040, 
Ra, 


Ra 
Ra, 
Ra, 
psr 


II Work 
on psr 
RaIl Clear 
PS 
Rail PS = 16-bit 
11 
pixels 


Example 1. Setting Plxel Size 
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// REGISTER DEFINITIONS FOR RENDERING PROCEDURE 
// 
INTEGER LOCALS 
= r4 
// Temporary 
= r5 
// Temporary 
= r6 
// Temporary 
= r7 
// Temporary 
INTEGER INPUTS 
Xl 
= r16 
// X coordinate of starting point of line segment in pixels 
dX 
= r17 
// Width of scan line segment in number of pixels 
ZBP = r18 
// Z-buffer pointer to the current line segment 
Zl 
= r19 
// Initial Z value, fixed-point 16.16 format 
mZ 
= r20 
// Z slope, fixed-point 16.16 format 
FBP = r2l 
// Graphics frame buffer pointer to the current line segment 


Redl = r22 
// Initial red intensity, fixed-point 6.10 format, plus .5 
Grnl = r23 
// Initial green intensity, fixed-point 6.10 format, plus .5 


Blul = r24 
// Initial blue intensity, fixed-point 6.10 format, plus .5 
mR 
= r25 
// Red slope, fixed-point 6.10 format 
mG 
= r26 
// Green slope, fixed-point 6.10 format 
mB 
= r27 
// Blue slope, fixed-point 6.10 format 
REAL LOCALS 
aZ 
= f2 
// Accumulated 
Z values 
aZh 
= rs 
11 
iZl = f4 
// Z interpolant, coefficient 1.0 
iZlh = rs 
11 
iZ3 = f6 
// Z interpolant, coefficient 3.0 
iZ3h = f7 
11 
oldz = f8 
// Original values from the Z-buffer 
newz = flO 
// New Z-buffer values 
newzh = fU 
// 
newi = fl2 
// New pixel values 
iR 
= f14 
// Red interpolant, coefficient 4.0 
iRh 
= fl5 
11 
aR 
= fl6 
// Accumulated 
red intensities 
aRh 
= fl7 
11 
iG 
= fl8 
// Green interpolant, coefficient 4.0 
iGh 
= fl9 
11 
aG 
= f20 
// Accumulated 
green intensities 
aGh 
= f2l 
11 
iB 
= f22 
// Blue interpolant, coefficient 4.0 
iBh 
= f23 
11 
aB 
= f24 
// Accumulated blue intensities 
aBh = f25 
11 
IZmask 
= f26 
IZmaskh 
= f27 
rZmask 
= f28 
rZmaskh 
= f29 


Ra 
Rb 
Rc 
Rd 
11 


11 


// left-end Z mask 
11 
11 
11 
right-end Z mask 


Example 
2. Register 
Assignments 


L 
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2.0 
DISTANCE 
INTERPOLATION 


To perform hidden surface elimination at each pixel, 
the rendering routine first interpolates the value of Z at 
each pixel. Distance interpolation consists of calculat- 
ing the slope of'Z over the given line segment, then 
increasing the Z value of each successive pixel by that 
amount, starting from Xl. The width of the line seg- 
ment in pixels is ... 


dX = X2 - 
Xl 


Calculate the reciprocal of dX: 


RdX = lIdX 


The value of dX is used several times as a divisor. It is 
most efficient to calculate its reciprocal once, then, in- 
stead of dividing by dX, multiply by RdX. The slope of 
Zis ... 


mZ = (Z2 - 
Zl)· RdX 


Because each polygon is a plane, the value of mZ is 
constant for all scan lines that intersect the polygon; 
therefore mZ needs to be calculated only once for each 


polygon. Example 7 assumes that dX and mZ have al- 
ready been calculated, and all that remains is to apply 
mZ to successive pixels. Let Z(Xn) be the Z value at 
pixel Xn. Then ... 


Z(XI) 
= Zl 
Z(XI + 1) = Zl + mZ 
Z(XI + 2) = Zl + rmZ 


Z(XI + N) = Zl + N"mZ 


Z(XI + dX) =Zl 
+ dX"mZ 
= Z(X2) 


Figure 1 illustrates this Z-value interpolation. 


The faddz instruction helps to perform the above calcu- 
lations 64 bits at a time. Because a Z value is 16 bits 
wide, Example 7 operates on the Z buffer in groups of 
four. The faddz instruction, however, treats the interpo- 
lation values (N" mZ) as 32-bit fixed-point numbers; 
therefore, two faddz instructions are executed for each 
group of four pixels. Because of the way the faddz shifts 


(r, g, b, x, y, Z = .000) 
ooo 
'" 


(r', g', b', x', y', z' = 800) 


mZ = 3000-2.00 
12 plxels 


(r", g", bOO, x", y", ZOO = 1000) 
240856-1 


Figure 1. Z·Buffer 
Interpolation 
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the MERGE 
register, the first faddz corresponds to 
even-numbered pixels, while the second corresponds to 
odd-numbered pixels. Instead of starting with the value 
for the first pixe1(Z(Xl» 
and adding mZ to each pixel 
to produce the value for the next pixel, the example 
procedure starts with the values for the first two even- 
numbered pixeis and adds I" mZ to each of these values 
to produce the values for the adjacent odd-numbered 
pair. Adding 3"mZ to each of the Z values of an odd- 
numbered pair produces the values for the next even- 
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numbered pair. Figure 2 shows one way of constructing 
the operands before starting the distance interpolations. 
(The initial value given to srcJ depends on the align- 
ment of the first pixel.) Table I helps to visualize the 
process. 


After two faddz instructions, 
the MERGE 
register 
holds the Z values for four adjacent pixe1s(in the cor- 
rect order). The form instruction copies MERGE into 
one of the 64-bit floating-point registers. 


MERGE Register 
Operands 
63-32 
31-0 
63-48 
I 
47-32 
I 
31-16 
I 
15-0 


src1 
-1.0 
-3.0 


src2 
3.0 
3.0 


rdestlsrc1 
2.0 
0.0 
2 
I 
I 
0 
I 


src2 
1.0 
1.0 


rdestlsrc1 
3.0 
1.0 
3 
I 
2 
I 
1 
I 
0 


src2 
3.0 
3.0 


rdestlsrc1 
6.0 
4.0 
6 
I 
I 
4 
I 
src2 
1.0 
1.0 


rdestlsrc1 
7.0 
5.0 
7 
I 
6 
I 
5 
I 
4 


src2 
3.0 
3.0 


rdestlsrc1 
10.0 
8.0 
10 
I 
I 
8 
I 


src2 
1.0 
1.0 


rdestlsrc1 
11.0 
9.0 
11 
I 
10 
I 
9 
I 
8 


src2 
3.0 
3.0 


rdest/src1 
14.0 
12.0 
14 
I 
I 
12 
I 
src2 
1.0 
1.0 


rdest 
15.0 
11.0 
15 
I 
14 
" 


13 
I 
12 
Because the values of Z1 and mZ are constant for each loop through the rendering routine,the numbers shown here are 
the values of the coefficientN, where the actual operands have the values Z1 + N*mZ For each execution of faddz, src1 
is the same as rdest of the priorfaddz. Afterevery two faddz instructions,a form instructionempties the MERGE register. 
I 
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/I CONSTRUCT INTERPOLANTS iZ1 AND iZ3 GIVEN mZ 
, 


ixfr 
mZ, 
iZ1 
II Join each half in 64-bit register 
sh1 
1, 
mZ, 
Ra 
/I Ra = 2*mZ 
adds 
Ra, 
mZ, 
Ra 
/I Ra = 3*mZ 
ixfr 
Ra, 
iZ3 
/I Join each half in 64-bit register 
fmov.ss 
iZ1, 
iZ1h 
II Join each half in 64-bit register 
fmov.ss 
iZ3, 
iZ3h 
/I Join each half in 64-bit register 


Example 
3. Construction 
of Z Interpolants 


(r' = 40, g', b', x', y', z') 


mR= 
27-30 
12 plxels 


(r = 20,g,b,x,y,z) 


Red Color 
(0-63) 


(r" = 40, g", b", x", y", z") 
240856-2 


, Figure 3. Plxellnterpolatlon 
for Gouraud 
Shading 


The same register is used as both srcI and rdest in all 
faddz instructions. This register serves to accumulate Z 
values for successive pixels; therefore, it is called an 
accumulator. The registers used as src2 are called inter- 
polants. The code in Example 3 constructs the interpo- 
lants; it needs to be executed only once for each poly- 
gon. 


3.0 COlOR INTERPOLATION 


To determine the RGB color intensities at each pixel, 
the rendering routine interpolates between the color in- 
tensities at the end points. (This rendering technique is 
called "Gouraud shading" after H. Gouraud, "Contin- 
uous Shading of Curved Sufaces," IEEE Transactions 
on Computers, C-20(6), June 1971, pp. 623-628.) Let 
the symbol C (color) represent 
either R (red), G 


(green), or B (blue). Color interpolation consists of cal- 
culating the slope of C over the given line segment, then 
increasing the C values of each successive pixel by that 
amount, starting from the values for Xl. This must be 
done for C= R, C= G, and C= B. The slope ofC is ... 


mC = (C2 - 
Cl)' RdX 


... 
where RdX = l/dX 
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The value of mC is constant for all scan lines that inter- 
sect a given pair of polygon edges; therefore mC needs 
to be calculated only once for each such pair. Example 
7 assumes that mC has already been calculated for all 
colors, and all that remains is to apply mC to successive 
pixels. Let C(Xn) be a C value at pixel Xn. Then ... 


C(XI) = Cl 
C(XI + 1) = Cl + 
mC 
C(XI + 2) = Cl + 2'mC 


C(XI + N) = Cl + N'mC 


C(XI + dX) = Cl + dX*mC = C(X2) 


Figure 3 illustrates Gouraud shading of a triangle. 


The faddp instruction performs the above calculations 
64 bits at a time. Because a pixel is 16 bits wide, Exam- 
ple 7 operates on pixels in groups of four. Instead of 
starting with the value for the first pixel (C(XI» 
and 


adding mC to each pixel to produce the value for the 
next pixel, the example procedure starts with the values 
for the first four pixels and adds 4' mC to each group of 


I 


four to produce the values for the next four. Three 
faddp instructions are executed for each group of four 
pixels. The first increments the blue values; the second, 
green; the third, red. Figure 4 shows one way of con- 
structing the operands for each color before starting the 
color interpolations. (The initial value given to srcl de- 
pends on the alignment of the first pixel.) 


Setup of the accumulator and interpolants is similar to 
that of the Z-buffer. The code in Example 4 constructs 
the interpolants; it needs to be executed only once for 
each pair of edges in each polygon. 


4.0 
BOUNDARY 
CONDITIONS 


The i860 microprocessor operates on 64-bit quantities 
that are aligned on 8-byte boundaries. The code in this 
~xample takes full advantage of this design, handling 
four 16-bit pixels in each loop. However, if the first or 
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last pixel of a line segment is not on an 8-byte bounda- 
ry, two kinds of special considerations are required: 


I. Masking of Z values near the end points. 
2. Initialization of the accumulators. 


4.1 Z-Buffer Masking 


When either the first or last pixel of the line segment is 
not at an 8-byte boundary, the rendering procedure 
must mask the first or last set of new Z-buffer values 
(newz) so that the Z-buffer and the frame buffer are not 
erroneously updated. Sometimes both the first and last 
pixels are in the same 4-pixel set, in which case either 
one may not be on an 8-byte boundary. A function that 
looks up and calculates masks is shown in Example 5. 


Because the value OxFFFF is used for masking, the Z- 
buffer is initialized with OxFFFE, so that the fzchks 
instruction always finds the mask to be greater than 
any Z-buffer contents. 


Accumulator 
63 
47 
31 
0 


I 
I 
Initial 
C1+3·mCI 
frac 
C1+2·mCI 
frac 
C1+mC 
frac 
frac 


I 
I 
src1 


Interpolant 
63 
47 
31 
15 
0 


4·mC 
frac 
4·mC 
frac 
4·mC 
frac 
4·mC 
frac 
src2 


Figure 4. faddp Operands 


11 CONSTRUCT INTERPOLANTS iR. iG. iB GIVEN mR. mG. mB 
shl 
18. 
mR. 
Ra 
// Multiply eachcolor 
slope by four. then 
shl 
18. 
mG. 
Rb 
11 
shift by 16 to put the significant 
shl 
18. 
mB. 
Rc 
11 
bits into the high-order half 
shr 
16. 
Ra. 
mR 
// Return Significant 16 bits 
shr 
16. 
Rb. 
mG 
11 
to low-order half. Any sign bits 
shr 
16. 
Rc. 
mB 
11 
in high-order half are gone. 
or 
mR. 
Ra. 
Ra 
// Join 16-bit quarters 
or 
rG. 
Rb. 
Rb 
// in 32-bit register 
or 
mB. 
Rc. 
Rc 
11 
ixfr 
Ra. 
iR 
11 Join 32-bit halves 
ixfr 
Rb. 
iG 
11 in 64-bit register 
ixfr 
Rc. 
iB 
11 
fmov.ss 
iR. 
iRh 
11 
fmov.ss 
re, 
iGh 
11 
fmov.ss 
iB. 
iBh 
11· 


Example 4. Construction 
of Color Interpolants 
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.macro zmask l_align, r_align, Rx, Ry 
II l_align, r_align - left- and right-end alignment [0••3] in 2-byte units 
II Rx, Ry 
- scratch registers 
.data 
.align a 
left_mask:: Illow 
.long OxOOOOOOOO, 
.long OxOOOOFFFF, 
.long OxFFFFFFFF, 
.long OxFFFFFFFF, 
right_mask::lllow 
.long OxFFFFOOOO, 
.long OxOOOOOOOO, 
.long OxOOOOOOOO, 
.long OxOOOOOOOO, 


shl 
3, 
r_align, 
r_align II Multiply by a 
mov 
right_mask, 
Rx 
II 
fld.d 
r_align (Rx), 
rZmask 
II Load a-byte mask 
I I 
If the first and last pixels are ,containedin the same 64-bit 
II aligned set, then lZmask = lZmask OR rZmask. 
andh 
oxaooo, dX, 
rO 
II Is dX negative 
bc 
L2 
II If not, right end is in other set 
fxfr 
lZmask, Rx 
II 
fxfr 
rZmask, Ry 
II 
or 
Rx, 
Ry, 
Rx 
II OR low-order half 
ixfr 
Rx, 
lZmask 
II 
fxfr 
lZmaskh, 
Rx 
/I 
fxfr 
rZmaskh, 
Ry 
I I 
or 
Rx, 
Ry, 
Rx 
II OR high-order half 
ixfr 
Rx, 
lZmaskh 
II 
L2: nop 
II 
.endm 


.text 
shl 
3, 
l_align, 
mov 
left_mask, 
fld.d 
Lalign· (Rx), 


high 
OxOOOOOOOO 
/I o mod 4 
OxOOOOOOOO 
/I 1 mod 4 
OxOOOOOOOO 
/I 2 mod 4 
OxOOOOFFFF 
/I 
3 mod 4 
high 
OxFFFFFFFF 
/I o mod 4 
OxFFFFFFFF 
/I I mod 4 
OxFFFFOOOO 
/I 2 mod 4 
OxOOOOOOOO 
/I 
3 mod 4 


I_align II Multiply by a 
Rx 
/I 
IZmask 
II Load a-byte mask 
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Example 
5_Z Mask Procedure 
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Table 3. Accumulator 
Initialization 
Table 


Alignment 
Table Values 


·mZ 
·mR 
·mG 
·mB 


0 
-1, -3 
-1, -2, -3, -4 
-1, -2, -3, -4 
-1, -2, -3, -4 


2 
-2, -4 
-2, -3, -4, -5 
-2, -3, -4, -5 
-2, -3, -4, -5 


4 
-3, -5 
-3,-4,-5,-6 
-3, -4, -5, -6 
-3,-4,-5,-6 
6 
-4, -6 
-4, -5, -6, -7 
-4, -5, -6, -7 
-4, -5, -6, -7 


4.2 Accumulator 
Initialization 


When the first pixel of the line segment is not at an 8- 
byte boundary, initial values placed in the accumulators 
(aZ, aB, aG, and oR) must be selected so that Zl, 
Redl, Grnl, and Blul correspond to the correct pixel. 
The desired result is that shown by Table 2. However, 
each value is a composite of two terms: one that is 
constant 
for each edge pair (n-mZ, n-mR, n-mG, 


n-mB) and one that can vary with each scan line (Zl, 
Redl, Gml, Blul). The example assumes that the con- 
stant values have all been calculated and stored in a 
memory table of the format shown by Table 3. At the 
beginning of each line segment the values appropriate 
to the alignment of the line segment are retrieved from 
the table and added to the initial Z and color values, as 
shown in Example 6. 


5.0 THE INNER 
LOOP 


Once the proper preparations have been made, only a 
minimal amount of code is needed to render each scan- 


I 


line segment of a polygon. The code shown in Example 
7 operates on four pixels in each loop. The left and 
right ends of the line segment go through different logic 
paths so that the Z-buffer masks can be applied by the 
form instruction. All the interior points are handled by 
the tight inner loop. 


The controlling variable dX is zero-relative and is ex- 
pressed as a number of pixels. The value of dX also 
indicates alignment of the end-points with respect to 
the 4-pixel groups. Unaligned left-end pixels are sub- 
tracted from dX before entering the inner loop; there- 
fore, subsequent values of dX indicate the alignment of 
the right end. A value that is 3 mod 4 indicates that the 
right end is aligned, which explains the test for a value 
of - 5 near the end of the loop (- 5 mod 4 = 3). The 
fact that the value - 5 is loaded into register Rb on 
every execution of the loop does not represent a pro- 
gramming inefficiency, because there is nothing else for 
the core unit to do at that point anyway. 
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// Four initial 16-bit blue values 
// Four initial 16-bit green values 
// Four initial 16-bit red values 
// Two initial 32-bit Z values 


// ACCUMULATOR INITIALIZATION TABLE 
.data; .align .double 
acc_init_tab:: 
.double [16] 0 
.dsect 
aBi: 
.double 
aGi: 
.double 
aRi : 
•double 
aZi: 
•double 
.end 
.text 
// INITIALIZE ACCUMULATORS 
.macro acc_init 
Lalign, Rtab, Rx, Ry, Fx, Fxh 
// 
Lalign - left-end alignment (0••3) in two-byte units 
// 
Rtab 
- register to use for addressing the table 
// 
Rx, Ry, Fx, Fxh - scratch registers 
mov 
acc_init_tab, 
Rtab 
// 
shl, 
5, 
Lalign, Lalign 
// Multiply by row width 
adds 
Lalign, Rtab, 
Rtab 
// Index row corresponding to alignment 
rre ,e 
aZi(Rtab), 
aZ 
1/ 
Z 
ixfr 
Zl, 
Fx 
// Z 
fld.d 
aRi(Rtab), 
aR 
// R-Load constant values 
shl 
16, 
Redl, 
Rx 
// R-Shift starting value to hi-order 


fmov•.ss 
Fx, 
Fxh 
1/. Z 
shr 
16, 
Rx, 
Ry 
// R-Redl stripped of sign bits 
fiadd.dd 
Fx, 
aZ, 
aZ 
1/ 
Z 
or 
Rx, 
Ry, 
Ry- 
// R-Form 
(Redl,Redl) 
ixfr 
Ry, 
Fx 
// R-Put in 64-bit register 
fld.d 
aGi(Rtab), 
aG 
I/G 
shl 
16, 
Grnl, 
Rx 
// G 
fmov.ss 
Fx, 
Fxh 
// R-Form 
(Redl,Redl,Redl,Redl) 
shr 
16, 
Rx, 
Ry 
1/ 
G 
fiadd.dd 
Fx, 
aR, 
aR 
// R-Add variables to constants 
or 
Rx, 
Ry, 
Ry 
1/ 
G 
ixfr 
Ry, 
Fx 
1/ 
G 
fld.d 
aBi(Rtab), 
aB 
I/B 
shl 
16, 
Blul, 
Rx 
// B 
fmov.ss 
Fx, 
Fxh 
// G 
shr 
16, 
Rx, 
Ry 
1/ B 
fiadd.dd 
Fx, 
aG, 
aG 
// G 
or 
Rx, 
Ry, 
Ry 
1/ B 
ixfr 
Ry, 
Fx 
1/ B 
fmov.ss 
Fx, 
Fxh 
// B 
fiadd.dd 
Fx, 
aB, 
aB 
// B 
.endm 


Example 6. Accumulator 
Initialization 
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II 
RENDERING PROCEDURE 
II 
16-bit pixels, 16-bit Z-buffer 
and 
3, 
Xl, 
Ra 
II Determine alignment of starting-point 
acc_init 
Ra, Rb, Rc, Rd, Fa, Fah 
II Initialize accumulators 
subs 
4, 
Ra, 
Rb 
II 4 - alignment 
subs 
dX, 
Rb, 
dX 
II Adjust dX by Xl alignment 
II If dX <= 0, then right end is in same set as left end 
and 
3, 
dX, 
Rb 
II Determine alignment of right end 
zmask 
Ra, Rb, Rc, Rd 
II Prepare both left- and right-end masks 
left_end:: 
II Handle boundary conditions 
d.faddz 
aZ, 
iZ3, 
aZ 
II Interpolate 2 even Z values 
adds 
-8, 
FBP, 
FBP II Anticipate autoincrement 
d.faddz 
aZ, 
iZl, 
aZ 
II Interpolate 2 odd Z values 
adds 
-8, 
ZBP, 
ZBP II Anticipate autoincrement 
d.form 
lZmask, newz 
II Mask 4 new Z values 
fld.d 
8(ZBP), oldz 
II Fetch 4 old Z values 
d.faddp 
aB, 
iB, 
aB 
II Interpolate 4 blue intensities 
mov 
-4, 
Ra 
II Loop increment: 4 pixels 
d.faddp 
aG, 
iG, 
aG 
II Interpolate 4 green intensities 
adds 
-4, 
dX, 
dX 
II Prepare dX for bla at end of loop 
d.faddp 
aR, 
iR, 
aR 
II Interpolate 4 red intensities 
bla 
Ra, 
dX, 
Ll 
II Initialize LCC 
d.form 
fO, 
newi 
II Move 4 new pixels to 64-bit reg 
adds 
5, 
dX, 
rO 
II Are there any whole sets (dX < -5)? 
Ll: d.fzchks 
oldz, 
newz, 
newzll 
Mark closer points in PM[7••4] 
bc 
short_segment 
II Get out now if no whole set 
d.fnop 
II 
fld.d 
16(ZBP), 
oldz 
II Fetch 4 old Z values 
inner_loop:: II Handle all interior points 
d.faddz 
aZ, 
iZ3, 
aZ 
II Interpolate 2 even Z values 
nop 
II 
d.faddz 
aZ, 
iZl, 
aZ 
II Interpolate 2 odd Z values 
fst.d 
newz, 
8(ZBP)++ 
II Update Z buf from prior loop 
d.form 
fO, 
newz 
II Move 4 new Z values to 64-bit reg 
nop 
II 
d.fzchks 
fO, 
fO, 
fO 
II Shift PM[7••4] to PM[3••0] 
mov 
-5, 
Rb 
II -5 mod 4 = 3, aligned right end 
d.faddp 
aB, 
iB, 
aB 
II Interpolate 4 blue intensities 
pst.d 
newi, 
8(FBP)++ 
II Store pixels indicated by PM[3••0] 
d.faddp 
aG, 
iG, 
aG 
II Interpolate 4 green intensities 
xor 
Rb, 
dX, 
rO 
II Are we at an aligned right end? 
d.faddp 
aR, 
iR, 
aR 
II Interpolate 4 red intensities 
bc 
aligned_end 
II Taken if at an aligned right end 
d.form 
fO, 
newi 
II Move 4 new pixels to 64-bit reg 
bla 
Ra, dX, inner_loop 
II Loop if not at end of line segment 
d.fzchks 
oldz, 
newz, 
newzll 
Mark closer points in PM[7••4] 
fld.d 
16(ZBP), 
oldz 
II Fetch 4 old Z values for next loop 
II End of inner_loop. 
Right end not aligned 


Example 7. 3·0 Rendering (1 of 2) 
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right_end:: 
d.faddz 
nop 
d.faddz 
fst.d 
d.form 
nop 
d.fzchks 
nop 
d.faddp 
pst.d 
d.faddp 
nop 
d.faddp 
nop 


aligned_end: : 
d.form 
br 
d.fzchks 
nop 


short_segment: : 
d.fnop 
adds 
d.fnop 
bnc.t 
d.fnop 
rra.e 


wrap_up:: // 
fzchks 
fst.d 
fnop 
pst.d 


// Handle boundary 
aZ, 
iZ3, 


aZ, 
iZl, 
aZ 
newz, 
8(ZBP)++ 
rZmask, newz 


fO, 
fO, 


conditions 
aZ 
// Interpolate 2 even Z values 
11 
11 
11 
11 
11 
fO 
// Shift PM[7 ••4] to PM[3 ••0] 
11 
11 
11 
11 
11 
aR 
// Interpolate 4 red intensities 
11 


Interpolate 2 odd Z values 
Update Z buf from prior loop 
Mask 4 new Z values 


Interpolate 4 blue intensities 
Store pixels indicated by PM[3 ••0] 
Interpolate 4 green intensities 


boundary conditions 
// Move 4 new pixels to 64-bit reg 
11 
newz// Mark closer points in PM[7 ••4] 
11 


11 
8, 
dX, 
rO 
11 Is right end in same set as left? 
11 
right_end 
11 Branch taken if no. 
11 
16 (ZBP), 
oldz 
11 Fetch 4 old Z values 


newi, 
8(FBP)++ 


Example 7. 3-~ Rendering (2 of 2) 


// Store pixels indicated by PM[3 ••0] 


aB, 
newi, 
aG, 


iB, 
aB 
8(FBP)++ 
iG, 
aG 


Store the unstored and leave dual mode. 
fO, 
fO, 
fO 
// Shift PM[7 ••4] to PM[3 ••0] 
newz, 
8(ZBP)++ 
// Update Z buf from prior loop 
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aR, 
iR, 


// No special 
fO, 
newi 
wrap_up 
oldz, 
newz, 


I 


:-.1. • 
a.fzchks oldZ_B,newZ_B,newZ_B 
nop 
d.faddp 
intens,dI,intens 
fst.q 
newZ_A ,16(ZBP)++ 
d.faddp 
intens,dI2,intens 
bte 
O,dX,end 
d.form 
fO,newi 
bla 
neg8,dX,inner_loop 
d.fnop 
pst.d 
newi,8(FBP)++ 
11------------------------------------------------------------ 
II 16-Bit Pixels, 16-Bit Zbuffer = 4 Pixels in 10 Clocks 
II 
G-Unit 
I 
Core Unit 
inner_loop:: 
d.faddz 
aZ,iz3,aZ 
nop 
d.faddz 
aZ~izl,aZ 
fst.d 
newz,8(ZBP)++ 
d.form 
fO,newz 
nop 
d.fzchks fO,fO,fO 
mov 
-5,Rb 
d.faddp 
aB,iB,aB 
pst.d 
newi,8(FBP)++ 
d.faddp 
aG,iG,aG 
xor 
Rb,dX,rO 
d.faddp 
aR,iR,aR 
bc 
aligned_end 
d.form 
fO,newi 
bla 
neg4,dX,inner_loop 
d.fzchks oldz,newz,newz 
fld.d 
16(ZBP),oldz 
11------------------------------------------------------------ 


Example 8. Inner Loop of Renderers for Two Plxel Sizes 
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ABSTRACT 


The i860 Processor computes floating-point results rap- 
idly, lending itself to DSP (digital signal processing) as 
well as general-purpose computing. With this high per- 
formance, DSP functions can be added to any system 
containing an i860 CPU. A Fast Fourier Transform 
(FFI) 
illustrates this DSP power. Complete code for 
the FFT is presented in this application note, as well as 
performance measurements. Both complex and real in- 
put data FFTs are included, as well as both Decimation 
in Time and Decimation in Frequency. 


1.0 INTRODUCTION 
TO FAST 
FOURIER 
TRANSFORMS 


Discrete Fourier Transforms (DFTs) change time-do- 
main data samples into a frequency-domain profile of 
the sampled signal. The frequency-domain representa- 
tion consists of the magnitudes of sine waves at various 
frequencies, which would recreate the original data if 
superimposed. To accomplish the transform, a DFT 
adds combinations of the input data samples, after mul- 
tiplying some of those inputs with weighting factors. 
The number of samples, "N", is usually a power of two. 


Each result in the frequency domain comes from a 
weighted sum of all data samples. The weighting ("W") 
factors are called "twiddles", .and are complex cosine/ 
sine values for each particular frequency. 


The FFT (Fast Fourier Transform) is an efficient im- 
plementation of the DFT, defined by: 
x(n) = time domain samples of the signal, 
n = 0, I, ... 
N-I 
X(k) = the Discrete Fourier Transform of x(n), k = 
0,1, ... 
N- I 
a "frequency domain" equivalent of x(n) 
l: x(n) • Wnk, n = 0 to N-I, and 
Wnk = e-j21TnkiN, where j = Ff 


= l: x(n) • (cos(27TnkIN) - j • sin(27TnkIN) 


The (N-I) complex adds and (N-I) complex multiplica- 
tions required for each X(k) make the DFT an Order 
(NZ) computation. Fortunately, 
the FFT decomposes 
this to an Order (N • log2 N) algorithm by splitting the 
N-sum into units of 2-sums. These units are called 
"butterflies" 
because they produce 2 output 
values 
from 
2 inputs, 
with the butterfly-shaped 
dataflow 


shown below. (Some FFT algorithms, called Radix-4, 
use 4-input, 4-output butterflies.) The butterfly calcula- 
tions are executed in stages, with log2 N stages and N/2 
butterflies per stage. 
I 
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The subdivision, or decimation, of the N-sum into but- 
terflies can be done via two different methods: "Deci- 
mation in Time" (DIT) or "Decimation in Frequency" 
(DIF). The methods differ in the ordering of twiddles 
and the form of the butterfly arithmetic, but they yield 
the same answer. They are based on different mathe- 
matical derivations of the FFT: DIT results from recur- 
sively splitting the input time-domain samples into an 
even-indexed group and an odd-indexed, while DIF 
comes from splitting the DFT output frequency-do- 
main points into odd/even groups. 


2.0 
BUTTERFL:Y DEFINED 


Let 
A = the first input to the butterfly (complex 
number, composed of Real part AR and 
Imaginary part AI) 
B = the second input to the butterfly (com- 
plex, BR and BI) 
W = twiddle factor (also complex, WR and 
WI) 
Anew = complex result 
11 I, which overwrites A 
Bnew = result 
112, which overwrites B 


For a "Decimation-in-Frequency" 
butterfly, 
Anew = A+B 


Bnew = (A - 
B) • W 


The complex add, subtract, and multiply of a butterfly 
decompose into 4 real multiplies, 3 real adds, and 3 real 
subtracts: 
AnewR = AR + BR 
AnewI = AI + BI 
tempR = AR-BR 
tempI 
= AI-BI 


BnewR = (tempR • WR) - 
(tempI • WI) 
BnewI = (tempR • WI) + (tempI • WR) 


For a "Decimation-in-Time" 
butterfly, 


Anew 
A + (B • W) 
Bnew 
= A - 
(B • W) 


The number of real operations remains 4 multiplies and 
6 add/subtracts, 
but the equations differ and the multi- 
plies must be done first: 


tempR 
= (WR • BR) - 
(WI • BI) 
tempI 
= (WR • BI) + (WI • BR) 
AnewR = AR + tempR 
BnewR = AR-tempR 
AnewI = AI + tempI 
BnewI = AI-tempI 
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Butterfly Dataflow: 


(Decimation 
In Frequency) 
(Decimation 
In Time) 


240658-1 


The stages, twiddles, and butterflies for B-point FFfs 
are shown in Figures I and 2. For larger values of N, 
the dataflow patterns are very similar, with N/2 butter- 
flies executed at each stage, and a greater number of 


stages. Refer to a text on Digital Signal Processing for a 
complete discussion of FFf design, such as chapter 6 of 
Theory and Application 
of Digital Signal 
Processing (see 


the Bibliography at the end of this note). 
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Figure 1. Declmatlon-In-Frequency 
FFT for 8 points 
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Figure 2. Declmatlon-In- 
Time FFT for 8 points 
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3.0 BIT REVERSAL 


Due to their structure, FFT algorithms have the side- 
effect of scrambling the ordering of output data. For 
radix-2 FFTs, the output is in "bit-reversed" order- 
for example, the value for frequency one is NOT at 
location one in the output array, but at location N/2. 
Time to unscramble the output is often NOT included 
in FFT benchmarking, because scrambled output is fine 
for some signal-processing uses such as convolution. In 
any event, unscrambling consists of swapping the loca- 
tions of pairs of output values. Alternatively, input val- 
ues can be shuffled, as Decimation in Time usually does 
before the first stage (as shown in Figure 2). Otherwise, 
to avoid the shuffling of input in DIT, the twiddles 
must be aecessed in bit-reversed order. As an example 
of bit-reversal, for 256 points the reordering involves: 


SWAP X(i) and X(j), where i = 'klmnopqr'b and j = 
'rqponmlk'b. 
The second index (j) contains the same 
bits as (i), but in opposite order. 
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4.0 FFT IMPLEMENTATION ON THE 
i860 CPU 


Several features of the i860 CPU contribute to FFT 
performance. The floating-point multiplier and adder 
can simultaneously produce I product and 1 sum per 
cycle, using Dual-Operation 
FP instructions. To fetch 
the butterfly inputs and store outputs, Dual-Instruc- 
tion-Mode allows a memory fetch or store simultaneous 
with the multiply and add. Four floating-point numbers 
can be stored by one instruction, using the 16-byte-op- 
erand "fst.q" instruction. 
Likewise, 16 bytes can be 
fetched from the data cache in one fld.q op. 


The floating-point arithmetic 
of the i860 CPU con- 
forms to IEEE 754 format, which some DSPs fail to do. 
Shown below is code for the crucial inner loop of the 
FFT: 


Core-op 


pfld.d 
fld.d 
rra.e 
fst.q 
adds 
pfld.d 


adds 
rra.e 
fld.d 
fst.q 
bla 
and 


//-------------------- 
//inner_loop: do 2 Decimation-In-Frequency FFT butterflies. 
// 
Twelve clockS for 2 butterflies - 12 FP add/sub, a multiplies, 
// 
6 a-byte loads, 4 a-byte stores. 
// 
FP-op 
inner_loop: : 
d.r2pt.ss 
WR,DI,BnewR 
d.pfsub.ss 
AR ,BR,AnewRo 
d.ratls2.ss 
AI,BI,AnewIo 
d.i2st.ss 
WI,DR,BnewI 
d.ratlp2.ss 
AR,BR,DR 
d.ialp2.ss 
AI,BI,DI 
//-------------------- 
d.r2pt.ss 
WRo,DI,BnewRo 
d.pfsub.ss 
ARo,BRo,AnewR 
d.ratls2.ss 
AIo,BIo,AnewI 
d.i2st.ss 
WIo,DR,BnewIo 
d.ratlp2.ss 
ARo,BRo,DR 
d.ialp2.ss 
AIo,BIo,DI 
//-------------------- 
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wind (wstart),WRo 
a (fetch)++,ARo 
offset (fetch),BRo 
AnewR,16(store)++ 
wincr,wind,wind 
wind (wstart),WR 


wincr,wind,wind 
a (fetch)++,AR 
offset (fetch),BR 
BnewR, offset (store) 
decrem,count,inner_loop 
wlimit,wind,wind //modulo. 
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5.0 CODE DESIGN 


Refer to the inner.Lloop above and code listings at the 
end of this application note for the discussions that fol- 
low. Refer to the ''i860™ 
64-bit Microprocessor Pr0- 


grammer's 
Reference Manual" 
(Intel order number 
240329) for details on instructions and formats. 


The programs include both assembly and Fortran com- 
ponents. Input data can number any power of 2 from 
16 to 1024 points. The algorithms are radix-Z, floating- 
point, in-place. Included in the listing are both Decima- 
tion-in-Time and Frequency, and both complex-input 
and real-input FFfs. 


5.1 Cache Utilization 


Because the instruction cache contains 4-Kbytes, all re- 
quired code easily fits in cache. However, a 1024-point 
complex FFT fills the 8-Kbyte data cache with the in- 
put XO array. Thus the more rarely-used twiddle WO 
array is intentionally kept out of cache, as described in 
the "pfld" section. 


A subroutine ("fetch. ss") is used to move the input data 
array efficiently into cache for the 1024-point FFT. 
"Fetch" allows all data to be brought into cache using 
the next-near (NENE#) 
accesses to DRAM. Without 
that routine, getting A and B from locations separated 
by 4 Kbytes (NOT the same DRAM 
page) makes 
fetches and writebacks from DRAM for the first stage 
slower, and adds 30% to overall execution time. 


For larger FFfs 
(2048 points = 16 kB), straightfor- 
ward expansion of the present algorithm would cause 
increased cache misses. Thus a larger FFf 
should be 
broken into multiple FFfs of 1024 points so that all 10 
stages of each can achieve high cache hits. The algo- 
rithm becomes (assuming 2048 points, Decimation-In- 
Time): 


I) Bit-reverse the entire input array 
2) Do a IO-stage FFf on the second set of 1024 points. 


Cache hits should be high on those, since they were 
most recently accessed by the bit-reversal. 


3) Do a IO-stage FFf on the first 1024 points. Prefetch 
before the first stage to ensure cache hits. 


4) Combine the 2 separate 1024-point results with a fi- 
nal stage of butterflies, where A is offset from B by 
8 Kbytes. 
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5.2 Pfld 


Twiddle factors (W) are fetched with pfld (Pipelined 
Floating-Point Load), to avoid caching thc,n. Only in 
the first stage are all the WO elements used; successive 
stages use fewer and fewer elements, which are separat- 
ed by larger and larger strides. Thus placing WO in 
cache would be inefficient. The streaming of WO from 
main memory actually yields better performance than 
caching WO, for 512 and 1024 points. With the i860 
CPU's 8-byte external data bus, a complex WO value 
can be transferred in a single bus cycle. Some FFf rou- 
tines calculate WO on the fly, rather 
than fetching pre- 
calculated values; however, performance decreases due 
to the added run-time calculations. 


5.3 Fst.q 


Quad-word (Ie-byte) stores allow 4 floating-point regis- 
ter values to update the cache in one cycle. Likewise, 
fld.q (Quad Floating Point Load) transfers 4 values to 
the registers in a cycle. However, in some FFf 
stages, 


double-word fetches (fld.d) are used instead of fld.q; 
that allows the "background" fetch of a set of operands 
concurrent with arithmetic on the other set. For the 
same reason, the inner loop does 2 butterflies, rather 
than one. 


5.4 Bit Reversal Code 


The code for bit-reversal fetches the indices of 2 ele- 
ments to be swapped from a pre-allocated array of indi- 
ces, and swaps the data elements. Again, pfld.d keeps 
the indices out of cache, for the 1024 point case. That 
assembly version of bit-reversal is approximately 
7 


times faster than the standard Fortran routine. The ar- 
ray of indices was generated by printing out the values 
generated during operation of the standard Fortran ver- 
sion; similarly, the twiddle WO values can be pre-allo- 
cated and generated using a high-level- language pro- 
gram. 


6.0 PIPELINE SCHEDULING 


The adder pipeline is 3 stages, as is the multiplier; for 
the calculation of 


BnewR = (AR - 
BR) • WR 
- 
(AI - 
BI) • WI 


the adder result is fed back into the multiplier, and the 
product again feeds into the adder. The adder and mul- 
tiplier pipes each advance one stage for each floating- 
point instruction issued. 


• 
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The butterfly decomposes into 6 real add/subtracts 
and 
4 real multiplies. Thus the best possible performance 
would be 6 clocks per butterfly, with the multiplies to- 
tally overlapping the adds. The overlap is accomplished 
with the Dual-Operation instructions: 


r2pt 
(KR'src2, Treg + Mout, Iosd KR - 
srcl) 


ratls2 
(KR' Aout, srct-srcz, load T - 
Mout) 


i2st 
(KI'src2, Treg-Mout, load KI .....:srcl) 


ratlp2 
(KR'Aout, srcl +src2,load 
T - 
Mout) 


ialp2 
(KI'Aout, srcl +src2,Ioad 
KI - 
srcI) 


KR. KI, and T are operand registers feeding the multi- 
plier and adder, separate from the floating-point regis- 
ter me. They permit the 4 inputs for multiply and add, 
even thought the instruction format holds only 2 regis- 
ters. "Aout" and "Mout" are adder and multiplier out- 
puts. 


The data path arrangements of some of these ops are 
illustrated in Figures 3 and 4. Fetching and storing of 
butterfly operands is overlapped with the calculations, 
using Dual Instruction 
Mode - 
the integer core op 
(such as a load or branch) and FP op are fetched simul- 
taneously from the instruction 
cache and executed 
simultaneously. 


Scheduling of instructions was done with a pipeline dia- 
gram, as illustrated in the comments of the code listing 


erez 
rdest 
,rc1 
I 


KR 


op1 
op2 


•• ULTIPlIER 
UNIT 


ADDER 
UNIT 


RESULT 


'2pl" 
,211 
240658-4 


Figure 3. Datapath 
for r2pt op 
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of difstep.ss in the Appendix. (The comments show the 
machine state after the instruction is processed.) Begin 
by placing the desired results in the rightmost column, 
then tracing progress backwards through the adder. 
When adder inputs are products (of the multiplier), one 
product is kept in the Treg for a cycle while the other 
propogates through the multiplier final stage. Those 
products can be traced back on the multiplier pipeline, 
to determine at what instruction the multiplier inputs 
must be provided. 


For example, place the BnewR label in the "Write" 
stage of the pipe (the output of the Adder). Now 


BnewR = WR • OR - 
WI • 01 


Three instructions earlier, the adder inputs for BnewR 
must be fed to adder; those inputs are products, one of 
which comes directly from the multiplier output, and 
the other from the Treg. The multiplier output and 
Treg value must then be traced back through multiplier 
stages, requiring the following instructions: 


i2st.ss 
Wlo,OR,Bnewlo 
as the 10th op of 12, to start (T - 
Mout) 


rat1s2.ss 
Alo,Blo,Anewl 
as the 9th instruction, to update the Treg 


ialp2.ss 
AI,BI,DI 
as the 6th op, to multiply 01 ' WI 


ratlp2.ss 
AR,BR,OR 
as the Sth op, to multiply OR ' WR 


rat1s2.ss 
AI,BI,Anewlo 
as the 3rd, 10 start 01into the adder 


pfsub.ss 
AR,BR,AnewRo 
as the 2nd, to start OR into the adder 


are1 


"UL TIPLIER 
UNIT 


ADDER 
UNIT 


RESULT 


rallp2 " ,0\112 
240658-5 


Figure 4. Datapath 
for rat1p2 op 


Some trial-and-error ordering of the desired outputs is 
needed to devise a sequence which keeps the adder 
pipeline full. An op is chosen for each slot for its ability 
to load the KR or KI register, or to initiate an adder 
operation simultaneous with the multiplies required to 
calculate BnewR and Bnew!. 


Handy hints to assist dual-operation 
scheduling in- 
clude: 


I) Feedback the adder result to the multiplier, or visa 
versa, whenever possible. For example, the ratip2 
op feeds adder-out to multiplier. Thus both srcl and 
src2 fields of the instruction are available to feed the 
adder-in, and a simultaneous useful add and multi- 
ply are initiated. 


2) Freeze one of the pipes, by using a pfadd or pfmul, 
when appropriate. In the butterfly, where 6 adds are 
done for every 4 multiplies, freezing of the multipli- 
er does not degrade performance. The freeze allows 
multiplier results to be held until needed in the ad- 
der. 


3) The Treg can hold a multiplier result for several 
cycles until needed in the adder. 


4) Unroll a loop to do 2 iterations per loop. That pro- 


vides time to fetch inputs for iteration 2 while calcu- 
lating iteration 
I, and store results of iteration 
I 
(and fetch more inputs) while calculting iteration 2. 


7.0 
PERFORMANCE 
MEASUREMENTS 


The code was run on an evaluation card with DRAM 
memory only, no external cache, 33.33 MHz clock, and 
5 wait-states or more for some accesses. Next-near ac- 
cesses (address falls into the same DRAM page as the 
previous access) are zero wait-state, but far accesses 
take 5 or more wait-states. The code was run under a 
virtual-memory multitasking executive. Shown below 
are measured results: 


System: 
33.3 MHz 
80860 with 
a single bank 
of 
static-column DRAM 


Algorithm: Radix-2 FFf, 
in-place. Data is IEEE 754 
single-precision floating point. Implemented in assem- 
bly-language and Fortran code. 
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Time 
TypeofFFT 
Time 
(Including 
blt~reversal) 


1024-point-complex,DIF 
1.17 ms 
1.33 ms 
1024-point-real 
0.67 ms 
512-point-complex,DIF 
0.48 ms 
0.56 ms 
512-point-real 
0.33 ms 
256-point-complex, 
DIF 
0.22 ms 
0.26 ms 
1024-point-complex,DIT 
1.37 ms 
512-point-complex, 
DIT 
0.59 ms 


7.1 Cache Fill and Write back Time 


Measured times do not include cache-fill and write- 
back. That is, the timings measured 200,000 executions 
of the FFf 
using the same input array. (performance 


figures offered by other manufacturers for DSP chips 
likewise assume that the data is already in on-chip 
RAM. Of course, the i860 CPU will do that fetching 
automatically into its data cache.) The additional time 
for cache fill and writeback were measured as: 


1024-point-complex 0.25 ms 
(8 Kbytes fetched, 
8 Kbytes writeback) 


512-point-complex 0.12 ms (4 Kbytes) 


To quantify the calculations in MFlops (Millions of 
FLoating-point OPerations per Second), consider that 
the 
1024-point complex FFf 
is implemented 
with 


about 
16,400 multiplies and 28,700 adds/subtracts. 


Thus the 1.17 ms translates to a sustained 38.5 MFlops 
rate. For 512 points, the required 20,000 Flops means 
41.6 MFlops. 


The overall FFf is about 10 times faster than the equiv- 
alent Fortran. Inner loop performance was measured at 
13cycles for the 24 instructions, which is 6.5 cycles per 
butterfly. 
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8.0 CODE HIERARCHY 
Pictured below are the programs developed for the i860 CPU FFf: 


I 
bltr.v.ssI 


fetch 


~' 


dlfU 
ff\.f 
I 
I 
dlfst.p." 
I 
r.alflx.ss 


240658-6 


The Fortran program ffttest.f is the highest-level pro- 
gram of those listed on the following pages. It calls two 
FFf 
subroutines, diff.f and fft.f, then compares their 
outputs. 
Fft.f is a Fortran 
decimation-in-time 
algo- 
rithm, while diff.f is the high-speed DIF routine. Diff.f 
is callable by C or Fortran applications. It in turn calls 
difstep, which is implemented in assembly code (difs- 
tep.ss). Difstep is called once per stage of the FFf. 
A 
Fortran version (difstepf.f) is shown, for comparison. 
Other assembly routines are the bit-reversal-data-move- 
ment (bitreT.ss) and prefetch ("fetch" inside bitrev.ss). 


Difstep.ss 
contains approximately 
225 assembly in- 
structions, and bitrev.ss contains about 24. The Fortran 
diff.f compiles to about 80 instructions. 


A Decimation-in- Time version of diff.f and difstep.ss 
can be found in ditt.f and ditstep.ss. The DIT version 
performs 5-10% slower than the Decimation-in-Fre- 
quency because the DIT loop takes 7 cycles per butter- 
fly, while DIF takes 6. 


A real-input algorithm is dirr.f, which can be called 
and tested using program real.f. Dirr.f calls difstep to 
do a complex DIF FFf 
on N real data points, but 
treats them as N/2 complex points. Then realfix.ss is 
called by dirr.fto fix the DIF output, compensating for 
the treatment of the N real points as N/2 complex. The 
derivation of the real-fix can be found in reference 3, 
Numerical 
Recipes in C. 


The mixture of Fortran, C, and assembly code is ac- 
complished by passing function inputs and outputs in 
registers. Only pointers and integer values were used in 
the above code, but floating point parameters can also 
be exchanged. A calling program feeds arguments to a 
function in rl6, r17, and higher-numbered integer reg- 
isters. The callee is permitted to destroy the contents of 
those registers, but rl:rl5 
must be preserved. For more 
details on parameter-passing conventions see the i860 
M-bit Microprocessor 
Programmer's 
Reference 
Manual,' 


Chapter 8. 
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9.0 CONCLUSION 


The i860 CPU computes very Fast Fourier Transforms, 
quicker than most high-end dedicated DSP chips. Con- 
tributing to the FFf 
performance are the 8-kByte on- 
chip data cache and 4-kByte instruction cache. Also the 
8-byte external data bus, pfld instruction, and l6-byte 
data cache width provide sufficient bandwidth to keep 
the arithmetic units busy. Dual-Operation instructions 
and Dual-Instruction-Mode 
allow parallel data move- 
ment and calculations. The 33.3 MHz clock rate allows 
both an add and a multiply every 30 ns, giving a time of 
1.17 ms for a 1024-point complex FFf. A 40 MHz i860 
Microprocessor will yield a time of less than 1 mSec. 
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[Numerical Recipes contains the C-code source for 
"realfix"] 


1) diff.f: 
Fortran module to do fast Decimation-In-Frequency 
(DIF) Radix-2 FFf. 


2) difstep.ss: 
Assembly code which does all DIF FFf 
butterflies; called by diff.f. 


3) difstepf.f: 
Fortran equivalent of difstep.ss. Included here for clarity. 


4) bitrev.ss: 


Assembly code to do bit-reversal. 


5) ffitest.f: 
Highest-level Fortran code. Tests diff.f or ditt.f. 


6) ditt.f: 


Fortran module to do fast Decimation-In-Time (DIT) Radix-2 FFT. 


7) ditstep.ss: 


Assembly code which does all DIT FFf 
butterflies; called by ditt.f. 


8) dirr.f: 


Fortran module for Real-Input Decimation-In-Frequency 
(DIF) Radix-2 FFf. 


9) realfix.ss: 


Assembly code required by dirr.f to compensate for Real-Input. 


10) real.f: 


Highest-level Fortran code, for Real-value input. Tests dirr.f. 


11) fft.f: 
Fortran FFT algorithm. Generates "correct" answers for comparison against the other code. 


12) makefile: 


Unix V/386 version of a makefile to maintain the FFT code, using the Unix "make" program-mainte- 
nance utility. Note that this makefile uses the Unix macro preprocessor "m4" to convert symbolic names 
to register numbers. 


13) start.ss: 
Assembly code preamble for Fortran runtime. 
14) time.c: 
Dummy routine, used to install breakpoints. 


APPENDIX 
A 
PROGRAM LISTINGS 


Pg. 


A-2 


A-3 


A-11 


A-13 


A-17 


A-21 


A-22 


A-30 


A-31 


A-36 


A-40 


A-43 


A-45 


A-45 
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C------------------- 
C File: 
dif'f'. 
f 
C 
FFT 
- Decimation 
in Freq, 
radix-2, 
inplace, 
l-dimen 


C Intel 
assumes 
no responsibility 
for use 
or misuse 
of this 
code. 


C 
5/19/89: 
call 
fetch8() 
added 
for 1024-point 
caching. 
C 
6/01/89: 
fetch() 
CRUCIAL-30% 
performance 
loss 
if removed 


C Inputs: 
. 


C 
A= 
complex 
array 
of input, 
up to 1024 pts, 
single-prec 
float 
C 
M= log of number 
of pts 
C 
= 
(number 
of stages 
of FFT) 
C 
N = number 
of points. 
ie, N= 2**M = number 
of pts 


'C 
W= complex 
array 
of twiddle 
factors, 
length 
N/2. 
C 
REV= 
0 if bitreversed 
output 
ok. l=must 
re-order 
output 
C 
C 
C 
C 


Outputs: 
A= 
complex 
fft of input A 


subroutine 
diff(a,m,N,W,REV) 
integer 
m,N, 
i, j,k, REV,wlimit 
integer 
offset, 
stage, 
groups, 
wincr,powers2(0:10) 
complex 
a(n),w(N/2),temp 


data powers2 
/1,2,4,8,16,32,64,128,256,512,1024/ 
C Powers2 
to avoid 
calls 
to POW, 
DIV 


C Twiddle 
factor 
array 
w(k) 
has 
(cos,-sin) 
of 2pi*k/N 
CC Assume 
the 
caller 
provides 
w(k) 
constants 
ALREADY 
initialized 
C------- ..---- 
C Pre-touch 
data, 
lock 
into 
cache, 
for 8kByte 
fft: 
IF 
(N .gt. 513) 
THEN 
call 
fetch(a,%VAL(n)) 
ENDIF 
C------------ 
wlimit 
= 8*((N/2) 
- 1) 


C "DO 20" 
stage-loop 
DO 20 stage 
= l.m 
groups 
= powers2(stage-l) 
C groups=number 
of times 
the twiddle 
factors 
are used, 
ie, the number 
of 
C 
smaller 
DFTs 
the 
stage 
is split 
into. 


C 
offset 
gets 
N/2.N/4.N/B,N/16 
•••• 
offset 
= powers2(m-stage) 
wincr 
= groups 
call d1fstep(a,w.groups.offset.wincr.wlimit) 
20 CONTINUE 


IF 
(REV .ne. 0) THEN 
cc REV 
.ne. 0 means 
must 
do bit-reversal 
reordering 
of output 
call 
bitrev(a.%VAL(M) 
.n) 
ENDIF 


RETURN 
END 
C------------ 
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11-------------- 
11 difstep.ss: 
do one stage 
of fft butterflies 
11 DIF = Decimation 
in Frequency. 
radix-2. 
inplace. 
l-dimension 
11 (C) Copyright 
1989 
INTEL 
Corporation. 
11 
Inner 
loop 
developed 
with 
assistance 
from 
Tricord 
Systems. 
Inc. 
11-------------- 
11 5/18/89: 
1 pm 
- offset_2 
added. 
as next-to-last 
stage was 
slow 
11 5/19/89: 4 pm 
- fetch8() 
routine 
added. 
for cache 
miss 
avoidance. 
11 5/31/89: 
am 
- use 
fst.q 
(13% perf 
improvement 
of inner_loop!) 
11 
last_bfly 
added. 
for performance. 
11 6/02/89: 
am 
- bptr 
deleted. 
Modulo-address 
W 
(5% perf 
improved) 
11------------ 
11 Intel 
is not 
responsible 
for use nor 
for misuse 
of this 
program. 
11------------ 
11 Do one entire 
stage 
(n/2 butterflies). 
Sample 
invocation: 
11 
call 
difstep(a.w.groups.offset.wincr.wlimit) 
11 
=== 
11 
11 
11 
11 
W= 
11 
11 
11 
11 
11 
11 
11 
11 
11 
11 Outputs: 
11 
A= 
complex 
radix-2 
butterflied 
version 
of input. 
II--------~---------- 
define (astart. 
r16) 
Ilinput data 
base 
address 
define (wstart.r17) 
Iltwiddle 
array 
ptr. 
Because 
w-contents 
depend 
on N. 
11 we will 
assume 
the 
caller 
has 
initialized 
w() 
array. 
define (groups.r18) 
Ilgroups=number 
of sub-DFTs 
this 
stage 
is split 
into. 
define(offset.r19) 
Iloffset (initially 
elements. 
mult 
by 8 to get bytes) 
11 between 
node 
and 
its dual 
(the 2 numbers 
to butterfly. 
ie. A and B) 
define (wincr.r20) 
Ilincrement 
between 
successive 
W values. 
Remains 
constant 
11 within 
a given 
stage. 
For Decimation 
in Freq. 
wincr 
addressing 
is: 
11 
+8 for offset=N/2 
(WO.Wl.W2.W3 ••••W(n-l)) 
11 
+16 
offset=N/4 
(WO. 
W2. 
W4.... 
) etc ••• 
defineCwlimit.r21) 
IImax 
index. 
in bytes. 
of W table. 
define (wind.r22) 
Ilcurrent 
index. 
in bytes. 
of W table. 
define (offset2.r23) 
Iloffset*2 


Inputs: 
A= 
complex 
array 
of input. 
single-prec 
float 
(complex 
stored 
as 4byte 
real. 
4byte 
imag 
contiguously) 
pointer 
to array 
of twiddle 
factors. 
Assuming 
W(k) 
is 
CMPLX(cos(2pi*k/N)) 
.-sin(2pi*k/N)) 
for k=O to 
(N/2)-1. 
offset 
= distance 
(except 
for scale-by-8byte 
sizeof(complex)) 
between 
the 2 input 
values 
for each 
butterfly. 
Offset 
also 
is the number 
of butterflies 
done 
per 
"group". 
groups 
= N/(2*offset). 
The number 
of sub-DFTs 
this 
stage 
is split 
into. 
wincr 
= distance 
(except 
for scale-by-8byte 
sizeof(complex)) 
between 
successive 
w values 
for successive 
butterflies 
wlimit 
=max 
index. 
in bytes. 
of W table. 


define (decrem.r24) 
Ilbla 
decrement 
define (somecount.r25) 
11 bla 
counter 


define (FEtch. 
r26) 
define(STore.r27) 
Ilpointer 
to 1st component 
of butterfly 
(load) 
11 " " 1st component 
of butterfly 
(store) 
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define (FI, f25) 11 • 
define (FRo,f26) 
define(FIo,f27) 


imag • 


define (DR, f28) IIDifference 
of A-B, real part 
define(DI, 
f29) 11 • ., imag • 
define(WR, 
f30) IIW 
(twiddle factor), 
real part 
define (WI, f3l) 11 • • , imag 
define (WRo,flO) IIW 
(twiddle factor), 
real part 
(EXTRA copy) 
definelWIo,fll) 
11 • • , imag 


.text 
.align 
.quad 
_difstep_: : 
ld.l 
O(groups) ,groups 
ld.l 
O(offset) ,offset 
shl 
3,offset,offset 
shl 
l,offset,offset2 


Ilfix 
Fortran 
call-by-ref 
11 
I 


11 change 
from elements 
to bytes 


fst.q 
fst.q 
f8 ,-16(sp)++ 
Ilsave 
·local· regs 
f12,-16(sp)++ 
11 •• 


adds 
adds 
-l,groups,groups 
11 pre-decrement 
for bnc usage, 
or bla usage 
-16,rO,decrem 
Ilbla decrement 


11 
We code the last 2 stages as special 
cases: 
11-------- 
xor 
8,offset, rO Iloffset=l, special case, no complex mul t, funny addressing 
bcoffset_lll 
(ASSUMING offset=l 
means wincr=O, 
and no twiddle 
used) 
xor 
l6,offset,rO Iloffset=2, 
special case, no complex mult, funny addressing 
bcoffset_211(ASSUMING 
offset=2 
means wincr=N/4) 
11-------- 
ld.l 
O(wincr),wincr 
ld.l 
O(wlimit),wlimit 
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pfadd.ss 
fO,fO,fO 
pfadd.ss 
fO,fO,fO 
pfadd.ss 
fO,fO,fO 
// init Al,A2,A3=0 
pfmul.ss 
fO,fO,fO 
pfmul.ss 
fO,fO,fO 
pfmul.ss 
fO,fO,fO 
//-------- 
// init pointers: 
shl 
3,wincr,wincr 
shl 
l,wincr,wind 
//scale 
for bytes. 
//init wind =2*wincr 


pfld.d 
0 
( wstart),fO 
pfld.d 
wincr 
( wstart),fO 
adds 
-8,astart,FEtch 
pfld.d 
wind 
(wstart),fO 
adds 
wincr,wind,wind 
//wind 
now 3*wincr 
// here 
fetch 
first 
set of A,B,W 
before 
bla-loop 
pfld.d 
wind 
(wstart),WR 
adds 
wincr,wind,wind 
and 
wlimit,wind,wind 
//modulo-wlimit 
the w index 
// We do modulo-addressing 
on W(), 
to keep the pfld pipeline 
full. We 
// 
never 
do a W-fetch 
beyond 
the end of the table. 
// And the modulo-check 
needs 
to be done 
only every 4th pfld, 
as always 
// 
we use a multiple 
of 4 W() 
factors. 


fld.d 8 
(FEtch)++,AR 
fld.d 
offset 
(FEtch),BR 
d.r2apl.ss 
fO,fO,fO 
//clear 
Treg. 
adds -32,offset,somecount 
// bla counter 
(predecrement 
by 4 elements) 
// ----------- 
// Definitions 
for pipe diagram: 
// 
(the complex 
multiply 
product, 
F, broken 
into 4 real mult 
and 2 adds) : 
// 
WR = cos(), WI=-sin(). 
// 
DR = AR - BR; 
(diffence 
of Real 
components 
of A,B) 
// 
DI = AI - BI; 
(diffence 
of Imag components) 
// 
ER = AR + BR; El = AI + BI; 
// 
FR = K - L; where 
K= WR*DR, 
L=WI*DI 
// 
FI = N + M; where 
M= WI*DR, 
N=WR*DI 


// For 1st time thru 
inner_loop, 
don't have 
correct 
values 
to store. 
// Must 
do 1 loop before 
the loop, 
sans the stqres. 


first_bfly:: 
//fill 
pipe 
// KR •••KI •••Ml ••••M2 ••••M3 
T 
d.r2pt.ss 
WR,fO,fO 
// WRO 
pfld.d 
wind 
(wstart) ,WRo 
d.pfsub.ss 
AR,BR,fO 
// 
fld.d 8 
(FEtch)++,ARo 
d.ratls2.ss 
AI,BI,fO 
// 
fld.d 
offset 
(FEtch),BRo 
d.i2st.ss 
WI,fO,fO 
// 
WIO 
adds 
wincr,wind,wind 


Al ••••A2 ••••A3 ••••Write 


DRO 


DIO 
DRO 


DIO 
DRO 


IlfJllIlIl! 0.0IMlOOO&.1iiW 
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KO 
ERO - 
DIO 
DRO 


LO 
KO 
EIO 
ERO - 
DIO 


NO 
LO 
KO 
EIO 
ERO 


NO 
LO 
KO 
DRl - 
EIO 
ERO 


NO 
LO 
KO 
DIl 
DRl - 
EIO 


MO 
NO 
KO 
K-L 
DIl 
DRl 


d.ratlp2.ss 
AR,BR,DR 
// 
nop 
d.ialp2.ss 
AI,BI,DI 
// 
pfld.d 
wind 
(wstart) ,WR 
d.r2pt.ss 
WRo,DI,fO 
// WRl 
- 
fld.d 
a 
(FEtch)++,AR 
d.pfsub.ss 
ARo,BRo,ER 
// 
fld.d 
offset 
(FEtch) ,BR 
d.ratls2.ss 
Alo,Blo,EI 
// 
adds 
wincr,wind,wind 
d.i2st.ss 
Wlo,DR,fO 
// 
WIl 
and 
wlimit 
,wind,wind 


quickstart: : 
d.ratlp2.ss 
ARo,BRo,DR 
// 
Kl 
MO 
NO 
ERl 
FRO 
DIl 
DRl 
bla 
decrem,somecount,inner_loop 
//init 
LCC 
d.ialp2.ss 
Alo,Blo,DI 
// 
Ll 
Kl 
MO 
NO 
Ell 
ERl 
FRO 
DIl 
adds 
-16,astart,STore 
// ptrs 
init 16 low, 
for fst.q 
instructions 
//------------------- 
// Each 
butterfly = 1 complx multiply, 
1 complx add, 
1 complx 
subtract 
// = 
4 multiply, 
// 
3 add 
// 
3 subtract 
// 
3 a-byte 
fetches 
(A, B, W) 
// 
2 a-byte 
stores 
(A, B) 
11 
// 6 cycles 
per butterfly 
11 
// inner_loop: 
iterates 
"offset/2" 
times 
(eg, N/4 for stage 
1, N/a for stage2), 
// 
for each group. 
It does 2 butterflies 
per iteration 


// KR •••KI •••loIl 
•••1oI2 
••1oI3 
11 I 
I 
I 
I 
I 
d.r2pt.ss 
WR,DI,FR 
// WR2 
- 
Nl 
Ll 
Kl 
pfld.d 
wind 
(wstart),WRo 
d.pfsub.ss 
AR,BR,ERo 
// 
fld.d 
a 
(FEtch)++,ARo 
d.ratls2.ss 
AI,BI,Elo 
// 
fld.d 
offset 
(FEtch),BRo 
d.i2st.ss 
WI,DR,FI 
// 
fst.q ER,16(STore)++ 
//update 
d.ratlp2.ss 
AR,BR,DR 
// 
adds 
wincr,wind,wind 
d.ialp2.ss 
AI,BI,DI 
// 
L2 
K2 
loll Nl 
EI2 
ER2 
FRl 
DI2 
//no need 
for modulo-check 
("and") here, 
as odd num of W's have been 
fetched. 
pfld.d 
wind 
(wstart) ,WR 
// 
. 


T 
I 
NO 


Al ••A2 •••A3 ••Write 
I 
I 
I 
I 
N+1oI Ell 
ERl 
FRO 


Nl 
Ll 
Kl 
NO 
DR2 
FIO 
Ell 
ERl 


Nl 
Ll 
Kl 
DI2 
DR2 
FIO 
Ell 


WI2 
loll 
ER/EI/ERo/Elo 
K2 
loll 


Nl 
Kl 
K-L 
DI2 
DR2 
FIO 


ER2 
FRl 
DI2 
Nl 
DR2 
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II KR •••KI •••Ml ••••M2 ••••M3 
T 
Al ••••A2 ••••A3 ••••Write 
d.r2pt.ss 
WRo,DI,FRo 
II WR3 
N2 
L2 
K2 
Nl 
N+M 
EI2 
ER2 
FRl 
adds 
wincr,wind,wind 
d.ptsub.ss 
ARo,BRo,ERII 
N2 
L2 
K2 
Nl 
DR3 
FIl 
EI2 
ER2 
tld.d 8 
(FEtch)++,AR 
d.ratls2.ss 
Alo,Blo,EIII 
N2 
L2 
K2 
DI3 
DR3 
FIl 
EI2 
tld.d 
ottset 
(FEtch) ,BR 
d.i2st.ss 
Wlo,DR,Floll 
WI3 
M2 
N2 
K2 
K-L 
DI3 
DR3 
FIl 
tst.q 
FR, ottset 
(STore) 
Ilupdate 
FR/FI/FRo/Flo 
d.ratlp2.ss 
ARo,BRo,DRII 
K3 
M2 
N2 
ER3 
FR2 
DI3 
DR3 
bla decrem,somecount, 
inner_loop 
d.ialp2.ss 
Alo,Blo,DIII 
L3 
K3 
M2 
N2 
EI3 
ER3 
FR2 
DI3 
and 
wlimit,wind,wind 
Ilmodulo. 


end_inner_loop:: 
llKEEP Pipelines 
tull 
II RE-init 
pointers 
tor tetches 
d.tiadd.ss 
to,tO,tO 
adds 
ottset2,astart,astart 
Ilbump 
to next 
group 
Ilredo A,B tetches, 
with proper 
ptr. 
d.tiadd.ss 
to,tO,tO 
tld.d O(astart) 
,AR Ilget tirst AR/AI 
in next group 
d.tiadd.ss 
to,tO,tO 
tld.d 
ottset 
(astart) ,BR 
d.tiadd.ss 
to,tO,tO 
adds 
O,astart,FEtch 


last_btly:: 
lIdo tinal 2 buttertlies, 
start next 
II KR •••KI •••Ml ••••M2 ••••M3 
d.r2pt.ss 
WR,DI,FR 
II WR4 
- 
N3 
L3 
K3 
ptld.d 
wind 
(wstart),WRo 
d.ptsub.ss 
AR,BR,ERo 
II 
tld.d 
8 (FEtch)++,ARo 
d.ratls2.ss 
AI,BI,EIoll 
tld.d 
ottset 
(FEtch) ,BRo 
d.i2st.ss 
WI,DR,FI 
II 
tst.q ER,16(STore)++ 
d.ratlp2.ss 
AR,BR,DR 
II 
adds 
wincr,wind,wind 
d.ialp2.ss 
AI,BI,DI 
II 
ptld.d 
wind 
(wstart),WR 
II· ....................•.......................................•.... 
II KR •••KI •••Ml ••••M2 ••••M3 
T 
Al ••••A2 ••••A3 ••••Write 
d.r2pt.ss 
WRo,DI,FRo II WR5 
N4 
L4 
K4 
N3 
N+M 
EI4 
ER4 
FR3 
tld.d 
8 
(FEtch)++,AR 
d.ptsub.ss 
ARo,BRo,ERII 
adds -32,ottset,somecount 
d.ratls2.ss 
Alo,Blo,EIII 
adds 
wincr,wind,wind 
d.i2st.ss 
WIo,DR,FIoll 
adds -l,groups,groups 
d.tnop 
tld.d 
ottset 
(FEtch),BR 
d.tnop 
bnc.t 
quickstart 
Ilbranch 
on value ot groups 
d.tnop 
tst.q 
FR, ottset 
(STore) 


group 
T 
N2 
Al ••••A2 ••••A3 ••••Write 
N+M 
EI3 
ER3 
FR2 


L3 
DR4 
FI2 
EI3 
ER3 
N3 
K3 
N2 


DR4 
N3 
L3 
K3 
DI4 
FI2 
EI3 


WI4 
M3 
K-L 
DR4 
FI2 
N3 
K3 
DI4 


ER4 
FR3 
K4 
M3 
N3 
DI4 
DR4 


L4 
FR3 
K4 
M3 
N3 
EI4 
ER4 
DI4 


N4 
L4 
K4 
II reset bla counter 
N4 
L4 


N3 
DR5 
FI3 
EI4 
ER4 


K4 
DI5 
DR5 
FI3 
EI4 


WI5 
M4 
N4 
K4 
K-L 
DI5 
DR5 
FI3 
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end_last_bf'ly: : 
d.fnop 
br endit 
fiadd.ss 
fO.fO.fO 
fst.q 
FR. 
offset 
(STore) 
Ilrepeated 
for bnc.t 
untaken 
case 
.align 
•quad 
11========================================= 
offset_l: : 
11 want 
FEtch=O.2.4.6.8 
•••• elements. 
ASSUMING 
wincr=O. 
11 
and that w=(l.O). 
so that 
no complex 
mult 
needed. 
a~d NO W will 
be fetched. 
11 
E=A+B. 
F=A-B. 
(Per double-butterfly 
loop: 
8 pfadd.4 
dword 
fld. 4 fst. 
11 
1 bla) (fld.q required. 
to reduce # flds to avoid 
pipe 
stailS) 
11 
Performance 
= 4 cyc/bfly 
best 
case. 


IIRedefine 
regs 
define (AR3.f'l2) 
define (AI3.f'l3) 
define (BR3.f'l4) 
define (BI3.f'l5) 
define (AR4.f'l6) 
define (AI4.f'l7) 
define (BR4.f'l8) 
define (BI4. f'l9) 


for fld.q.fst.q 
usage. 
when 
A and B adjacent: 
Ilelement 
A. real 
component 
11 • ". 
imag 
Ilelement 
B. real 
component 


11 extra 
A value. 
for prefetch 


11 extra 
A value. 
for prefetch 


define (ER3. f20) IIA+B. real 
(ER = AR + BR) 
define(EI3. 
f21) 11· 
imag" 
define (FR3. f22) II(A-B). real 
define(FI3. 
f23) 11" 
imag 
" 


define (ER4.f24) 
IIA+B.·real. 
extra 
copy 
define(EI4.f25) 
11· 
imag 


define (FR4.f26) 
define (FI4.f27) 
11================== 
adds 
-16.astart.FEtch 
fld.q 
16 
(FEtch)++.AR4 
adds 
-l.groups.somecount 
11 bla 
counter 
(predecremented 
already 
by 1) 
Ilusing groups=blacount 
on the 
offset_l 
loop. 
intentionally. 
adds 
-16.FEtch.STore 
Ilstartup the loop: 
11 ---------------------11 
Al •••••• A2 •••••• A3 •••••• Write: 
d.pfadd.ss 
AR4.BR4.fO 
11 ARn+BRn 
- 
fld.q 
16 
(FEtch)++.AR3 
d.pfadd.ss 
AI4.BI4.fO 
11 AIn+BIn 
ERn 
adds 
-2.rO.decrem 
112 bflies 
per 
loop 
d.pfsub.ss 
AR4.BR4.fO 
11 ARn-BRn 
EIn 
ERn 
bla 
decrem.somecount. 
offsetl_loop 
Ilinit LCC 
d.pfsub.ss 
AI4.BI4.ER4 
11 AIn-BIn 
FRn 
EIn 
ERnext 
nop 
11 ---------------------11 
Al •••••• A2 ••••••A3 •••••• Write: 
offsetl_loop: 
: 
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rST..q 
.I!.~').J.\j\;'l:OreJTT 
11-------------------------- 
end_offsetl_loop:: 
d.fiadd.ss 
fO.fO.fO 
br endit 
fiadd.ss 
fO.fO.fO 
nop 
11-------------------------- 
.align 
.quad 
offset_2: : 
II want 
FEtch=O.1;4.5;8.9;12.l3; 
••• elements. 
II ASSUMING 
wincr=N/4 
(W_addr=O.N/4.0.N/4.0 
•••• ). Trivial 
W() 
factors. 
II USE 
bla 
loop. 
incrementing 
FEtch 
by 16 
(2*offset). 
II 
Even-indexed 
elements 
identical 
to offset_l.W=WO. 
no complex 
multo 
II 
SO FReven=(AR-BR). 
FIeven=(AI-BI). 
II 
Odd 
components 
have 
W=(O.-l). 
So FRodd=(AI-BI). 
FIodd=(BR-AR). 
II 
Each 
fld.q 
fetches 
AReven.AIeven.ARodd.AIodd. 


IIAssume 
ER.EI.ERo.EIo 
are 4 contiguous 
regs. 
IIAssume 
FR.FI.FRo.FIo 
are 4 contiguous 
regs. 


adds 
-16.astart.FEtch 
fld.q 
16 
(FEtch)++.AR 
fld.q 
16 
(FEtch)++.BR 
adds 
O.groups.somecount 
Ilbla 
counter 
Ilstartup 
the loop: 
/I -------------// 
pfadd.ss 
AR 
.BR .fO 
II 
pfadd.ss 
AI 
.BI .fO 
II 
d.pfadd.ss 
ARo.BRo.fO 
II 
nop 
d.pfadd.ss 
AIo.BIo.ER 
II AIo+BIo 
ERo 
nop 
d.pfsub.ss 
AR 
.BR .EI 
II AR-BRe 
EIo 
adds 
-l.rO.decrem 
112 bflies 
per 
d.pfsub.ss 
AI 
.BI .ERo 
II AI-BIe 
FR 
adds 
-16.astart.STore 
d.pfsub.ss 
AIo.BIo.EIo 
II AIo-BIo 
FI 
FR 
bla 
decrem.somecount. 
offset2_loop 
Ilinit 
LCC 
d.pfsub.ss 
BRo.ARo.FR 
II BRo-ARo 
FRo 
FI 
nop 


Al •••••• A2 •••••• A3 •••••• Write: 
AR+BRe 
AI+BIe 
ER 
ARo+BRo 
El 
ER 


El 
ER 


ERo 
El 
loop. but 
groups 
is half 
desired 
value. 
EIo 
ERo 


EIo 


FR 


• 
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offset2_1oop:: 
d.fnop 
fld.q 
16 (FEtch)++,AR 
//fetch AR,AI,ARo,Alo 
d.fnop 
fld.q 
16 (FEtch)++,BR 
//fetch 
BR,BI,BRo,Blo 
// ---------------------// 
Al ••••••A2 ••••••A3 ••••••Write: 
d.pfadd.ss 
AR 
,BR ,FI 
// AR+BRe 
Flo 
FRo 
FI 
nop 
d.pfadd.ss 
AI 
,BI ,FRo 
// AI+Ble 
ER 
Flo 
FRo 
nop 
d.pfadd.ss 
ARo,BRo,Flo 
// ARo+BRo 
El 
ER 
Flo 
fst.q 
ER 
,16(STore)++ 
//update 
ER 
,El ,ERo,Elo 
d.pfadd.ss 
Alo,Blo,ER 
// Alo+Blo 
ERo 
El 
ER 
nop 
d.pfsub.ss 
AR 
,BR ,El 
// AR-BRe 
Elo 
ERo 
El 
nop 
d.pfsub.ss 
AI 
,BI ,ERo 
// AI-Ble 
FR 
Elo 
ERo 
fst.q 
FR 
,16(STore)++ 
d.pfsub.ss 
Alo,Blo,Elo 
// Alo-Blo 
FI 
FR 
Elo 
bla decrem,somecount,offset2_100p 
d.pfsub.ss 
BRo,ARo,FR 
// BRo-ARo 
FRo 
FI 
FR 
nop 


endit: : 
// restore 
regs 
fiadd.ss 
fO,fO,fO 
//exit DIM 
fld.q 
O(sp) ,f12 
fiadd.ss 
fO,fO,fO 
//last DIM pair 
fld.q 
16(sp),f8 
adds 
32,sp,sp 
bri rl 
nop 
//---------------------------------- 
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c------------------------------------------------------------ 
c difstepf.f: 
do one stage 
of fft 
(DIF) butterflies 
c 
(C) Copyright 
1989 
INTEL 
Corporation. 
ALL RIGHTS 
RESERVED. 
c------------------------------------------------------------ 
c Decimation 
in Freq, 
radix-2, 
inplace, 
l-dimen 
c 6/20/89 


c Do one entire 
stage 
(n/2 butterflies). 
Sample 
invocation: 
c 
call difstep(a,w,groups,offset,wincr) 


c Inputs: 
c 
A= complex 
array 
of input, 
single-prec 
float 
c 
(complex 
stored 
as 4byte 
real, 
4byte 
imag contiguously) 
c 
W= pointer 
to array 
of twiddle 
factors. 
Assuming 
W(k) 
is 


-c 
CMPLX(cos(2pi*k/N»,-sin(2pi*k/N» 
for k=O to 
(N/2)-1. 
c 
offset 
= distance 
(in "elements") 
between 
c 
the 2 input values 
for each butterfly 
c 
groups 
= number 
of sub-DFTs 
this 
stage 
is split 
into. 
c 
,(groups*offset*2 
= N) 
c 
wincr 
= distance 
between 
successive 
w values 
for successive 
butterflies 
c 
c Outputs: 
c A= 
complex 
butterflied 
version 
of input. 


SUBROUTINE 
difstep(a,w,groups,offset,wincr) 
integer 
groups,offset,wincr 
integer 
i,j,indexl,iplus 
complex 
a(groups*offset*2),w(groups*offset) 
,wtemp,temp 
c-------------------------------------------------------- 
c We implement 
a ••• 
c Special 
case 
for offset=l(last 
stage): 
no complex 
multiplies, 
simple 
add 
c 
(Performance 
enhancement) 
IF 
(offset 
.eq. 1) THEN 
CVD$ NODEPCHK 
DO 8 i = 1, (2*groups) ,2 
iplus = i + 1 
' 


temp = a(iplus) 
a(iplus) 
= a(i) - temp 
8 
a(i) = a(i) + temp 
ELSE 
C------------ 
C Special 
case 
for offset=2 
(next-to-last 
stage): 
no complex 
multiplies, 
cc 
Simple 
add. 
(Performance 
enhancement) 
cc For half 
the butterflies, 
W=(l,O). 
For the other half, 
W=(O,-l) 
IF 
(offset 
.eq. 2) THEN 
CVD$ 
NODEPCHK 
DO 90 i = 1, (4*groups) ,4 
iplus = i + 2 
temp = a(iplus) 
a(iplus) 
= a(i) 
- temp 
a(i) = a(i) + temp 
call to i-loop: 
w=cmplx(O,-l.) 
NODEPCHK 
NOVECTOR 
92 i = 2, (4*groups) .4 
iplus = i + 2 
temp = a(i) - a(1plus) 
a(i) = a(i) +-a(iplus) 
92 
a(iplus) 
= CMPLX(AIMAG(temp).-REAL(temp» 


90 
C 
2nd 
CVD$ 
CVD$ 
DO 


• 
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ELSE 
C------------ 
c "DO 20" indexl-loop 
is "outer loop" 
CVD$ 
VECTOR 
CVD$ 
NODEPCHK 
DO 20 indexl = l,(2*offset*groups),(2*offset) 
j = 1 
NODEPCHK 
ALTCODE 
DO 10 i = indexl,(indexl+o~fset-l) 
iplus = i + offset 
temp = a(i) - a(iplus) 
a(i) = a(i) + a(iplus) 
a(iplus) = w(j) * temp 
j = j + wincr 


CVD$ 
CVD$ 


10 
20' CONTINUE 
ENDIF 
ENDIF 
RETURN 
END 
cccccccccccccccccccccccccccccccccc 
subroutine 
fetch(a,n) 
integer n 
complex a(n) ,temp 
cc Kludge 
do-nothing 
prefetch. 
temp = a(l) 
RETURN 
END 


. 


cccccccccccccccccccccccccccccccccc 
subroutine 
bitrev(a,dummy,n) 
C Bit-Reverse 
C Inputs: 
C 
A= complex array of input, single-prec 
float 
C 
dummy = ~val(m). 
Probably unusable 
from Fortran. 


C 
N = number 
of input points 
(and output points) 


C Ouput: 
C 
A = original A data, but in bit-reversed 
order from A 


integer n,i,j,k,ndiv2 
complex a(n) ,temp 
C------------ 
C "DO 7" loop to in-place-bit-reverse-shuffle 
output 
j=l 
ndiv2 = n I 2 
DO 7 
i= 1, n-l 
IF (i .It. j) THEN 
temp = a(j) 
a(j) = a(i) 
a(i) = temp 
ENDIF 
k = ndiv2 
C 
"While 
(j .gt. k)" ,*decrease 
j by 2**something 
*' 
6 
IF (j .gt. k) THEN 
j = j-k 
k = k , 
2 
GOTO 6 
ENDIF 
C 
Add next lower power of 2 to j 
7 
j = j+k 
RETURN 
END 
C------------ 
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11-------------- 
I! bitrev.ss 
11 
(C) Copyright 
1989 INTEL 
Corporation. 
ALL RIGHTS 
RESERVED. 
I! 
11 
BIT-reversal 
of 8byte 
array 
elements. 
I! 
IN PLACE. 
11 
(Allows arrays 
of 8,16,32,64,128,,256,512, 
or 1024 elements) 
11-------------- 
11 
INTEL 
is not responsible 
for use nor misuse 
of this 
code. 
11-------------- 
11 8/13/89 
11=================================== 
11 
Invocation: 
(from Fortran) 
11 call bitrev(a,%VAL(m)) 


I!I!I!I!I!I!I! 
11 Expected 
best-can-do 
performance, 
and measured 
performance= 
11 
approx 
4*N clocks 
(0.06 mSec 
for 512 points) 
11------------------- 
define (astart, 
r16) 
Ilinitial 
input 
data base 
address 
define (m, r17) 
define (logN,r17) 
define (destl,r19) 
define (dest2,r20) 
define (dest3,r21) 
define (dest4,r22) 
define(iptr, 
r23) 
Ilindex-array 
pointer 


Inputs: 
a = r16 = pointer 
to array 
of 8byte 
elements 
m = r17 
(call by value)= 
base-2 
log of total 
number 
of elements 
(2**m = N) 
Outputs: 
a= Bit-reversed 
ordered 
version 
of A 


define(decrem,r24) 
Ilbla 
decrement 
define (count ,r25) 11 bla counter 


.text 
.align 
.quad 
11'============================= 
_bitrev_:: 
_bitr_:: 
Ilfetch base address 
for index 
table 
(rbasetab) 
11 base-addr-table 
elements 
= 
(baseaddr, 
number_of_swaps-2) 
11 base-addr-table 
indexed 
by logN. 
shl 
3,logN,r30 
Ilscale 
to 8-byte-entry 
length 
mov 
rbasetab,r29 
ld.l 
r29(r30), 
iptr 
addu 
4,r29,r29 
ld.l 
r29(r30), 
count 
Ilnumber 
of swaps 
required 
for this value 
N 


pfld.d 
pfld.d 
adds 
pfld.d 


O(iptr),fO 
Ilinitiate 
fetch 
of first 
2 bit-rev 
indices 
8(iptr)++,fO 
-2,rO,decreml12 
swaps 
per loop 
8(iptr)++,fO 


• 


bla 
decrem, count , revloop 
Ilinit 
LCC 
pfld.d 
8(iptr)++,f16 
Ilget 2 indices, 
but don't 
cache 
the indices 
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revloop:: 
112 swaps per loop 
117.5 cycles 
consumed 
for each 
swap, best 
case. 
pfld.d 
8(iptr)++,f18 
112 more 
indices 
fxfr f16,destl 
Iltransfer 
to integer 
index regs 
fxfr f17,dest2 
fld.d 
destl 
(astart),f24 
Ilfetch 2 elements 
to swap 
fld.d 
dest2 
(astart),f26 
fxfr f18,dest3 
fst.d 
f24, dest2 
(astart) 
fst.d 
f26, dastl 
(astart) 
fxfr f19,dest4 
fld.d 
dest3 
(astart) ,f28 
fld.d 
dest4 
(astart),f30 
pfld.d 
8(iptr)++,f16 
112 more 
indices 
fst.d 
f28, dest4 
(astart) 
bla decrem,count, 
revloop 
11 
fst.d 
f30, dest3 
(astart) 


bri 
rl 
nop 
11--------------- 
11 _fetch8_: 
Touch 
all 32-byte 
lines 
in the 8k data bytes, 
to get them 
11 
into dcache. 
(ASSUMING 
.lte. 8Kbytes 
and 
.gte. 4Kbytes) 
1/ 
11 
Invocation= 
fetch(astart,num8) 
11 
Inputs= 
11 
astart=r16=pointer 
to data which 
is to be touched. 
11 
num8=r17 
(passed by VALUE, 
%VAL(), 
not by reference) 
11------------ 
11 Using 
RC and RB to improve 
dcache 
hit rates, 
for FFTs bigger 
than 
11 
1024 complex 
(8kB). 
11 
RC=lO 
causes 
replacement 
only of block 
denoted 
by RB Isbit. 
RC=ll 
disables 
11 
replacement. 
11-------- 
define (num8,r17) 
define (FEtch, r26) 


_fetch8_: : 
_fetch_: : 
Id.c dirbase,r30 
or Ox800,r30,r30 
11 Replace 
Dcache 
slot 0 only 
(RC=lO,RB=OO) 
st.c r30,dirbase 
11 Put 4Kbytes 
into Dcache 
slot O. 
(The rest after 
4kB goes to slotl). 
adds 
-4,rO,decrem 
114 8-byte-groups 
per cache line 
adds 
508,rO,count 
11512, but pre-decremented 
for bla usage 
bla 
decrem,count,floop 
adds 
-32,astart,FEtch 
floop: : 
bla 
fld.d 
decrem,count,floop 
32(FEtch)++,f30 
Ildummy 
load. 


adds 
-512,num8,count 
bc 
fdone 
Ilif data exhausted, 
quit 
11 Id.c dirbase,r30 
or Ox900,r30,r30 
11 Replace 
Dcache 
slot 1 only 
(RC=lO,RB=Ol) 
st.c r30,dirbase 
• ' 
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11 


-8.count.count 
Ilpredecr for bla 
decrem.count.floop2 
lIset LCC 
32 (FEtch)++.f30 


adds 
bla 
fld.d 
floop2: : 
bla 
fld.d 
fdone: : 
11 unlock dcache 
andnot OxFOO.r30.r30 
st.c r30.dirbase 
bri 
rl 
nop 


decrem.count.floop2 
32(FEtch)++.f30 
Ildummy 
load. 


Ilclear RC.RB 
(dirbase(11:8» 


.data 
11--------------- 
11 rbasetab:: 
(Table of bit-reversed 
indices for bitrev subroutine) 
11 
base-addr-table 
elements = 
(baseaddr. number_of_swaps-2) 
11 
base-addr-table 
indexed by logN • 
•align 
.quad 
rbasetab: : 
.long [6]0 Ildon't bother with log(n)=0.1.2 
.long rev8. 
0 
.long rev16. 
4 
.long rev32. 
10 
.long rev64. 
26 
.long rev128. 
54 
.long rev256. 
118 
.long rev5l2. 
238 
.long revl024. 494 
1/'======= 


Ilnumber of swaps=240 for N=512 
(ie. 32 symmetrical patterns 
11 
exist between 0 and 511.) 
11rev512: array of bit-reversed 
indices. for N=512. 
11 
Each entry is ("i". and "bit-reversed-i"). 
shifted left by 3 
11 
to account for 8-byte-elements. 
II 
NOTE: 
This listing DOES NOT SHOW 
all the table elements. to save paper. 


•align 
.quad 
rev5l2: : 
.long 
8. 2048. 
.long 
24. 3072. 
.long 
40. 2560. 
11 ETC •••• ETC ••••• 
1/'===== 
.align .quad 
revl024: : 
.long 
8. 4096. 
.long 
24. 6144. 
.long 
40. 5120. 
.long 
56. 7168. 
11ETC •••• ETC ••••• 


16. 1024 
32. 
512 
48. 1536 
ETC ••• 


16. 2048 
32. 1024 
48. 3072 
64. 
512 
ETC ••• 
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//Number 
of swaps = 496 
//N 
(Number of elements) 
= 1024 
1/-===== 
.align 
.quad 
rev16: : 
.long 1*8.8*8.2*8.4*8 
.long 3*8.12*8.5*8.10*8 
.long 7*8.14*8.11*8.13*8 
rev8: : 
.long 1*8.4*8.3*8.6*8 
1/====== 
.align 
.quad 
rev32: : 
.long 
.long 
1/====== 
.align 
.quad 
rev64: : 
.long 
8. 
256. 


.long 
24. 
384. 
.long 
40. 
320. 
.long 
56. 
448. 
// ETC •••• ETC ••••• 
1/====== 
•align 
•quad 
rev128: : 
.long 
8. 
512. 
16. 
.long 
24. 
768. 
32. 
.long 
40. 
640. 
48. 
.long 
56. 
896. 
72. 
// ETC •••• ETC ••••• ETC ••• 
//Number 
of swaps = 56 
(Number of elements) 
=128 
1/====== 
.align 
.quad 
rev256: : 
.long 
8. 1024. 
16. 
.long 
24. 
1536. 
32. 
.long 
40. 
1280. 
48. 
.long 
56. 
1792. 
64. 
// ETC •••• ETC ••••• ETC ••• 
//Number 
of swaps = 120. N 
(Number of elements) 
= 


8. 128.16. 
64. 24. 
192. 40. 
160. 48. 
96. 56. 224 
72. 144. 88. 208. 
104. 
176. 120. 240. 
152. 200. 
184. 232 


16. 
128 
32. 
64 
48. 
192 
72. 
288 
ETC ••• 


256 
128 
384 
576 


512 
256 
768 
128 


256 
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r~~~~ 
\~=~U~~.M:~U) 
PARAMETER 
(N=5l2.M= 
9) 
c 
PARAMETER 
(N=256.M= 
8) 
c 
PARAMETER 
(N=128.M= 
7) 
c 
PARAMETER 
(N=64.M= 
6) 
c 
PARAMETER 
(N=32.M= 5) 
c 
PARAMETER 
(N=16. M=4) 
PARAMETER 
(PI=3.l4l5926536) 
COMPLEX 
X(N).Xl(N) .X2(N) .X3(N). W(N/2) 
c Fortran 
complex 
values 
stored 
R.I. R.I 
for arrays. 
Real 
ASQR(N).ASQR2(N).XR(N) 
complex 
wtemp 
real 
rtemp 
•• 


C 


PRINT 
*.' FFT test program 
(ffttest.f) 
print 
*. '===--========--=====--=====' 
IF 
(IREV .eq. 0) THEN 
print 
*.'NOT 
counting 
time 
for bit-reversal.' 
print 
*.'00 NOT expect 
matching 
answers.without 
bit-rev' 
ELSE 
print 
*. 'Time 
for bit-reversal 
included.' 
ENDIF 


print 
*. 'Time for cache writeback 
and 
IF 
(CACHETIME 
.eq. 0) THEN 
print 
*.' NOT 
included. 
if iterating.' 
ELSE 
print *.' 
included. ' 
ENDIF 


fills •••' 


print 
*· 
print 
*· 
'If iterating ••• Number 
of Iterations 
=' .IT 
print *· 
------ 
print 
*· 
'Number 
of Points 
= ,. N 
print 
*· 
'('.REALLY.' 
data) , 
print 
* ,· 
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C------------------ 
C Init twiddle 
factor array w(k) with 
(cos,-sin) 
of 2pi*k/N 
C 
(Should just declare 
this as constant, 
if N is non-variable) 
C (OR could have 
one constant 
512-entry 
W 
(for N=1024), 
adjust 
wincr 
accordingly 
C 
in diff.f 
for smaller 
N) 
rtemp = 2.0*pi/N 
wtemp= 
CMPLX(cos(rtemp), 
-sin(rtemp)) 
w(l) = 
(1.0, 0.0) 
DO 
200 
k 
= 
2,N/2 
200 
w(k) = wtemp 
* w(k-l) 
cc print *,' 
W 
(twiddle) 
initialization 
completed •••••• ' 
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 
C 
INITIALIZE 
input data 


C 
PIN = 
(4*PI)/ N 
DO 100 I = 1, N 
c 
For testing 
with 
sinewave 
input data: 
c 
Treal = COSt I*PIN) 
c 
Timag = SIN ( I*PIN) 


c 
For testing 
with 
squarewave 
input: 
cc IF 
(I .It. N/2) THEN 


cc 
Treal = 1.0 
cc 
Timag = 0.5 
cc ELSE 
cc 
Treal = 0.0 
cc 
Timag = 0.0 
cc ENDIF 
C 
For testing 
with 
ramp function 
input 
data: 
Treal = I - 1.0 
Timag = Treal + 0.5 
X(I) = CMPLX 
(Treal, Timag) 
Xl(I) 
= CMPLX 
(Treal, Timag) 
X2(I) 
= CMPLX 
(Treal, Timag) 
X3(I) 
= CMPLX 
(Treal, Timag) 
100 
CONTINUE 
C 
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 
IF 
(TIMEIT 
.ne. 0) THEN 


CALL fft 
(X2, M, N) 
cc Subroutine 
fft is Decimation-In-Time, 
Fortran 
version. 


c 
CALL ditt(X, 
M, N,W,IREV) 
CALL diff(X, 
M, N,W,IREV) 
ENDIF 


ccccccccccccccccccccccccccccccccccccccc 
IF 
(IREV .ne. 0) THEN 
IF 
(TIMEIT 
.eq. 0) THEN 
call vcompare(X,X2,2*N) 
call cmags(X,N,ASQR) 
c cmags 
to take 
squared 
magnitude 
of complex 
values 
call cmags(X2,N,ASQR2) 
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c----------------------c 
C print 
non-zero 
results: 
J=O 
DO 700 
I = 1,N 
IF 
«ASQR(I) 
.GT. 1.0) 
.OR. 
(ASQR2(I) 
.GT. 1.0» 
THEN 
WRITE 
(6,22) 
(1-1), ASQR(I), 
ASQR2(I) 
22 
FORMAT 
(' 1-1=',14,' 
ASQR(I)= 
',F14.2, 
, ASQR2(I)= 
',F14.2//) 
J = J+l 
IF 
(J 
.GT. 32) 
GOTO 
725 
ENDIF 
700 
CONTINUE 


725 
CALL 
TIME 
ENDIF 
ENDIF 


IF 
(TlMEIT 
.ne. 0) THEN 
ccccccccccccccccccccccccccccccccccccccc 
cc- 
Timing 
loop 
follows: 


print *,' 
Start 
Ass.FFT' 
IF 
(CACHETlME 
.eq. 0) THEN 
DO 500 
I = 1, IT,4 
C Reuse 
same 
array, 
so cache 
fill 
and 
CALL 
diff(X, 
M, N,W,IREV) 
CALL 
diff(X, 
M, N,W,IREV) 
CALL 
diff(X, 
M, N,W,IREV) 
CALL 
diff(X, 
M, N,W,IREV) 


writeback 
time 
NOT 
included. 


500 
ELSE 
DO 504 
I = 1, IT,4 
C Alternating 
between 
X,Xl,X2,X3 
should 
CALL 
diff(X, 
M, N,W,IREV) 
CALL 
diff(Xl, 
M, N,W,IREV) 
CALL 
diff(X2, 
M, N,W,IREV) 
CALL 
diff(X3, 
M, N,W,IREV) 


provide 
cache 
misses. 


504 
ENDIF 
print 
*,' 
END Ass. 
FFT' 
ccccccccccccccccccccccccccccccccccccccc 
ENDIF 
STOP 
END 
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c----------------------c 
subroutine 
vcompare(res,exp,n) 
c VCOMPARE 
compares 2 REAL vectors, 
prints 
out 1st fey miscompares 
c 
integer n, errcnt 
real res(n), 
exp(n) 


Yrite(6,12) 
12 
format('*** 
VCOMPARE: 
vector 
comparison 
beginning 
***') 


data errcnt/OI 
. 


do 30 i = l,n 
if(AINT(res(i» 
.ne. AINT(exp(i») 
then 
c 
(print out error, exit if alot already} 
120 
print *,'*** Error 
in compares 
***' 
write(6,121) 
i 
121 
format(' 
Item number = 
',16) 
Yrite(6,124) 
res(i), 
exp(i) 
124 
format(' 
Res_=',F14.2,' 
Expected_=',F14.2) 
errcnt = errcnt + 1 
if (errcnt .gt. 19) then 
return 
end if 
end if 
30 
continue 


if (errcnt .eq. 0) then 
190 
print 
*,' *** vector 
compares 
SUCCESSFUL 
***' 
end if 


99 return 
end 
c----------------------c 
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C--------------- 
C File: 
dit t.f 
C 
6/15/89 


C Intel 
assumes 
no responsibility 
for use 
or misuse 
of this 
code. 


FFT 
- Decimation 
in TIME. 
radix-2. 
inplace. 
l-dimen' 


Inputs: 
A= 
complex 
array 
of input. 
up to 1024 
pts. 
single-prec 
float 
M= log 
of number 
of pts 
= 
(Number 
of stages 
of FFT) 
N = number 
of points. 
ie. N= 2**M 
= number 
of pts 
W= 
complex 
array 
of twiddle 
factors. 
length=N/2. 
REV= 
ignored 
parameter. 


C 
C 
C 
C 
C 
C 
C 
C 
C 
C Outputs: 
C 
A= 
complex 
fft of input 
A. 
Correct 
order 
(bit-reversal 
done). 
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 


subroutine 
ditt(a.m.N.W.REV) 
integer 
m.N. 
i. REV.wlimit 
integer 
offset. 
stage. 
groups. 
wincr.powers2(0:10) 
complex 
a(n).w(N/2).temp 


data 
powers2 
/1.2.4.8.16.32.64.128.256.512.1024/ 
C Powers2 
to avoid 
calls 
to POW. 
DIV 


C Twiddle 
factor 
array 
w(i) 
has 
(cos.-sin) 
of 2pi*i/N 
CC Assume 
the 
caller 
provides 
w(i) 
constants 
ALREADY 
initialized 
C------------ 
C Pre-touch 
data. 
lock 
into 
cache. 
for 8kByte 
fft: 
IF 
(N .gt. 513) 
THEN 
call 
fetch(a.%VAL(n)) 
, 
ENDIF 
C------------ 
call 
bitrev(a.%VAL(M) 
.n) 
C Bitreversal 
of input 
needed 
for in-place 
decim 
C 
fetching 
twiddle-factors 
in bitrev 
order. 
wlimit 
= 8*((N/2) 
- 1) 


in time 
FFT. 
to avoid 


\,.,... • 


DO 20 stage 
= l.m 
groups 
= powers2(m-stage) 
C groups=number 
of times 
the 
twiddle 
factors 
are used. 
ie. the number 
of 
C 
smaller 
DFTs 
the 
stage 
is split 
into. 


C 
offset 
gets 
1.2.4.8 ••••N/2 
offset 
= powers2(stage-l) 
wincr 
= groups 
call 
ditstep(a.w.groups.offset.wincr.wlimit) 
20 CONTINUE 


RETURN 
END 
C------------ 
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11-------------- 
II ditstep.ss: 
do one stage 
of fft butterflies 
II DIT = Decimation 
in Time, 
radix-2, 
inplace, 
l-dimension 
II 
(C) Copyright 
1989 
INTEL 
Corporation. 
ALL RIGHTS 
RESERVED. 
II 7/15/89 
. 


11------------ 
II Intel 
is not 
responsible 
for use nor 
for misuse 
of this 
program. 
11------------ 
II Do one entire 
stage 
(n/2 butter~lies). 
Sample 
invocation: 
II 
call 
ditstep(a,w,groups,offset,wincr,wlimit) 
/!--================ 
II Inputs: 
II 
A= 
complex 
array 
of input, 
single-prec 
float 
II 
(complex 
stored 
as 4byte 
real, 
4byte 
imag 
contiguously) 
II 
W= pointer 
to array 
of twiddle 
factors. 
Assuming 
W(k) 
is 
II 
CMPLX(cos(2pi*k/N)),-sin(2pi*k/N)) 
for k=O to 
(N/2)-1. 
II 
offset 
= distance 
(except 
for scale-by-8byte 
sizeof(complex)) 
between 
lIthe 
2 input 
values 
for each 
butterfly. 
II 
Offset 
also 
is the number 
of butterflies 
done 
per 
"group". 
II 
groups 
= N/(2*offset). 
The number 
of sub-DFTs 
this 
stage 
is split 
into. 
II 
wincr 
= distance 
(except 
for scale-by-8byte 
sizeof(complex)) 
between 
II 
successive 
w values 
for successive 
butterflies 
II 
wlimit 
=max 
index, 
in bytes, 
of W table. 
/!/! Outputs: 
II 
A= 
complex 
radix-2 
butterflied 
version 
of input. 
/! 
11------------------- 
define (astart , r16) 
II input 
data 
base 
address 
define (wstart,r17) 
Iltwiddle 
array 
ptr. 
Because 
w-contents 
depend 
on N, 
II we will 
assume 
the 
caller 
has 
initialized 
w() 
array. 
define (groups,r18) 
Ilgroups=number 
of sub-DFTs 
this 
stage 
is split 
into. 
define(offset,r19) 
Iloffset 
(initially 
elements, 
mult 
by 8 to get bytes) 
II between 
node 
and 
its dual 
(the 2 numbers 
to butterfly, 
ie. A and B) 
define (wincr,r20) 
Ilincrement 
between 
successive 
W values. 
Remains 
constant 
II within 
a given 
stage. 
define (wlimit ,r21) Ilmax index, 
in bytes, 
of W table. 
define (wind,r22) 
Ilcurrent 
index, 
in bytes, 
of W table. 
define(offset2,r23) 
Iloffset*2 


define (decrem,r24) 
lIbla decrement 
define (somecount,r25) 
II bla 
counter 


define (FEtch, 
r26) 
define (STore,r27) 
Ilpointer 
to 1st component 
of butterfly 
(load) 
II " " 1st component 
of butterfly 
(store) 


define (offsetp8,r28) 
Iloffset+8 
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11 f4:f7 
spare 
define (ARe,fl2) 
define (Ale,fl3) 
define (ARo,fl4) 
define (Alo,fl5) 
define (BRe,fl6) 
define(Ble,fl7) 
define(BRo,fl8) 
define(Blo,fl9) 


Ilelement 
A, real 
component 
I1 " ", imag 
II extra A value, 
for prefetch 
(o="odd") 


Ilelement 
B, real 
component 


II extra 
B value, 
for prefetch 


define (ERe,f20) 
define(Ele,f21) 
define (ERo,f22) 
define (Elo,(23) 


IIA+(B*W), 
'real (ER = AR + BR) 
II" 
imag" 
II previous 
loop's 
value 
11" 
imag" 


define (FRe ,(24) IIA-(B*W), 
real 
define(Fle,f25) 
11 " 
imag 
" 
define (FRo,f26) 1I previous 
loop's 
value 
define(Flo,f27) 
11" 
imag" 


define (PR, (28) 
II(B*W), 
real 
define(PI, 
(29) 
II(B*W), 
imag 


define (WRe,f30) 
define (Wle,(31) 
IIW 
(twiddle 
factor), 
real part 
II " " , imag 


define (WRo,flO) 
define (Wlo,fll) 
IIW 
(twiddle 
factor), 
real part 
(EXTRA copy) 
II 
" " , imag 


.text 
.align 
.quad 
_ditstep_ :: 
Id.l 
O(groups) ,groups 
Id.l 
O(offset) ,offset 
shl 
3,offset,offset 
shl 
1,offset,offset2 
adds 
8,offset,offsetp8 


Ilfix 
Fortran 
call-by-ref 
11 
II change 
from elements 
to bytes 


fst.q 
fst.q 
f8 ,-16(sp)++ 
Ilsave 
"local" 
regs 
f12,-16(sp)++ 
II " " 


adds 
adds 
-l,groups,groups 
II pre-decrement 
for bnc usage, 
or bla usage 
-16,rO,decrem 
Ilbla 
decrement 


II 
We code the last 2 stages 
as special 
cases: 
11-------- 
8,offset,rO 
Iloffset=l, 
~pecial case, no complex mult, 
funny addressing 
offset_lll 
(ASSUMING 
offset=l 
means 
wincr=O, 
and no twiddle 
used) 
l6,offset,rO 
Iloffset=2, 
special 
case, 
no complex 
mult 
offset_2 


xor 
be 
xor 
be 
11-------- 
Id.l 
O(wincr),wincr 
Id.l 
O(wlimit) ,wlimit 
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pfadd.ss 
fO,fO,fO 
pfadd.ss 
fO,fO,fO 
pfadd.ss 
fO,fO,fO 
II init Al,A2,A3=O 
pfmul.ss 
fO,fO,fO 
pfmul.ss 
fO,fO,fO 
pfmul.ss 
fO,fO,fO 
11-------- 
II init pointers: 
shl 
3,wincr,wincr 
Ilscale 
for bytes. 
shl 
l,wincr,wind 
Ilinit wind =2*wincr 


pfld.d 
0 
( wstart) ,fO 
pfld.d 
wincr 
( wstart) ,fO 
adds 
-8,astart,FEtch 
pfld.d 
wind 
(wstart),fO 
adds 
wincr,wind,wind 
Ilwind 
now 3*wincr 
II here 
fetch 
first 
set of B,W before 
bla-loop 
pfld.d 
wind 
(wstart),WRe 
adds 
wincr,wind,wind 
Ilfirst 
Bfetch 
from offset, 
then 
1st afetch 
from O. 
fld.d 
offsetp8 
(FEtch),BRe 
Ilfirst 
B value 


and 
wlimit,wind,wind 
Ilmodulo-wlimit 
the w index 
II We do modulo-addressing 
on W(), 
to keep the pfld pipeline 
full. We 
II 
never 
do a W-fetch 
beyond 
the end of the table. 
II And the modulo-check 
needs 
to be done only every 4th pfld, 
as always 
II 
we use a multiple 
of 4 W() 
factors. 


d.r2apl.ss 
fO,fO,fO 
Ilclear 
Treg. 
adds -32,offset,somecount 
II bla counter 
(predecrement 
by 4 elements) 
II ----------- 
II Definitions 
for pipe diagram: 
II 
Anew = E = A+(B*W) 
II 
Bnew = F = A-(B*W) 
II 
Let P=(B*W). 
11-------------- 
II 
(the complex 
multiply 
product, 
P, broken 
into 4 real mult and 2 adds) : 


II 
WR = cost), WI=-sin(). 
II 
PR = K - L; where 
K= WR*BR, 
L=WI*BI 
II 
PI = N + M; where 
N= WI*BR, 
M=WR*BI 
II 
ER = AR + PR 
(Overwrites 
AR) 
/I 
El = AI + PI 
(. 
AI) 
/I 
FR = AR - PR 
( 
BR) 
/I 
FI = AI - PI 
( 
BI) 


II For 1st time thru 
inner--loop, 
don't 
II Must 
do 1 loop before 
the loop, 
sans 
11----------------- 
first_bfly:: 
Ilfill 
pipe 


have 
correct 
values 
to store. 
the stores. 
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T 
Al ••••A2 ••••A3 ••••Write 
// KR •••KI •••Ml ••••M2 ••••M3 
d.r2pt.ss 
WRe,fO,fO 
// WRe 
pfld.d 
wind 
(wstart) ,WRo 
d.i2st.ss 
Wle,fO,fO 
// 
adds 
wincr,wind,wind 
d.r2apl.ss 
fO 
,BRe,fO // 
fld.d 
8 
(FEtch)++,ARe 
//first 
d.pfmul.ss 
Wle,Ble,fO 
// 
pfld.d 
wind 
(wstart) ,WRe 
d.r2pt.ss 
WRo,Ble,fO 
// WRo 
fld.d 
offsetp8 
(FEtch),BRo 
d.ratls2.ss 
fO 
,PR ,fOIl 
adds 
wincr,wind,wind 
d.i2st.ss 
Wlo,BRe,fO 
// 
nop 
If •••••••••••••.•••••••••••••••••••••••••••••••••••••••••••••••••• 
d.r2apl.ss 
fO 
,BRo,fO // 
Kl 
NO 
MO 
PRO 
and 
wlimit,wind,wind 
d.pfsub.ss 
fO 
,PI ,fO // 
fld.d 
8 
(FEtch)++,ARo 
d.pfadd.ss 
ARe,PR 
,PR // 
fld.d 
offsetp8 
(FEtch) ,BRe 
d.pfmul.ss 
Wlo,Blo,fO 
// 
nop 
d.r2pt.ss 
WRe,Blo,fO 
// WRe 
loll 
Ll 
Kl 
bla 
decrem,somecount,restart 
//init LCC 
d.ratls2.ss 
ARe,PR 
,fOIl 
loll 
Ll 
nop 
restart: : 
d.i2st.ss 
Wle,BRo,ERe// 
adds 
-16,astart,STore 
//------------------- 
// Each 
butterfly 
= 1 complx multiply, 
1 complx 
add, 1 complx 
subtract 
// = 
4 multiply, 
3 add, 3 subtract 
// 
3 8-byte 
fetches 
(A, B, W) 
// 
2 8-byte 
stores 
(A, B) 
11 
// 7 cycles per butterfly 
11 
// inner_loop: 
iterates 
"offset/2" 
times 
// 
for each group. 
It does 2 butterflies 
per iteration 


Wle 


KO 
A value 
LO 
KO 


PRO 


PRO 


ERO 


ERO 


inner_loop:: 
// KR •••KI •••Ml ••••M2 ••••M3 
T 
Al ••••A2 ••••A 3••••Write 
d.r2apl.ss 
Ale,BRe,PI 
// 
K2 
Nl 
loll EIO 
PRl 
FRO 
PlO 
pfld.d 
wind 
(wstart) ,WRo 
d.pfsub.ss 
Ale,PI 
,FRell 
K2 
Nl 
loll FIO 
EIO 
PRl 
FRO 
fld.d 
8 (FEtch)++,ARe 
d.pfadd.ss 
ARo,PR 
,PR // 
K2 
Nl 
loll ERl 
FIO 
EIO 
PRl 
fld.d 
offsetp8 
(FEtch),BRo 
d.pfmul.ss 
Wle,Ble,fO 
// 
L2 
K2 
Nl 
loll ERl 
FIO 
EIO 
adds 
wincr,wind,wind 


MO 
LO 
KO 


MO 
LO 
KO 


Wlo 
NO 
MO 
KO 
K-LO 


NO 
MO 
Kl 


Kl 
MO 
ERO 
NO 


Ll 
Kl 
NO 
MO 
ERO 


MO 
M+NO 
ERO 


FRO 
PlO 
Kl 


Wle 
Nl 
loll 
Kl 
K-Ll 
FRO 
PlO 
// ptrs 
init 16 low, for fst.q 
instructions 


// AR/AI 
fetches 
need to be a cycle behind 
BR/BI 
fetches 
here. 
So we 
// 
must 
index with 
offset+8 
into B. 
// AR is used 
1/2 loop before 
AI. 
/1 Pattern= 
AIO,AR1,BR2,BI2;AI1,AR2,BR3,BI3. 
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112 
L2 
K2 
intel~ 


II+Nl ERl 
FIO 
EIO 


K2 
FRl 
Pll 
ERl 
FIO 


K2 
K-L2 
FRl 
Pll 
ERl 


T 
Al ••••A2 ••••A3 ••••Write 
112 Ell 
PR2 
FRl 
Pll 


112 Fll 
Ell 
PR2 
FRl 


112 ER2 
Fll 
Ell 
PR2 


M2 
ER2 
Fll 
Ell 


M+N2 
ER2 
Fll 
Ell 


.r 


K3 
FR2 
PI2 
ER2 
Fll 


K3 
K-L3 
FR2 
PI2 
ER2 


d.r2pt.ss 
WRo,Ble,Ele 
11 WRo 
pfld.d wind 
(wstart) ,WRe 
d.ratls2.ss 
ARo,PR 
,Flell 
adds 
winer,wind,wind 
d.i2st.ss 
Wlo,BRe,ERol1 
and 
wlimit,wind,wind 
11 
Alo,BRo,PI 
11 


M2 


Wlo 
N2 
Ilmodulo. 
KR •••KI •••Ml ••••M2 ••••M3 
K3 
N2 
d.r2apl.ss 
nop 
d.pfsub.ss 
Alo,PI 
,FRol1 
fld.d 
8 
(FEteh)++,ARo 
d.pfadd.ss 
ARe,PR 
,PR 11 


K3 
N2 


K3 
N2 


fld.d offsetp8 
(FEteh),BRe 
d.pfmul.ss 
Wlo,Blo,fO 
11 
nop 
d.r2pt.ss 
WRe,Blo,Elo 
11 WRe 
fst.q ERe,l6(STore)++ 
Ilupdate 
d.ratls2.ss 
ARe,PR 
,Flol1 
bla deerem,someeount, 
inner_loop 
d.i2st.ss 
Wle,BRo,ERel1 
Wle 
fst.q 
FRe, offset 
(STore) 
Ilupdate 
FRe/Fle/FRo/Flo 


L3 
K3 


M3 
L3 
K3 
ERe/Ele/ERo/Elo 
M3 
L3 


N3 


L2 


M2 


N2 


M3 


end_inner_loop:: 
IIKEEP Pipelines 
full 
11 RE-init 
pointers 
for fetches 
d.fiadd.ss 
fO,fO,fO 
adds 
offset2,astart,astart 
Ilbump to next group 
Ilredo A,B 
fetches, 
with proper 
ptr. 
d.fiadd.ss 
fO,fO,fO 
fld.d 
offset 
(astart),BRe 
Ilget first BR/BI 
in next group 
d.fiadd.ss 
fO,fO,fO 
adds 
-8 ,astart ,FEtch 


last_bfly:: 
lIdo final 2 
11 
d.r2apl.ss 
Ale,BRe,PI 
11 
pfld.d 
wind 
(wstart),WRo 
d.pfsub.ss 
Ale,PI 
,FRel1 
fld.d 
8(FEteh)++,ARe 
d.pfadd.ss 
ARo,PR 
,PR 11 
fld.d 
offsetp8 
(FEteh),BRo 
d.pfmul.ss 
Wle,Ble,fO 
11 
adds 
winer,wind,wind 
d.r2pt.ss 
WRo,Ble,Ele 
11 WRo 
pfld.d 
wind 
(wstart),WRe 
d.ratls2.ss 
ARo,PR 
,Flell 
adds 
winer,wind,wind 
d.i2st.ss 
Wlo,BRe,ERol1 
Wlo 
NO 
and 
wlimit,wind,wind 
Ilmodulo 
11· •••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• 
d.r2apl.ss 
Alo,BRo,PI· 11 
Kl 
NO 
110 EI3 
PRO 
adds -32,offset,someeount 
11 reset bla counter 
d.pfsub.ss 
Alo,PI 
,FRo11 
Kl 
NO 
fld.d 
8 
(FEteh)++,ARo 


b~tterflies, 
start next group 
KR •••KI •••Ml ••••1I2••••M3 
T 
Al ••••A2 ••••A3 ••••Write 
KO 
N3 
113 EI2 
PR3 
FR2 
PI2 


KO 
N3 


KO 
N3 


LO 
KO 


MO 
LO 


MO 
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113 FI2 
EI2 
PR3 
FR2 


M3 
ER3 
FI2 
PR3 
EI2 


N3 
M3 
ER3 
FI2 
EI2 


KO 
M+N3 
ER3 
FI2 
EI2 


LO 
ER3 
FI2 
KO 
FR3 
PI3 


MO 
FR3 
PI3 
ER3 
KO 
K-LO 


FR3 
PI3 


110 
EI3 
FR3 
FI3 
PRO 


d.pfadd.ss 
ARe.PR 
.PR 11 
Kl 
NO 
MO 
ERO 
FI3 
EI3 
PRO 
fld.d 
offsetp8 
(FEtch) .BRe 
d.pfmul.ss 
Wlo.Blo.fO 
11 
Ll 
Kl 
NO 
MO 
ERO 
FI3 
EI3 
bla 
decrem.somecount.nowhere 
Ilre-1n1t LCC=l 
d.r2pt.ss 
WRe.Blo.Elo 
11WRe 
Ml 
Ll 
Kl 
M+NO 
ERO 
FI3 
EI3 
adds 
-l.groups.groups 
nowhere: : 
d.ratls2.ss 
ARe.PR 
.Floll 
Ml 
Ll 
Kl 
FRO 
PlO 
ERO 
FI3 
fst.q 
,ERe.16(STore)++ 
d.fnop 
bnc.t 
restart 
Ilbranch 
on value 
of groups 
d.fnop 
fst.q 
FRe. 
offset 
(STore) 


end_last_bfly: 
: 
d.fnop 
br end1t 
fiadd.ss 
fO.fO.fO 
fst.q 
FRe. 
offset 
(STore) 
Ilrepeated 
for bnc.t 
untaken 
case 
.align 
•quad 
1/= 
offset_l: : 
11want-FEtch=0.2.4.6.8 
•••• elements. 
ASSUMING 
wincr=O. 
11 
and that w=(1.0). 
so that 
no complex 
mult 
needed. 
11 E=A+B. 
F=A-B. 
(Per double-butterfly 
loop: 
8 pfadd.4 
dword 
fld. 4 fst. 
11 
1 bla) (fld.q used 
to reduce # flds) 
11 Performance 
= 4 cyc/bfly 
best 
case. 


IIRedefine 
regs 
for fld.q.fst.q 
usage. 
when 
A and B adjacent: 
define (AR3.f12) 
Ilelement 
A. real 
component 
def1ne(AI3.f13) 
11 • '. 1mag 


def1ne(BR3.f14) 
Ilelement 
B. real 
component 
def1ne(BI3.fl5) 
def1ne (AR4. f16) 11extra 
A value. 
for prefetch 
def1ne(AI4.f17) 
define (BR4.f18) 
def1ne (BI4. fl9) 


def1ne(ER3. 
f20) IIA+B. real 
(ER = AR + BR) 
def1ne(EI3. 
f21) 11' 
1mag' 
def1ne(FR3. 
f22) II(A-B). real 
def1ne(FI3. 
f23) 11• 
imag 


def1ne(ER4.f24) 
IIA+B. real 
def1ne(EI4.f25) 
11' 
imag 
def1ne(FR4.f26) 
II(A-B). real 
def1ne(FI4.f27) 
11' 
imag 
1/'=-=============== 
adds 
-16.astart.FEtch 
fld.q 
16 
(FEtch)++.AR4 
adds 
-l.groups.somecount 
11 bla 
counter 
(predecremented 
already 
IIus1ng groups=blacount 
on the 
offset_l 
loop. 
1ntent10nally. 
adds 
-16.FEtch.STore 
Ilstartup 
the 
loop: 


by 1) 
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II ---------------------11 
Al ••••••A2 ••••••A3 ••••••Write: 
d.pfadd.ss 
AR4.BR4.fO 
II ARn+BRn 
- 
fld.q 
16 
(FEtch)++.AR3 
d.pfadd.ss 
AI4.BI4.fO 
II Aln+Bln 
ERn 
adds 
-2.rO.decrem 
112 bflies 
per loop 
d.pfsub.ss 
AR4.BR4.fO 
II ARn-BRn 
Eln 
ERn 
bla decrem.somecount. 
offsetl_loop 
Ilinit LCC' 


d.pfsub.ss 
AI4.BI4.ER4 
II Aln-Bln 
FRn 
Eln 
nop 
II ---------------------11 
Al ••••••A2 ••••••A3 •••••• Write: 
offsetLloop: 
: 
d.pfadd.ss 
AR3.BR3.EI4 
II AR+BR 
FI- 
nop 
d.pfadd.ss 
AI3.BI3.FR4 
/I AI+BI 
l:R 
fld.q 
16 
(FEtch)++.AR4 
d.pfsub.ss 
AR3.BR3.FI4 
lIAR-BR 
El 
fst.q 
ER4.16(STore)++ 
d.pfsub.ss 
AI3.BI3.ER3 
II AI-BI 
nop 
d.pfadd.ss 
AR4.BR4.EI3 
II AR2+BR2 
FI 
fld.q 
16 
(FEtch)++.AR3 
d.pfadd.ss 
AI4.BI4.FR3 
II AI2+BI2 
ER2 
nop 
d.pfsub.'ss AR4.BR4.FI3 
bla decrem.somecount. 
d.pfsub.ss 
AI4.BI4.ER4 
fst.q 
ER3.16(STore)++ 
11--------- 
end_offsetl_loQP:: 
d.fiadd.ss 
fO.fO.fO 
br endit 
fladd.ss 
fO.fO.fO 
nop 
11--------- 
.align 
.quad 
offset_2: : 
II want 
FEtch=O.1;4.5;8.9;12.13; 
••• elements. 
II ASSUMING 
wincr=N/4, 
(W_addr=O.N/4.0.N/4.0 
••••). Trivial 
W() 
factors. 
II 
Even-indexed 
elements 
identical 
to offset_l.W=WO. 
no complex 
multo 
II 
SO 
EReven=(AR+BR). 
Eleven=(AI+BI). 
II 
SO 
FReven=(AR-BR). 
Fleven=(AI-BI). 


ERnext 


, 
FR- 
EI- 


FI- 
FR- 


ER 
FI- 


FR 
El 
ER 


FR 
El 


FI 
FR 


II AR2-BR2 
EI2 
offsetl_loop 
II AI2-BI2 
FR2 


ER2 
FI 


EI2 
ERnext 


II 
Odd components 
have W=(O.-l). 
So B*W = 
(BI.-BR). 
II 
SO ERodd=Re(A+(B*W)) 
= 
(AR+BI) 
Elodd=(AI-BR). 
III 
SO FRodd=Re(A-(B*W)) 
= 
(AR-BI) 
Flodd=(AI+BR). 
II 
Each 
fld.q 
fetches 
AReven.Aleven.ARodd.Alodd. 


IIAssume ERe.Ele.ERo.Elo 
are 4 contiguous 
regs. 
IIAssume FRe.Fle.FRo.Flo 
are 4 contiguous 
regs. 
IIAssume ARe.Ale.ARo.Alo 
are 4 contiguous 
regs. 
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adds 
-16 ,astart ,FEtch 
fld.q 
16 
(FEtch)++,ARe 
fld.q 
16 
(FEtch)++;BRe 
adds 
O,groups,somecount 
Ilbla 
counter 
Ilstartup 
the loop: 
//------ 
pfadd.ss 
pfadd.ss 
d.pfadd.ss 
nop 
d.pfsub.ss 
Alo,BRo,ERe 
11 Alo-BRo 
ERo 
nop 
d.pfsub.ss 
ARe,BRe,Ele 
11 AR-BRe 
ads 
-l,rO,decrem 
112 bflies 
d.pfsub.ss 
Ale,Ble,ERo 
11 AI-Ble 
adds 
-16,astart,STore 
d.pfsub.ss 
ARo,Blo,Elo 
11 ARo-Blo 
FI 
FR 
bla decrem,somecount, 
offset2_100p 
Ilinit LCC 
d.pfadd.ss 
Alo,BRo,FRe 
11 Alo+BRo 
FRo 
FI 
nop 
offset2_100p:: 
d.fnop 
fld.q 


ARe,BRe,fO 
Ale,Ble,fO 
ARo,Blo,fO 


-11 Al ••••••A2 ••••••A3 ••••••Write: 
11 AR+BRe 
11 AI+Ble 
ER 
11 ARo+Blo 
El 


Elo 
ERo 
El 
per loop, but groups 
is half 
desired 
value. 
FR 
Elo 
ERo 


d.fnop 
fld.q 
16 
(FEtch)++,BRe 
11 ------- 
11 Al ••••••A2 ••••••A3 ••••••Write: 
d.pfadd.ss 
ARe,BRe,Fle 
11 AR+BRe 
Flo 
FRo 
FI 
nop 
d.pfadd.ss 
Ale,Ble,FRo 
11 AI+Ble 
ER 
nop 
d.pfadd.ss 
fst.q 
d.pfsub.ss 
nop 
d.pfsub.ss 
ARe,BRe,Ele 
11 AR-BRe 
Elo 
nop 
d.pfsub.ss 
Ale,Ble,ERo 
11 AI-Ble 
FR 
fst.q 
FRe,16(STore)++ 
d.pfsub.ss 
ARo,Blo,Elo 
11 ARo-Blo 
FI 
bla decrem,somecount,offset2_100p 
d.pfadd.ss 
Alo,BRo,FRe 
11 Alo+BRo 
FRo 
nop 
endit:: 
11 restore 
regs 
fiadd.ss 
fO,fO,fO 
Ilexit DIM 
fld.q 
O(sp),f12 
fiadd.ss 
fO,fO,fO 
Illast 
DIM pair 
fld.q 
16(sp),f8 
adds 
32,sp,sp 
bri rl 
nop 
11'============================== 


ER 


El 


16 
(FEtch)++,ARellfetch 
AR,AI,ARo,Alo 


Flo 


ARo,Blo,Flo 
11 ARo+Blo 
El 
ER 
Flo 
ERe,16(STore)++ 
Ilupdate 
ER 
,El ,ERo,Elo 
Alo,BRo,ERe 
11 Alo-BRo 
ERo 
El 
ER 


ERo 


Elo 


FR 


FI 


ER 


Elo 


FR 


FRo 


El 


ERo 


Elo 


FR 
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C--------------- 
C File: 
dirr.f 
C 
FFT 
- Decimation 
in Freq, 
radix-2, 
inplace, 
I-dimen, 
C 
REAL 
input 
C 
Intel 
is not responsible 
for use nor misuse 
of this 
code. 


C 
8/14/89 


C Inputs: 
C 
A= REAL 
array 
of input, up to 1024 pts, 
single-prec 
float 
C 
M= log of number 
of pts 
C 
= 
(Number of stages 
of FFT) 
C 
N = number 
of points. 
ie, N= 2**M = number 
of pts 
C 
W= complex 
array 
of twiddle 
factors, 
length 
N/2. 
C 
REV= 
0 if bitreversed 
output 
ok. l=must 
re-order 
output 
C 
(REV will 
be ignored, 
and output will 
be properly 
ordered. 
C 
reversal 
WILL be done.) 


C 
C 
C 
C 
C 


Bit 


Outputs: 
A= complex 
fft of input A, but only the positive 
frequency 
half. 
Length 
= N/2+1 
complex 
numbers. 
A(0:n/2) 


subroutine 
dirr(a,m,N,W,REV) 
integer 
m,N, 
i, j,k, REV,wlimit 
integer 
offset, 
stage, 
groups, 
wincr,powers2(0:10) 
real a(N) 
complex 
w(N/2) ,temp 


data powers2 
/1,2,4,8,16,32,64,128,256,512,1024/ 
C Powers2 
to avoid 
calls to POW, DIV 


C Twiddle 
factor 
array w(k) has 
(cos,-sin) 
of 2pi*k/N 
CC Assume 
the caller provides 
w(k) 
constants 
ALREADY 
initialized 
C------------ 
C Pre-touch 
data, 
for 8kByte 
fft: 
(2048 points 
real) 
IF 
(N .gt. 1025) 
THEN 
call fetch(a,%VAL(n/2)) 
ENDIF 
C------------ 
wlimit 
= 8*((N/2) 
- 1) 


C "DO 20" stage-loop: 
doing 
Complex 
FFT·on 
length 
N/2 array. 
Twiddles 
are 
C 
for a length N array, 
so wincr 
gets 
scaled by 2. 
DO 20 stage = l,m-l 
groups = powers2(stage-l) 
C groups=number 
of times 
the twiddle 
factors 
are used, 
ie, the number 
of 
C 
smaller 
DFTs the stage 
is split 
1nto. 


C 
offset 
gets N/4,N/8,N/16, ••• 
offset = powers2(m-l-stage) 
w1ncr 
= groups 
* 2 
call d1fstep(a,w,groups,offset,w1ncr,w11m1t) 
20 CONTINUE 


call b1trev(a,%VAL(M-l),n/2) 
call realf1x(a,w,%VAL(n)) 


RETURN 
END 
C------------ 
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realfix.ss: 
This 
is i860(tm) 
CPU assembly 
code 
to revise 
data 
from 
an 
N/2 length 
Complex 
FFT. 
(assumes 
the 
input 
data 
fed to Complex 
FFT was 
N real 
values) 


/I 
/I 
/I 
/I 
/I 
/I 
11 8/14/89 
11 
This 
18-cycle-butterfly 
loop 
may 
be sub-optimal. 
/I 
/I 
/I 
/I 
/I 
/I 
/I 
/I 
11 
Algorithm 
from 
"Numerical 
Recipes 
in Co, by Flannery, 
Press, 
Teukolsky, 
and 
11 
Vetterling, 
Cambridge 
Univ. 
Press 
1988, 
p.417. 
11*************************1 


INTEL 
is not 
responsible 
for use 
nor 
misuse 
of this 
code. 


output 
= overwrite 
the 
data 
array 
used 
for 
input. 
Results 
are 
complex. 
ReO,ImO,Rel,Iml, 
••• , Re(N/2),Im(N/2). 
NOTE 
that 
output 
array 
is 1 element 
longer 
than 
input. 


Input 
is H(k), 
output 
is F(k) ••• 
F(k)=.5*( 
H(k)+ 
Hconj(N/2~k) 
-j*(H(k) 
-Hconj (N/2-k»*Wconj 
(k» 


11* The 
C-version 
of realfix: 
*1 void 
realfix_(a,w,n) 
/l1*Input 
= 
11 a(O:n+l): 
length 
n/2+1 
complex 
array. 
Entries 
0:n/2-1 
are 
the 
complex 
FFT 
11 * 
result, 
in correct 
(NON BIT 
REVERSED) 
order. 
Entry 
n/2 
is undefined. 
11 * w: length 
n/2 
complex 
array 
of twiddles. 
(cos,-sin(2pi*k/n» 
11 * n: call-by-value, 
number 
of REAL 
input 
samples 


11 *Output 
= 
11 * 
a(O:n+l): 
length 
n/2+1 
complex 
array. 
11 * 
Format 
is ReO,ImO,Rel,Iml, 
••• , Re(N/2),Im(N/2). 
11 * 
NOTE: 
To generate 
entire 
N-length 
complex 
output 
11 * 
conjugate 
of element(i) 
to element(N-i). 
/I *1 
Ilfloat 
/I 
/I 
/I 
/I 
/I 
/I 


spectrum, 
you 
can 
copy 


all , w[]; int n; 
(int 
aptr,bptr, 
wptr; 
float 
half=0.5, 
AR,AI,BR,BI, 
1* input 
values 
for A,B*I 
PR,PI,SR,SI,DR,DI, 
I*temporary 
differences 
,sums ,products*1 
K,L,M,N, 
I*temporary 
products 
*1 
ER,EI,ERO,EID, 
FR,FI,FRO,FID, 
WR,WI; 


/l1*We do first 
/I 
AR = elol : 
11 elol = AR + AI; 
11 alnl = AR - AI; 


and 
last 
elements 
AI = a[l]; 
all] = 0; 
a[n+l] = 0; 


as special 
case(Imag:O, 
W=(l,O»*1 
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Ilfor(aptr=2, 
bptr=(n-2), 
wptr=2; 
aptr 
< n/2; 
aptr 
+=2, 
bptr 
-=2, wptr 
+=2) 
I/(WR = w[wptr] ; 
WI = w[wptr+l]; 
11 AR = alapt rl : 
AI = a[aptr+l] ; 
11 BR = a[bptr]; 
BI = a[bptr+l]; 
11 1* aptr 
=2,4,6 ••• ,14; 
bptr=30,28,26, 
••• ,18 
(if n=32) 
*1 
11 1* Note 
that 
there 
is no need 
to revise 
the value 
at the middle 
of the 
11 
list, 
as it is already 
correct. 
(.5*(H(n/4)+Hconj(n/4)) 
*1 
11 SI = 
(AI + BI) ; 
11 DR = 
(BR - AR) ; 
11 K = WR*SI; 
L= WI*DR; 
11 M = WR*DR; 
N= WI*SI; 
11 SR = 
(AR + BR) ; 
1/ DI = 
(AI - BI) ; 


PR = K-L; 
PI = M+N; 


11 ERD = SR+PR; 
ER = half*ERD; 
11 
alapt r] = ER; 
11 EID = DI+PI; 
El = half*EID; 
1/ 
a[aptr+l]= El; 
11 FRD = SR-PR; 
FR = half*FRD; 
1/ 
albpt r] = FR; 
/ 


11 FID = PI-DI; 
FI = half*FID; 
11 
a[bptr+l]= FI; 
I rend 
of for-loop 
*1 I 
11************* End 
of C-code 
for realfix.*********************** 
.text 
•align 
•quad 
11-------------- 
define (astart, 
r16) 
Ilinput 
data 
base 
address 


define (wptr,r17) 
11 pointer 
to W table. 
Because 
w-contents 
depend 
on N, 
11 we will 
assume 
the 
caller 
has 
initialized 
w() 
array. 
define(N,r18) 
11 
define(aptr, 
r20) 
define (bptr, r21) 
Ilpointer 
to 1st component 
of butterfly 
(load) 
Ilpointer 
to 2nd 
component 
of bfly 
(load); DOWNCOUNTER 


define (decrem,r24) 
Ilbla 
decrement 
define (count,r25) 
11 bla 
counter 


define(WR, 
fl8) 
define(WI, 
fl9) 
IIW 
(twiddle 
factor), 
real part 
11 " " , imag 


define(AR, 
fl2) 
define (AI, fl3) 
define (ARo,fl4) 
define (Alo,fl5) 
define (BR, fl6) 
define(BI, 
fl7) 


Ilelement 
A, real 
component 
11 " ", 
imag 
11 extra 
A value. 
for prefetch 
(o="odd") 


Ilelement 
B, real 
component 


define(ER, 
f20) 
IIResult 
of butterfly 
which 
overwrites 
AR 
define(EI, 
f21) 11 " 
AI 


define(half,f22) 
Ilconstant 
0.5 


define(FR, 
f24) IIResult 
of butterfly 
which 
overwrites 
BR 
define(FI, 
f25) 
define (PR,f26) 
define (PI,f27) 


define (DR, f28) 
define (DI, f29) 
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define(SR, 
f30) IISum of A+B, 
real part 
define(SI, 
f31) II " ", imag 
" 


.data 
.align 
.double 
halfloc:: 
.float 0.5 
11-------- 
.text 
.align 
.quad 
_realfix_ :: 
fst.q 
f12,-lQ(sp)++ 
Ilsave 
"local" 
regs 
adds 
-4,rO,decrem 
Ilbla decrement 
11-------- 
II We do not bother 
to initialize 
FP pipes 
to zero here, 
as we assume 
II 
this routine 
is called after 
another,"safe", 
pipelined 
FP routine. 


pfld.l 
halfloc,fO 
pfld.d 
8( wptr)++,fO 
Ilskip W(O) 
in~entionally. 
Is a trivial 
(1,0) value 
II init pointers: 
adds 
O,astart,aptr 
pfld.d 
8( wptr)++,fO 
shl 
2,N,bptr 
Ilbptr:total 
# bytes 
of input data 
pfld.d 
8( wptr)++,half 
110.5 into an fpr 
adds 
bptr,astart,bptr 
II bptr points 
to a(N) 


'11 here 
fetch 
first 
set of A,B,W 
before 
bla-loop 
pfld.d 
8( wptr)++,WR 
fld.d 0 
(aptr),AR Ilfor 1st and last elements 
adds -8,N,count 
II bla counter 
(predecrement 
by 2 butterflies 
worth) 
II ----------- 
II Do n/4 butterflies: 
(computing 
only N/2 elements 
of complex 
output, 
because 
II 
the second N/2 are just complex 
conjugates 
of the 1st N/2) 


II Definitions 
for pipe diagram: 
II 
WR = cos(), WI=-sin(). 
II 
DR = BR - AR; 
(diffence 
of Real 
components 
of A,B) 
II 
DI = AI - BI; 
(diffence 
of Imag components) 
II 
SR,SI = sum of A,B 
II 
PR = K - L; where 
K= WR*SI, 
L=WI*DR 
II 
PI = M + N; where 
M= WR*DR, 
N=WI*SI 
il 
(ER,EI)=complex 
result 
to overwrite 
A. 
II 
(FR,FI)="""" 
B. 


first_fly:: 
Ilfill 
pipe. 
II For Oth butterfly: 
/I AR = alol : 
AI = a[l]; 
I I elol = AR + AI; 
all] = 0; 
/I aln] = AR - AI; 
aln+Ll = 0; 


/I KR ••KI ••Ml ••••M2 ••••M3 
T 
Al ••••A2 ••••A3 ••••Write 
r2pt.ss 
fO,fO,fO 
/I 0 
0 
mrmlp2.ss 
AR,AI,fO 
/I 
0 
0 
ERO 
mrmls2.ss 
AR,AI,fO 
/I 
0 
0 
0 
FR 
ER 
rre ,e 
8. (aptr)++,AR 
fld.d 
-8(bptr)++,BR 
d.pfadd.ss 
fO,fO,fO 
/I 
0 
0 
0 
0 
FR 
ER 
d.pfadd.ss 
fO,fO,ER 
/I 
0 
0 
0 
0 
0 
PR 
ERO 
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Al ••••A2 ••••A3 ••••Write 
DIl 
PRl 


PIl 
DIl 
PRl 


ERD 
PIl 
DIl 
PRl 


FRD 
ERD 
PIl 
DIl 


EID 
FRD 
ERD 
PIl 


FID 
EID 
FRD 


FID 
EID 


FID 


-FID 


// KR ••KI ••Ml ••••M2 ••••M3 
T 
d.ratls2.ss 
AI 
,BI ,to // 
Nl 
Ml 
nop 
d.i2pt.ss 
to ,to, 
tOIl 
to 
- 
Ml 
tld.d 
8 
(aptr)++,AR 
d.r2apl.ss 
SR 
,to, 
PR// 
tld.d 
-8(bptr)++,BR 
d.rals2.ss 
SR 
,PR, DI // 
ptld.d 
8( wptr)++,WR 
d.r2apl.ss 
DI 
,to, 
PIlI 
nop 
d.rals2.ss 
PI 
,DI ,to // 
ERl 
nop 
d.ralp2.ss 
to ,to ,to // 
FRl 
ERl 
nop 
d.rals2.ss 
to ,to ,to // 
Ell 
FRl 
ERl 
- 
bla decrem,count,tix_loop 
d.ptadd.ss 
to ,to ,FI // 
Ell 
FRl 
ERl 
- 
nop 
//------------------- 
// Each 
buttertly = 1 complx multiply, 
3 complx 
add, 1 real multiply 
// 
= 
8 multiply, 
10 add/subtract 
// 
3 8-byte 
tetches 
(A, B, W) 
// 
2 8-byte 
stores 
(E, F) 
11 
11 approx. 
18 cycles per buttertly 
11 
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fix_loop: : 
11 KR ••KI ••Ml ••••M2 ••••M3 
T 
Al ••••A2 ••••A3 ••••Write 
d.mr2pt.ss 
fO ,FI ,ER 11 0 
Fll 
Ell 
FRl - 
ERl 
nop 
d.mrmlp2.ss 
AI 
,BI ,FR 11 
Fll 
Ell - 
SI2 
FRl 
nop 
d.mrmls2.ss 
BR 
,AR ,El 11 
Fll - 
DR2 
SI2 
Ell 
fst.d ER,-8(aptr) 
d.mr2pt.ss 
WR 
,fO, FI 11 WR 
DR2 
SI2 
F11 
fst.d FR, 8(bptr) 
d.ralp2.ss 
BR 
,AR ,SI 11 
K2 
SR2 
DR2 
SI2 
andh Ox8000,count,rO 
Ilcheck for negative 
d.m12tpm.ss 
WI 
,DR ,DR 11 
L2 
K2 
SR2 
DR2 
bnc 
endfix 
d.r2pt.ss. 
half,DR, 
fO IIhalf 
M2 
L2 
K2 
SR2 
nop 
d.m12ttpa.ss 
WI 
,SI ,SRII 
N2 
M2 
L2 
K2 - 
SR2 • 


nop 
d.i2st.ss 
fO 
,fO ,fOil 
fO - 
N2 
M2 
K2 
PR2 
nop 
11 KR ••KI ••Ml ••••M2 ••••M3 
T 
Al ••••A2 ••••A3 ••••Write 
d.ratls2.ss 
AI 
,BI , fOIl 
N2 
M2 
DI2 
PR2 
nop 
d.i2pt.ss 
fO ,fO, 
fOIl 
fO 
- 
M2 
PI2 
DI2 
PR2 
fld.d 
8 
(aptr)++,AR 
d.r2apl.ss 
SR ,fO, 
PRII 
ERD 
PI2 
DI2 
PR2 
fld.d -8(bptr)++,BR 
d.rals2.ss 
SR ,PR, 
DIll 
FRD 
ERD 
PI2 
DI2 
pfld.d 
8( wptr)++,WR 
d.r2apl.ss 
DI 
,fO, 
PIlI 
EID 
FRD 
ERD 
PI2 
nop 
d.rals2.ss 
PI 
,DI .ro 11 
ER2 
FID 
EID 
FRD 
nop 
d.ralp2.ss 
ro 
,fO .ro 11 
FR2 
ER2 
FID 
EID 
nop 
d.rals2.ss 
ro .ro .ro 11 
EI2 
FR2 
ER2 
- 
FID 
bla decrem,count,fix_loop 
d.pfadd.ss 
fO ,fO ,FIll 
EI2 
FR2 
ER2 
- 
FID 
nop 
11--------- 
endfix: : 
11 restore 
regs 
fiadd.ss 
fO,fO,fO 
Ilexit DIM 
fld.q 
O(sp) ,f12 
fiadd.ss 
fO,fO,fO 
Illast 
DIM pair 
adds 
l6,sp,sp 
bri rl 
nop 
11------------ 
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PROGRAM 
FFTTEST 
c 
file = real.f 
C 
C 
l-D FFT TEST PROGRAM 


C 
C 8/14/89 


C Intel 
assumes 
no responsibility 
for use 
or misuse 
of this 
code. 
C------------------ 
PARAMETER 
(IREV=l) 
character*8 
really 
PARAMETER 
(REALLY='real') 
c 
PARAMETER 
(REALLY='complex') 
PARAMETER 
(TIMEIT=O, 
CACHETIME=O) 
c REALLY='real' 
means 
real-only 
input, 
otherwise 
assume 
complex 
input 
DATA 
IT/200000/ 
c 
PARAMETER 
(N=2048,M=11) 
PARAMETER 
(N=1024,M=10) 
c 
PARAMETER 
(N=512,M= 
9) 
c 
PARAMETER 
(N=256,M= 
8) 
c 
PARAMETER 
(N=128,M= 
7) 
c 
PARAMETER 
(N=64,M= 
6) 
c 
PARAMETER 
(N=3Z.M= 
5) 
c 
PARAMETER 
(N=16, M=4) 
PARAMETER 
(PI=3.1415926536) 
COMPLEX 
X2(N),X(N) .X3(N), W(N/2) 


Real 
ASQR(N) ,ASQR2(N),XR(N+2) 
,XR1(N+2) ,XR2(N+2),XR3(N 
+2) 
complex 
wtemp 
real 
rtemp 
C 
PRINT 
*,' FFT test program 
print 
*,'============================= 
IF 
(IREV 
.eq. 0) THEN 
print 
*,'NOT 
counting 
time 
for bit-reversal.' 
print 
*,'00 NOT expect 
matching 
answers,without 
bit-rev' 
ELSE 
print 
*, 
'Time for bit-reversal 
included.' 
ENDIF 


print 
*, 'Time for cache writeback 
and 
IF 
(CACHETIME 
.eq. 0) THEN 
print *,' 
NOT 
included, 
if iterating.' 
ELSE 
print *,' 
included. ' 
ENDIF 


fills •••' 


print 
*, 
print 
*, 'If iterating ••• Number 
of Iterations 
=' ,IT 
print * , 
print 
*, 'Number 
of Points 
= 
N 
print * , '('.REALLY.' 
data) , 
print *. 
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C------------------ 
C Init 
twiddle 
tactor 
array 
w(k) 
with 
(cos,-sin) 
ot 2pi*k/N 
rtemp = 2.0*pi/N 
wtemp: 
CMPLX(cos(rtemp), 
-sin(rtemp)) 
w(l) = u ,o, 
0.0) 
DO 
200 
k 
= 
2,N/2 
200 
w(k) = wtemp 
* w(k-l) 
cc print 
*,' W 
(twiddle) 
initialization 
completed •••••• • 
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 
C 
INITIALIZE 
input 
data 


C 
DO 100 I = 1, N 
c:constant: 
c 
Treal = 1.0 
c 
Timag = 0.0 


c:squarewave: 
cc IF 
(I .It. N/2) 
THEN 
cc 
Treal = 1.0 
cc 
Timag = 0.5 


cc ELSE 
cc 
Treal = 0.0 
cc 
Timag = 0.0 
cc ENDIF 
C: ramp 
tunction: 
Treal = I - 1.0 
Timag = Treal + 0.5 
IF 
(REALLY 
.ne. 
'real') 
THEN 
XlI) = CMPLX 
(Treal, 
Timag) 
X2(I) = CMPLX 
(Treal, 
Timag) 
X3(I) = CMPLX 
(Treal, 
Timag) 
ELSE 
XlI) 
= 
X2(I) = 
XR(I) = 
XRl(I) = 
XR2(I) 
= 
XR3(I) 
= 
ENDIF 
100 
CONTINUE 
C 
CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC 
CALL ttt 
(X2, M, N) 
cc Subroutine 
ttt 
is Decimation-In-Time, 
Fortran 
version. 
CALL 
dirr(XR,M,N,W,l) 
(Assuming 
dirr 
produces 
inplace 
result, 
items 
0:N/2 
complex 


CMPLX 
CMPLX 
Treal 
Treal 
Treal 
Treal 


(Treal,O.O) 
(Treal,O.O) 


results) 
c 
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ccccccccccccccccccccccccccccccccccccccc 
IF 
(IREV 
.ne. 0) THEN 
IF 
(TIMEIT 
.eq. 0) THEN 
call 
vcompare(XR,X2,N/2+2) 
call 
cmags(XR,N/2+1,ASQE) 
c cmags 
to take 
squared 
magnitude 
of complex 
values 
in X 
call 
cmags(X2,N,ASQR2) 
c----------------------c 
C print 
non-zero 
results: 
J=O 


DO 700 
I = l,N/2+1 
IF 
«ASQE(I) 
.GT. 1.0) 
.OR. 
(ASQR2(I) 
.GT. 1.0» 
THEN 
WRITE 
(6,22) 
(1-1), ASQR(I), 
ASQR2(I) 
22 
FORMAT 
(' 1-1=',14,' 
ASQR(I)= 
',F14.2, 
' 
ASQE2(I)= 
',F14.2//) 
J = J+l 
IF 
(J 
.GT. 32) 
GOTO 
725 
ENDIF 
700 
CONTINUE 


725 
CALL 
TIME 
ENDIF 
ENDIF 


IF 
(TIMEIT 
.ne. 0) THEN 
ccccccccccccccccccccccccccccccccccccccc 
cc- 
Timing 
loop 
follows: 


print *,' 
Start 
Ass.FFT' 
IF 
(CACHETIME 
.eq. 0) THEN 
DO 500 
I = 1, IT,4 
C Reuse 
same 
array, 
so cache 
fill 
and 
CALL 
dirr(XR, 
M, N,W,IREV) 
CALL 
dirr(XR, 
M, N,W,IREV) 
CALL 
dirr(XR, 
M, N,W,IREV) 
CALL 
dirr(XR, 
M, N,W,IREV) 


writeback 
time 
NOT 
included. 


500 
ELSE 
DO 504 
I = 1, IT,4 
C Alternating 
between 
XR,XR1,XR2,XR3 
CALL 
dirr(XR, 
M, N,W,IREV) 
CALL 
dirr(XR1, 
M, N,W,IREV) 
CALL 
dirr(XR2, 
M, N,W,IREV) 
CALL 
dirr(XR3, 
M, N,W,IREV) 


should 
provide 
cache 
misses. 


504 
ENDIF 
print 
*,' 
END 
Ass. 
FFT' 
ccccccccccccccccccccccccccccccccccccccc 
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ENDIF 


STOP 
END 
c----------------------c 
subroutine 
vcompare(res,exp,n) 
c VCOMPARE 
compares 
2 vectors, 
prints 
out 1st few miscompares 
c 
integer n, errcnt 
real res(n), 
exp(n) 


write(6,12) 
12 
format('*** 
VCOMPARE: 
vector 
comparison 
beginning 
***') 


data errcnt/Oj 
do 30 i = 1,n 
if(AINT(res(i)) 
.ne. AINT(exp(i))) 
then 
c 
(print out error, 
exit if alot already} 


120 
print 
.,'.** Error 
in compares 
••• ' 
write(6,121) 
i 
121 
format (, 
Item number = 
',16) 
write (6,124) res (i), exp(i) 
124 
format(' 
Res_=',F14.2,' 
Expected_=',F14.2) 
errcnt = errcnt + 1 
if (errcnt .gt. 19) then 


return 
end if 
end if 
30 
continue 


if 
(errcnt .eq. 0) then 
190 
print 
.,' ••• vector 
compares 
SUCCESSFUL 
.*.' 
end if 


99 return 
end 
c----------------------c 
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C--------------- 
C file: 
fft.f 


C FFT routine .from Rabiner 
a: Gold. 
1975. who copied 
it 
C 
from Cooley. 
Lewis. 
Welch 
C 6/02/89 
C 
C 
C 
C 
C 
C 
C 
C 
C 
C Outputs: 
C 
A= complex 
fft of input A. in NON-bit-reversed 
order. 


C 
C w 
(twiddle factor) 
calculated 
by recursion. 
Supposedly 
takes 
15% more 
C 
operations 
than keeping 
entire 
twiddle 
array 
as constants 
pre-allocated. 
C 


Decimation 
in Time. 
radix-2. 
inplace. 
I-dimen 
Inputs: 
A= 
complex 
array 
of input. up to 1024 pts. 
single-prec 
float 
(maybe more 
than 1024. uncertain 
what 
limit 
is) 
M= log of number 
of pts 
= (Number of stages 
of FFT) 
N = number 
of points. 
ie. N= 2**M = number 
of pts 


subroutine 
fft(a.m.n) 
integer 
m.n. 
i. j.k. ndiv2.powers2(0:10) 
integer 
iplus.offset. 
stage. 
indexl. 
groups 
complex 
a(n) .wtemp(2).w(11) .temp 


C Init twiddle 
factor array w() with 
(cos.-sin) 
of pi.pi/2.pi/4 •••• 
data w(l) 
/(-1.0.0.0) 
/ 
data w(2) 
/(0.0.-1.0) 
/ 
data w(3) 
/(0.7071068.-0.7071068)/ 
data w(4) 
/(0.9238795.-0.3826834)/ 
data w(5) 
/(0.9807853.-0.1950903)/ 
data w(6) 
/(0.9951847.-0.0980171)/ 
data w(7) 
/(0.9987955.-0.0490677)/ 
data w(8) 
/(0.9996988.-0.0245412)/ 
data w(9) 
/(0.9999247.-0.0122715)/ 
data w(10) 
/(0.9999812.-0.0061359) 
/ 
data w(ll) 
/(0.9999953.-0.003068) 
/ 


data powers2 
/1.2.4.8.16.32.64.128.256.512.1024/ 
C Powers2 
to avoid 
calls to POW. DIV 


C Setup 
for bit-reversal 
loop: 
ndiv2 = n / 2 
j =1 
C----------- 
C "DO 7" loop to in-place-bit-reverse-shuffle 
input 
DO 7 
i= 1. n-l 
IF 
(i .1t. j) THEN 
temp = a(j) 
a(j) = a(i) 
a(i) = temp 
ENDIF 
k = ndiv2 
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C 
"While 
(j 
.gt. k)" /*decrease 
j 
by 2**something 
*/ 
6 
IF 
(j .gt. k) THEN 
j = j-k 
k = k / 2 
GOTO 6 
ENDIF 
C 
Add next lower power of 2 to j 
7 
j = j+k 
C------------ 
C Special 
case for stage 1: no complex multiplies. 
simple add 
C 
(Performance enhancement) 
groups = 2 
offset = 1 
indexl = 1 
C 
i-loop iterates N/2 times for 1st stage 
(and would do twice N/4 x for 2nd) 
CVD$ 
NODEPCHK 
DO 8 i = 1.n.2 
iplus = i + 1 


temp = a(iplus) 
a(iplus) 
= a(i) - temp 
8 
a(i) = a(i) + temp 
C------------ 
C Special 
case for stage 2: no complex mUltiplies. 
simple add 
C 
(Performance enhancement) 
groups = 4 
offset = 2 
indexl = 1 


C 
i-loop iterates N/4 times for 2nd stage 
C 
1st call to i-loop.in 
stage2: indexl=l. wtemp(l)=(l.O) 
CVD$ 
NODEPCHK 
DO 90 i = 1.n.4 
iplus = i + 2 
temp = a(iplus) 
a(iplus) 
= a(i) - temp 
90 
a(i) = a(i) + temp 


indexl = 2 
CVD$ 
NODEPCHK 
CVD$ 
NOVECTOR 
DO 92 i = 2.n.4 
iplus = i + 2 
temp = CMPLX(AIMAG(a(iplus)) .-REAL(a(iplus))) 
a(iplus) = a(i) - temp 
92 
a(i) = a(i) + temp 
CVD$ 
VECTOR 
C------------ 
C "DO 20· stage-loop 
executed 
once for each of the 
(m) stages of FFT 
C 
(Except 1st aod 2nd stage) 
C 
offset gets 4.8.16.32.64.128.256 ••• 
DO 20 stage = 3.m 
groups = powers2(stage) 
offset = groups/2 
wtemp(l) 
=(1.0. 
0.0) 
C 
One twiddle 
seed 
(W) calc per stage. 
C 
We pre-allocated 
w(12)-array 
with those values. 
avoid cos/sin calls 
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C------------------- 
DO 20 indexl = l.offset 


C "DO 10" i-loop does 
each butterfly 
of each stage. with varying 
twiddles 
C 
i-loop 
iterates 
N/2 times 
for 1st stage. N/4 x for 2nd. N/8 x for 3rd 
C 
stage. N/16 x for 4th stage •••• 1 time 
for last stage. 


CVD$ 
CVD$ 
NODEPCHK 
ALTCODE 
DO 10 i = indexl.n.groups 
iplus = i + offset 
temp = a(iplus) 
* wtemp(l) 
a(iplus) 
= a(i) - temp 
a(i) = a(i) + temp 
= wtemp(l) 
* w(stage) 
10 
20 wtemp(l) 
RETURN 
END 
C------------ 
subroutine 
cmags(a.n.asqr) 
C Complex 
magnitude 
squared. 
C Inputs: 
C 
A= complex 
array 
of input. 
single-prec 
float 
C 
N = number 
of input points 
(and output points) 
C Ouput: 
C 
asqr = real squared 
magnitude 
(R*R + 1*1). N elements. 
single-prec 
float 


integer 
n.i 
real asqr(n) 
complex 
a(n) 


DO 100 i = 1. n 
asqr(i) 
= 
(REAL(a(i»*REAL(a(i») 
+ (AIMAG(a(i»*AIMAG(a(i») 
100 
CONTINUE 
RETURN 
END 
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## makefile 
for i860(tml 
CPU FFTs 
(for Unix 
V/386 
programming 
environmentl 
## 
8/7/89 
## 
GH=/usr/i860/bin 
GHL=/usr/i860/lib 
CC=$(GHl/c860 
FC=$(GHl/f860 


CFLAGS= 
-OLM 
-X393 
-X405 
-X188 
-X370 


FFLAGS= 
-OLM 
-X370 
-X393 
-X7l 
-X422 
## -X7l 
uses 
single-precision 
math 
routines 


FLFLAGS= 
-Mx map 
-e 
start 


LFLAGS= 
-Mx map 
-e _main 
CLIB=$(GHLl/libc.a 
MLIBPSR=$(GHLl/860mtlib.a 


MLIB=$(GHLl/libm.a 
FLIB=$(GHLl/libf.a 


ASM=$(GHl/as860 


FLINK=$(GHl/ld860 
$(FLFLAGSl 


RT=$(GHLl/s5lib.a 


LIBS= 
$(FLIBl 
$ (MLIBPSRl 
$(MLIBl 
$(CLIBl 
$(RTl 


LIBCC= 
$(MLIBl 
$(CLIBl 
$(RTl 
## NOTE: 
Order 
of linked 
files 
is CRUCIAL. 
other 
orders 
may 
give 
errors 


.SUFFIXES: 
•SUFFIXES: 
.t 
.c .s .ss 
.0 .8 


•IGNORE: 
## 
.ignore 
causes 
make 
to 
ignore 
error 
codes 
from 
compilers 


## 
To test 
Fortran 
plus 
assembler-fft-stage 
version: 
FILE: 
ffttest.o 
fft.o 
diff.o 
bitrev.o 
difstep.o 
start.o 
time.o 


## To test 
all-Fortran 
version 
of fft: 
##FILE= 
ffttest.o 
fft.o 
diff.o 
difstepf.o 
start.o 
time.o 


## To test 
REAL-input 
version 
of fft: 
RFILE= 
real.o 
fft.o 
dirr.o 
realfix.o 
difstep.o 
bitrev.o 
start.o 
time.o 


.f.o: 
$(FCl 
$ (FFLAGSl 
$*.f 
$(ASMl 
-x -0 $*.0 
$*.s 


.c.o: 
$(CCl 
$ (CFLAGSl 
$*.c 
$(ASMl 
-x -0 $*.0 
$*.s 
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nop 
tinish: : 
call 
_exit 
nop 
.tile 
·start.c· 


.data 
.a11gn 
.1coDIDI 
.end 


•quad 
_stack ,262144+262144 


//========================================= 


/* tile: time.c. 
Purpose: establish a label to use tor breakpoints 
*/ 
long 
Ume_ 
(x) 
long 
*x; 
x = x+4; 
return ((long) x); 
I 
long 
timestop_(x) 
long 
*x; 


( 
x = x+4; 
return( (long) x) ; 
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1.0 
BACKGROUND 


The Intel 82495 Cache Controller and 82490 Cache 
RAM form a high-speed cache subsystem for the In- 
tel486 DX CPU (82495DX/490DX) 
or the i860 XP 
CPU (82495XP/490XP). The reader should be familiar 
with these chips, as described in: 


I) i860 XP CPU Microprocessor Data Sheet (Intel or- 
der #240874) 
2) Intel486 DX Microprocessor Data Sheet (Intel order 
#240440) 


3) 82495XP Cache Controller/82490XP 
Cache RAM 
Data Sheet (Inte1 order #240956, June 1991) 
or Intel486 DX CPU Microprocessor Cache-Chip 
Set Data Sheet (Intel order # 241084, June 1991) 


Diagrams of systems containing the 82495 and 82490 
appear in Figure 1, and a more detailed diagram of the 
CPU/82495/82490 
core appears in Figure 2. (Note: for 
simplicity, 
the 
82495XP/82490XP 
and 
82495DX/ 
82490DX 
will 
be 
referred 
to 
generally 
as 
82495/82490--the 
XP or DX should be inferred de- 
pending upon the CPU being utilized.) In such systems, 
the 82495 controls a cache external to the CPU, and 
includes the cache tags. It can interface gluelessly to an 
Intel486 DX CPU or i860 XP CPU microprocessor, 
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allowing the processor bus to run at 50 MHz with zero 
wait-states, while the memory bus can remain at a low- 
er frequency. Both writeback and writethrough proto- 
cols are supported. Concurrent operations can occur 
simultaneously on the local CPUbus and the shared 
memory bus. All requisites for multiprocessors are in- 
cluded in the 82495, Intel486 DX CPU, and i860 XP 
CPUs, but the 82495 also is useful for a uniprocessor 
system performance enhancement. 


The 82490 cache RAM contains 32 kBytes per chip, 
and is used in groups of 4, 8, or 16to implement caches 
from 128 to 512 kBytes. It supports two-way associativ- 
ity, delayed writebacks, burst transfers, and boundary 
scan test. The 82490 contains much more than RAM 
cells-it 
includes various buffers, queues, and support 
for several bus protocols. It is two-ported, with simulta- 
neous access on both the CPU side and Memory-Bus 
side. The cache optionally supports parity using addi- 
tional 82490 chips. 


Configuration options allow a variety of memory bus 
widths (32 to 144 bits), cache line widths (16 to 128 
bytes), and asynchronous 
or synchronous 
transfers. 
The configuration is selected by the polarity of various 
pins at reset time. 


1. Unlprocessor 


240957-1 


2. Homogeneous 
Multiprocessor 
r----::":"---. 
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3. Heterogeneous 
Multiprocessor 
...--- ....• 


U,128 
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Figure 1. CPU + 82495 + 82490 Systems 
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Figure 
2. CPU + 82495 + 82490 Core 


The Memory Bus Controller (MBe) portion of the sys- 
tem interfaces the 82495 and CPU to the system bus. 
The MBC converts bus status and command lines into 
requests to the 82495, for example, to monitor the prog- 
ress of an ongoing bus transaction from another CPU 
subsystem to ensure consistency with 82495+ 82490 
cache contents. Likewise the MBC adapts 82495 re- 
quests to the bus protocol and arbitrates for ownership 
of the bus. Most CPU requests will not require MBC 
action; only I/O 
cycles, cache bypass requests, and 


82495 cache misses are forwarded by 82495 to the 
MBC, while external cache hits are handled totally by 
82495+82490. 


2.0 
WHY A CUSTOM 
BUS 
INTERFACE? 


Clearly the entire interface to a memory bus (abbreviat- 
ed M-bus) could have been incorporated in the 82495 
and 82490 chips. This approach has been followed by 
some other cache chipsets. 


However, such integration suffers from inflexibility and 
bandwidth limitations. As shown in Figure 3, the per- 
formance and cost targets of the system determine the 
size and complexity of the bus, so if the bus is "hard- 
wired" into the cache controller chip, it will be too 
costly for small systems and too slow for larger sys- 
tems. With the bus interface implemented separately, it 
can be a complex ASIC for a high-bandwidth complex 
system, or a few EPLDs for a PC. The same cache 
controller can improve performance of a variety of bus- 
based CPUs. 


For a desktop PC, a 32-bit simple memory bus is ade- 
quate. For a workstation or small multiprocessor of 
two CPUs, a faster 64-bit bus may be required to give 
adequate bandwidth for graphics frame buffers and in- 
tensive numeric calculations. Bus bandwidth require- 
ments grow as the MIPS rating of each CPU in a sys- 
tem grows; for example, a bus adequate for 12 386 
CPUs may be too slow for 6 Intel486 OX CPUs, as 
they process far more data per second. 
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A large multiprocessor of 6 or more CPUs needs a wide 
and fast bus such as Futurebus +, with split-transac- 
tion capability to prevent bus bottlenecks from slowing 
the performance of every processor. Hierarchies of bus- 
es and caches can further allow more CPUs with rea- 
sonable performance increases as CPUs are added. A 
Futurebus + hierarchy maintains concurrent transac- 
tions on each bus, and "bridge" caches at the junctions 
of buses echo them from bus to bus when the bridge 
detects that one transaction may affect cached copies 
on the other bus. 


Compatibility with existing buses is often crucial in 
product design, so that new faster components can plug 
into existing machines and I/O devices. The flexible 
82495/82490 bus interface allows compatibility as well 
as extension. 


Thus the 82495 and 82490 will be used in a wide variety 
of systems, including standard buses like Futurebus + . 
For proprietary buses, the "proprietor" 
can design an 
ASIC or PAL MBC incorporating the required fea- 
tures. 


3.0 
GUIDELINES 


This document exists to clarify the necessary compo- 
nents and tradeoffs of a Memory Bus Controller. The 
example designs here have not been tested, and signal 
definitions of the i860 XP CPU, Intel486 DX CPU, 
82495, and 82490 chips are subject to change. 


The memory bus controller is not allowed to use (and 
thus add capacitance to) any of the CPU pins used by 
the 82495/82490, except those listed in the 82495 Data 
Sheet [82495/49ODS] description of the BLE# 
pin. 
Only the CPU pins BE7-0#, 
PWT, 
PCD, 
LEN, 
CACHE # , BRDY#, 
PCYC, and CTYP have suffi- 
cient timing margin to tolerate the MBC load. 
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Small I 
Medium I 
Large 
2-3 
CPU. 
4-8 
CPUs 
8+ CPUs 
Feature 
Unlproc 
Multlproce.sor 
I 
CPU<->Memory 
Simple 
Pipelinod BUI 
Crossbar or 
Interconnect 
Bus 
~ 
...""~''''' 
Bus Width, 
32 bit 
32 or 64 
64 or 128 
64 or 128 bit 
Frequency 
20-40 
104Hz 
33 ~Hz or more 
• 


Cache 
WrlteBack ~ 
i : 
WriteThru 
82~95 + 82490 
_ 
3rd Le.el Cache __ 
I 
Arbitration 
Central 
Central 
Distributod 
Arbitration 
(HOLD/HLDA) 
Bus Parking 
I 
I 
LOCKing - 
Bu. Lock 
Addres. Lock 
• 
I 
I 
Error Detect 
• 
Parity 
ECCon Memory, Retry- 
I 
I 
Bu. Protocol 
- 
Simple-- 
Pipelinod 
-- 
Split Transaction 
--- 


I 
Extra 
Read-for-Ownership 
Featur •• 
Cache-to-Cache 
Transfer. 
External FIFOs 
. 
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Shared Bus Interconnect 


Figure 3. System Type and Bus Requirements 


4.0 MBC BLOCK DIAGRAM 


When used in a multiprocessor, the 82495 assumes a 
shared-memory, shared-bus environment so that it can 
observe and "snoop" accesses by others which might 
conflict with the memory locations it has cached, In a 
crossbar or other multipath interconnect, shared-bus 
coherency can be emulated for the 82495 or it can be 
used non-coherently, Either a centralized directory or a 
hierarchy of buses and caches can do the emulation. A 
directory would keep a record, for each line of main 
memory, of caches which have the line. When a cache 
first writes to a line of memory, the central directory 
broadcasts an invalidation message to all other caches 
containing that line. [Agarwal88] 


Shown in Figure 4 is a high-level block diagram of the 
functions and interfaces involved in the Memory Bus 
Controller. Part of the MBC operates on the high-speed 
clock (CLK) which the CPU and 82495 use. While the 
M-bus could use the 50 MHz CPU CLK, such a fast 
M-bus is hard to design. The part of the MBC which 
interacts with the memory bus protocol rnns on an M- 
bus clock (MCLK), if that protocol is clocked. Also 
possible is an unclocked M-bus protocol 
using the 
82495/82490 in "strobed" 
mode. The MBC contains 
synchronizers and a few signals which cross between 
the two clock domains. Synchronizers, consisting of 
specially-designed flip-flops, allow a clocked state ma- 
chine to use data which may be transitioning near the 
edge of the clock. Unsynchronized data can cause me- 
tastability 
in latches, where their output changes slowly 
and unpredictably. 
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Figure 4. Generic Block Diagram of MBC 
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5.0 
DESIGN 
EXAMPLE: 
A 
UNIPROCESSOR 
MBC 


A simple MBC design example is an adapter to allow 
plugging a daughtercard module with an Inte1486 DX 
CPU. 82495. and 4 82490s into an Intel486 DX CPU 
microprocessor PGA socket. The memory bus is an In- 
tel486 DX CPU-bus. allowing the external cache to be 
a performance enhancing option. It assumes a "divided 
synchronous" M-bus clock. where the M-bus runs at '/2 
the CPU CLK speed. Thus no synchronizers are need- 
ed. The MBC uses both the CPU CLK and the M-bus 
MCLK. 


This design requires 
• 
1 74F377 latch 
• 6 PLDs containing 10 state machines 


• 2 chips for clock generation. not part of the MBC 


Approximately 70 signal pins connect the MBC block 
to the CPU. cache. and memory. Only a uniprocessor is 
supported. although the bus protocol and MBC could 
be enhanced for multiprocessing coherency. Figure 5 
shows a block diagram. Details of the design can be 
found in Appendix B. 


6.0 
DESIGN 
EXAMPLE: 
A 
MULTIPROCESSOR 
MBC 
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An i860 XP CPU multiprocessor-capable MBC (figure 
6) using an M-bus similar to the i860 XP CPU bus is 
proposed. For clocking. it uses an MCLK of 33 MHz. 
totally asynchronous to the 50 MHz CPU CLK. It 
could therefore be upgraded to faster CPU CLK rates 
in the future without changing the design or M-bus. 


The design requires: 
• 2 74F377 octal latches (for BE7-0#. etc...) 
• 2 74AS4374 dual-rank-synchronizer 
octal registers 


• 
16 PLDs 
• 2 GA 1110 clock drivers for clock distribution 


These components could be integrated into a single • 
ASIC chip. as about 120 signals connect to the MBC. 
The MBC can be used for a uniprocessor or multipro- 
cessor i860 XP CPU design. Details can be found in 
Appendix C. 
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MBC Functions for Uni and 
Multiprocessors 
Table 1 shows the responsibilities of the Memory Bus 
Controller for uniprocessors and multiprocessors (MP). 
The multiprocessor features exist mainly to prevent bus 
over-utilization. However, some of the jobs common to 
both are more complex in MP for example, arbitration 
and snooping. The pin lists in the table are not exhaus- 
tive. 


Reset and configuration control includes strapping of 
the following pins to resistors at VCC or Ground, or 
"temporary 
strapping" 
of multifunction 
pins whose 
state during the last 16 clocks before falling edge of 
RESET determines 82495, 82490, or CPU configura- 


Table 1. Functions of the Memory Bus Controller 


MBC Functions for Unl and Multiprocessors 
1. RESET and Configuration 
2. FLUSH #" and SYNC #" 


* * 3. Bus Error Detection, 
Retry 
4. CPU transfer 
tracking 
(burst count) 
5. Mbus transfer 
tracking 
(burst count) 
(including 
writeback, 
allocation) 
6. Synchronization 
between 
clock domains 
** 
7. Memory-bus 
pipelining 
* * 8. MBC-to-82495 
pipelining 
9. Memory 
Bus Arbitration 
10. Cacheability 
decode 
* *11. Redrive bus signals for BTL or ECL levels or heavy capacitive 
loads 
"12. 
Packing (convert 
32-bit M-bus for 64-bit 82490 size, or 8-bit ROM) 
"13. 
Bus messages 
(interrupts, 
flushes) 
"14. 
Boundary 
scan and selftest 
"15. 
Performance 
monitoring 
(M-bus utilization, 
read vs. write) 
16. Snoop handshake 
(snooping 
DMA or other CPU) 
17. Snoop writebacks 


Additional MBC Functions for Multiprocessors 
M1. Snoop window 
(as master) 
M2. Backoff 
82495 when request was to M-line in another 
82495 
"M3. 
Snoop filtering 
(via SMLN #") 
**M4. 
Cache-to-cache 
transfers 
(CTCT) 
"M5. 
Read-For-Ownership 
(RFO) 
"M6. 
Split transactions 
(requires duplicate 
tag array) 
"M7. 
Memory cycle abort (after MHITM #") 
M8. LOCK #" protection 
"M9. 
LOCK#" de-assertion 
(for back-to-back 
Intel486 
DX CPU locks) 
"M10. 
CPLOCK#" 
(Inte1486 DX CPU only) 
* *M 11. Snoop during LOCK #" 
"M12. 
Multiprocessor 
Interrupts 
(for Message-Based 
Interrupts 
or TLB shootdown) 


* * = optional 
and implementation 
dependent 


Pins 
RESET,HOLD,CAHOLD 
CAHOLD,FSIOUT 
#",FLUSH #",SYNC#" 
PCHK#",BERR 
CLEN1 :O,RDYSRC,BRDY 
#" 
CRDY#",MBRDY#" 


BGT #",CADS #",MBRDY #" 


BGT #",CNA #",MEOC #",CRDY #" 


CNA#",MALE 
BGT#" 


KWEND#",MKEN#" 


MBRDY#" 


INT(R),FLUSH 
#" 


TCK,TMS,SLFTST 
#" 


CW/R#",CADS#" 


SNPSTB #",SNPCLK,SNPCYC 
#" 


MHITM#" ,SNPADS#" 


Pins 
SWEND#", 
MWB/WT#" 


MAOE#" 
SMLN#" 


DRCTM #",MBAOE #" 


PALLC #",DRCTM #",MFRZ #" 


CWAY 
MHITM#" 


KLOCK #",CAHOLD,SNPCYC#" 


KLOCK#" 


CPLOCK#" 


KLOCK#" 


INT,NMI(BERR) 
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tion. The circuit feeding RESET to these chips should 
keep it active at least 16 CLK periods. "Temporary 
strapping" means including RESET or 
1\RESET in 
the logic equation for the pin. The multifunction pins 
are indicated with brackets [ j below: 


i860 XP CPU pins: 


PEN#, FLINE#, 
HOLD 


Intel486 DX CPU pins: 


RDY#, BOFF#, BS8#, BSI6#, HOLD, FLUSH 


82495 pins: 


CFG3, CFG2[KWEND#], 
CFGl [SWEND#], 


CFGO [CNA#], CPUTYP[HITM#], 
FPFLDEN[FPFLD#], 
NCPFLD# [FLUSH#], 


SNPMD[SNPCLKj, C490LDRV [BGT#], 
MEMLDRV[SYNC#j, SLFTST#[CRDY#], 
TEST, 
HIGHZ# [MBALE], CACHE# (NOTE: the 
FPFLDEN pin is definedfor Intel486 DX CPU as 
PLOCKEN[CPLOCK # j. The 82495XPdoes not use 
CFG3 for configuration in i860XP CPU systems.) 


82490 pins: 


MTR4/TR8# [MSEL#], MX4!MX8# [MZBT# l. 
MSTBM[MCLKj, MEMLDRV[MFRZ# j PAR #, 
MOCLK, (BOFF #, HlTM #) 


Intel486 DX CPU: The "unused" Intel486 DX CPU 
inputs (RDY#, 
BS8#, BSI6#, BOFF#) with 82495 
should be connected as described in the Intel486 DX 
CPl} Chipset EDS. 


The Intel486 DX CPU FLUSH # input should be tied 
up, unless the system requires FLUSH messages from 
the M-bus to be interpreted. Then the MBC must assert 
the FLUSH # inputs to both Intel486 DX CPU and 
82495, because 82495 does not do back-invalidates to 
the Intel486 DX CPU for FLUSH#. During RESET, 
the Intel486 DX CPU FLUSH # input must be kept 
high to avoid putting the CPU in tristate-output-test- 
mode (Inte1486DX CPU Data Sheet Section 8.4). 


i860 XP CPU: The i860 XP CPU input PEN# (parity 
trap ENable) must be strapped high unless the memory 
data bus feeding the 82490salways contains good pari- 
ty and the i860 XP CPU system uses 2 82490sin parity 
mode; in the latter case, strap PEN# 
low. HOLD 
should be strapped low and FLINE# strapped high, as 
those features cannot be used with 82495. 


82495: 
The 
multiplexed 
82495 
pin 
FPFLDEN 
[FPFLD# j becomes an output after RESET, so the 
PAL or ASIC which creates FPFLDEN must float it 
as soon as RESET = O.The same multiplexing applies 
to Intel486 DX CPU mode, where the pin is named 
PLOCKEN[CPLOCK#j. 
Likewise, the multiplexed 
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input FLUSH # [NCPFLD # j should be driven high 
the same clock that RESET falls, to prevent an unnec- 
essary 82495 cache flush. In Intel486 DX CPU sys- 
tems, the 82495input CACHE# must be tied low and 
HITM#[CPUTypj 
must be tied LOW, as it signals 
CPUTYPE to 82495. 


82490: The 82490DX inputs HITM# 
and BOFF# 
must be tied high in an Intel486 DX CPU system, as 
they exist to support the i860 XP CPU writeback 
cache. With an i860 XP CPU, the 82490XP input 
BOFF# comes from 82495XP but HITM# from i860 
XP CPU feeds 82495XP and 82490XP. 


The 82490input MOCLK must also be tied low or to a 
delayed version of MCLK, if clocked-M-bus mode is 
used. This is because the 82490 senses the state of 
MOCLK after RESET ends-if 
MOCLK stays low, 
the 82490 uses MCLK to drive MDATA. If MOCLK 
toggles after RESET, the 82490 will use MOCLK to 
switch output data. Using a delay-line externally to the 
82490 to generate MOCLK from MCLK allows the 
design a longer hold-time at other-receivers of MDA- 
TA in the system. For a clocked-M-bus (non- synchro- 
nous to CLK), the undelayed MCLK should be con- 
nected to the 82495's SNPCLK input and should be 
toggling during RESET to tell the 82495 to snoop in 
clocked mode. 


During RESET, the 82495and 82490 will float the bi- 
directional lines they share with the CPU, such as 
CDATA and A31:A3. Thus driver contention is avoid- 
ed. The RESET input should be synchronous to CLK 
and deasserted to the 82495, 824905,and CPU at the 
same time, to assure that the configuration controls get 
properly passed between them. 


For Intel486 DX CPU resets, refer to [82495/490DSj 
for the sequencing of HOLD, HLDA, CAHOLD, and 
RESET required to reset only the processor without 
destroying 82495 cache contents. For that purpose, a 
separate RESET line is advised for the CPU and 
82495/82490. The CPU RESET line must be wired to 
the WRMRST input of 82495, to force 82495 to assert 
the BRDYI # input to the CPU during a reset of CPU- 
only (the CPU uses the BRDYI # input during RESET 
to know of the 82495's existance). The HOLD input of 
the Intel486 DX CPU and i860 XP CPU processors 
should be kept low during normal operation with the 
82495,because floating the processor outputs may yield 
undefined 82495 behavior. 


FLUSH# 
(and SYNC#) of caches requested by soft- 
ware must be decoded from the 82495 outputs CW 
10#, 
CD/C#, 
and CW/R# 
(=001) 
and latched 
BE3-0# 
from the CPU. BE3-0# 
values of 0111 or 
1101should activate the 82495 FLUSH # input, as the 
Intel486 DX CPU outputs them in response to the 
INVD and WBINVD instructions, respectively. Synch 
and flush commands may also come from the bus as a 
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message in a multiprocessor system. The 82495 is smart 
enough to allow assertion of FLUSH# 
or SYNC# 
at 
any time, and will delay the beginning of the flushing 
action until all current CPU and M-bus cycles have 
completed. The inputs are edge-sensitive. If the bus de- 
fines cache flush messages, the MBC may activate the 
Intel486 DX CPU FLUSH # 
input as well as the 
82495's in response to bus message decodes. 


Bus Error or Timeout Detection logic in the MBC can 
use the CPU's PCHK # output or other M-bus-specific 
signals to detect errors. Note that the assertion of 
PCHK # will occur near the time of the error on the 
M-bus ONLY for non-cacheable reads or 82495-cache- 
miss 
reads. 
For 
82495-hits 
and 
CPU-idle 
cycles, 
PCHK # may arise due to a floating or erroneous CPU 
data bus value transferred on the M-bus much earlier. 
PCHK # must be ignored by the MBC except during 
the CLK after data transfer to the CPU was signalled 
by the MBC's CPU BRDY #, because PCHK # indi- 
cates i860 XP CPU bus parity status at all times, not 
just during clocks of BRDY # activation. The proces- 
sor inputs INT, BERR, or NMI can be asserted by the 
MBC to signal errors. To detect errors originating in 
the CPU or 82490 upon a write(back), the MBC can 
check parity on the 82490 MDATA pins or on the M- 
bus. 


If the memory bus includes a retry protocol, the MBC 
bears the responsibility to implement it, because the 
82495 will not retry accesses. For a pipelined MBC in- 
terface when the retry occurs after CNA # 
to the 
82495, the MBC must latch the address and other con- 
trols (CW/R#, 
CMIIO#, 
etc...) from the 82495 to use 
in retries. Retry should be triggered by signals other 
than the CPU PCHK # output, because the CPU data 
transfer cannot be retried although the M-bus transfer 
can. 


The 82490 can restart a burst data transfer (for the case 
of an error detected after the first MBRDY # but be- 
fore MEOC# 
and before CRDY#). 
To restart the 
82490, the MBC must deassert MSEL# 
for at least 1 
MCLK. 


While parity is supported by the 82495 and 82490, 
ECC (Error Correcting Codes) cannot conveniently be 
used within the cache. ECC can be implemented on the 
memory system, but no loads are permitted on the 
CPU-to-82495/82490 interface wires for error checking 
logic. 


Scenarios requiring MBC action are 
1) CPU based requests ("Master" mode): 
• 82495 cache read miss (and line fill) 


• 82495 cache write miss 
• Non-cacheable CPU read (including i860 XP CPU 
pfld) 


• Writethrough 
(to S-state line) or Non-cacheable 
CPU write 
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• I/O reads and writes 
• LOCKed reads and writes (will be readthrough or 
writethrough) 
2) 82495 based requests ("Master" mode): 


• Allocation due to write-miss (line fill) 


• Replacement writebacks 
• SNPADS# 
writebacks 
3) Requests from other masters ("Slave" mode): 


• Snooping of DMA accesses 
• Snooping of accesses of other CPUs (in a multipro- 
cessor) 


• Bus-specific requests, like interrupt messages, reset 
requests, cache flushes, configuration registers, ID. 
registers, 
timeout 
detection, 
acknowledgements, 


TLB shootdown 


Transfer Tracking 


Tracking of transfers on the M-bus and CPUbus is re- 
quired of the MBC during all of the above scenarios. 
This tracking (counting) of transfers involves activating 
BRDY # the correct number of times for the CPU and 
MBRDY # (a possibly different number) for the 82495 
and 82490. Transactions on the CPUbus which must be 
MBC-controlled can be 1,2, or 4 data transfers, decod- 
ed from the BLE#-Iatched 
CPU pins: 


Intel486 DX CPU: BE3-0#, 
PWT, PCD 
i860 XP CPU: BE7-O#, PWT, PCD, LEN, 
CACHE # 


and 
from the 
82495 pins CW/R#, 
MCACHE#, 
RDYSRC (and CLEN1:CLENO for Intel486 DX CPU 
mode). 


See [82495/490DS) for a complete definition of the en- 
codings. The BRDY # activations must be done only if 
RDYSRC= 
I, and always correspond to the first 1, 2, 
or 4 MBRDY#s 
for the 82490-M-bus interface. The 
number of MBRDY#s 
always exceeds or is equal to 
the number of BRDY#s, 
even for a 128-bit M-bus. 


Bursts for line fills and writebacks on the CPUbus al- 
ways are 4 transfers, but with some 82495 configura- 
tions the M-bus isS transfers. The addresses are nonse- 
quential when the first access is not at the zeroth word 
of the line. The 
addresses 
corresponding 
to each 
BRDY # and MBRDY # follow these rules: 
1) CPU burst addresses wrap at CPU line length. 
2) When the line address is odd (A2 = 1 for 4-byte bus; 
A3= 1 for 8-byte bus; A4= 1for 16-byte M-bus), the 
next address transferred on CDATA and MDATA 
is the LOWER address (eg., 3 followed by 2). The 
odd-first-then-even pattern continues for all transfers 
of the 
burst. 
This 
order 
optimizes 
interleaved 
DRAM systems, and applies to both the M-bus and 
CPUbus. 


• 
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3) 82490 bursts on CDATA wrap at CPU line length. 
82490 MDA TA burst addresses wrap at 82490 line 
length. For example, a linefill with LR = 4 and a first 
InteI486 DX CPU address (A5:A2) = E, 


• 82490 CDAT A ordering is E F C D 
• 82490 MDATA ordering is CDEF 89AB 4567 0123 
(l28-bit M-bus) OR EF CD AB 89 67 45 23 01 (64- 
bit M-bus) 


For LR = 2 (Line Ratio of 82495 to CPU) and CPUbus width = M-bus, below are the burst orders. Each address 
corresponds to one 4-byte transfer (for InteI486 DX CPUs) or 8-bytes (for i860 XP CPU). Time is increasing left-to- 
right: 


First Address: 0 
CPUtransfers: 
0 1 2 3 
M-bus transfers: 0 1 2 3 4 5 6 7 


First Address: 2 
CPUtransfers: 
2 3 0 1 
M-bus transfers: 2 3 0 1 6 7 4 5 


First Address: 4 
CPUtransfers: 
4 5 6 7 
M-bus transfers: 4 5 6 7 0 1 2 3 


First Address: 6 
CPUtransfers: 
6 7 4 5 
M-bus transfers: 6 7 4 5 2 3 0 1 


First Address: 1 
1 032 
1 0 3 2 5 4 7 6 


First Address: 3 
321 
0 
321 
0 7 6 5 4 


First Address: 5 
5476 
5 4 7 6 1 032 


First Address: 7 
7654 
7 6 543 
2 1 0 


For LR = 2 and M-bus = 2*CPUbus width (both buses using 4 transfers), 
First Address: 0 
CPUtransfers: 
0 1 2 3 
M-bus transfers: 01 23 45 67 


First Address: 2 
CPUtransfers: 
2 3 0 1 
M-bus transfers: 23 01 67 45 


First Address: 4 
CPUtransfers: 
4 5 6 7 
M-bus transfers: 45 67 01 23 


First Address: 6 
CPU transfers: 6 7 4 5 
M-bus Transfers: 67 45 23 01 


The remaining transfer orderings for other LR values 
can be generated similarly, as an exercise for the reader. 


For requests originated by the 82495, the MBC must 
ignore the CPU pins (CACHE#, 
LEN, PWT, PCD, 
PCYC, CTYP, and BE7#-BEO#). 
These requests are 
writebacks, allocations, or linefills. Also the MBC must 
prevent the transfer of those signals to the M-bus for 
82495 requests-for 
example, it must force all BE7# - 
BEO# active during writebacks. The 82495 based re- 
quests can be recognized by: 


RDYSRC=O 
.AND. 
MCACHE# 
=0 
(for write- 
backs, linefills, allocations) 
RDYSRC=O 
.AND. 
MCACHE# 
=0 
.AND. 


MKEN# 
=0 (for linefills, allocations) 
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First Address: 
1 032 
01 23 45 67 


First Address: 3 
321 
0 
23 01 67 45 


First Address: 5 
5476 
45 67 01 23 


First Address: 7 
7 654 
67 45 23 01 


For posted write requests (RDYSRC=O 
and MCA- 
CHE # = I), the length is 1, 2, or 4 transfers and the 
MBC must heed the BLE#-Iatched 
BE7-0#, 
LEN, 
and CACHE#. 


Clock Boundaries and Synchronization 


To optimize performance, the 82495/82490 allow to- 
tal/decoupling of the CPU clock at 50 MHz from the 
M-bus clock. While both the CPU and M-bus could 
run at 50 MHz, the physical size of the M-bus would be 
severely constrained. Future faster versions of CPU and 
82495/82490 would make a synchronous M-bus even 
less feasible. However, with a 100% synchronous inter- 


face, little time is lost in relaying requests from the 
82495 CADS# to the M-bus, and in transfering data 
from the M-bus to the CPUbus. 


Yet with careful design, a slower M-bus such as 
33 MHz can handshake with a 50 MHz 82495 with 
only a couple of clocks spent on synchronizing. Fur- 
thermore, the transfers requiring synchronizing are 
fairly rare uncached cycles,cache misses,and snooping. 
CPU 
performance 
is 
improved 
further 
because 
82495/82490 always post writes destined for the 
M-bus, allowing the CPU to continue processing upon 
write cache-misses and non-cacheable writes. 


Most of the 82495operates on the CPU CLK. Only the 
snooping control inputs operate on another clock, 
called SNPCLK (SNPSTB#, 
SNPINV, SNPNCA). 


SNPCLK can be the same as the MCLK controlling 
82490 MDATA. A SNPCLK can be used with 82495, 
even if the 82490 is strobed without an MCLK. All 
82495 outputs, including snooping results (MHITM #, 
MTHIT#, 
SNPCYC#, 
and SNPBSY#) remain on 
the CPU CLK. 


The 82490 operates half in the CPU CLK domain and 
half in the M-bus domain. While no control signalsflow 
through 82490 between memory and the CPU, 82490 
implements a flow-through data connection of CDA- 
TA to MDATA. Synchronization of the 2 DATA 
paths is unneeded, as the control signal MBRDY # gets 
synched by the MBC to the CPU clocked BRDY #. 
The MBRDY # and BRDY # inputs control multiplex- 
ers inside 82490 to choose which part of a line-fill or 
write is transferred to/from the bus. The MDATA in- 
put latches are closed on MCLK (or MISTB for non- 
clocked operation), and CDATA input latches are 
closed with CLK. 
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If MCLK =CLK at 50 MHz, approximately 1.5 CLK 
periods are required to transfer data through the 82490, 
including 82490 propogation delay (15 ns) and setup 
time to both the 82490 (5 ns) and CPU (7 ns for i860 
XP CPU "CMOS" levels).The MBC must assure data 
setup time at the CPU DO-D31 (D63) pins to the ris- 
ing edge of CLK for the cycle of BRDY # assertion 
during reads, based on the propogation delay from 
MDATA to CDATA listed in the 82490 AC timing 
specs. Writes are not flow-through, as 82490 always 
.butTersthe write-data and later 82495givesCDTS# for 
the write. 


Most of the MBC-to-82490 signals are sampled by 
82490with MCLK, except for BRDY # and CRDY #: 


MBC •.......• 


MClK 
MBRDY# 
MFRZ# 
MZBT# 
M DATA 
MSEL# 
MEOC# 
MDOE # (asynchronous 
to both clocks) 


82490 Signals 
ClK 
BRDY# 
CRDY# 


CDATA 


The MBC must be partitioned into an MCLK side and 
a CLK side. Fortunately, the CPU-side of MBC passes 
only a few signals to the MCLK side, and visa versa. 
The signals listed below from the dual-i860 XP CPU 
MBC design in Appendix C must go through a syn- 
chronizer. Refer to the Appendix for signal definitions. 
In the following diagram, a right-arrow (- 
) identi- 
fies synchronizing to CLK, while a left-arrow ( +- ) 
means synchronizers on MCLK: 
, 


MClK or SNPClK 
MRESET 
YBGT# 
YMEOC# 
YCEOC# 
MBRDY# 
MSWEND# 
MADS# 


Clock Domain of "theSignal: 
Neither 
ClK 


- 
RESET 
- 
BGT# 
- 
CRDY# 
- 
BRDY # _maybe 
- 
BRDY # _maybe 
+- MSWENDA 
- 
SWEND# 
+- 
CADS# 
.or. SNPADS# 
.or. CDTS# 


The signals MKWEND# 
and MNA# 
might also need synchronizing to CLK, if they are derived from M-bus 
responses. 


2-509 


•• 


AP-452 


Two 
TI 
74AS4374 
"Dual-Rank 
Synchronizer" 
chips 
(Figure 
7) are used to transfer 
critical 
signals 
between 
clock 
domains, 
while 
avoiding 
metastability. 
This 
20- 


pin DIP 
has one clock 
input 
and 
8 pairs 
of flip-flops. 


Thus 
each of the 8 "Q" outputs 
reflects the value of its 


"0" input after 2 clock periods. 
One chip is clocked 
by 
CLK 
and the other 
by MCLK. 
If fewer than 
8 signals 
need 
synchronizing, 
chips 
such 
as 
the 
Signetics 
74F50728 
or Intel's 
85C220 
EPLD 
can combine 
syn- 
chronization 
with other 
functions 
[Ham90]. 


For 
an asynchronous 
or strobed 
memory 
bus, M-bus 
signals 
(such 
as MBRDY*,,) 
get delayed 
by the syn- 
chronizer 
for 2 CLK periods 
before 
the 82495 can see 


them. 
For 
a clocked 
(but not by CLK) 
M-bus, 
82495 
outputs 
(such as CADS*") 
get delayed 
by 2 MCLKs 
by 
the other 
synchronizer 
before the M-bus 
sees them. 


The following 
82495 signals 
are defined 
as "asynchro- 
nous", 
meaning 
that 
no external 
synchronizer 
is re- 
quired: 


• 
FLUSH*", 
SYNC*" 


• 
MALE, 
MBALE 


• 
MAOE*", 
MBAOE*" 


Many 
signals 
can cross clock boundaries 
without 
syn- 
chronizing, 
because 
they 
will be ignored 
until 
corre- 


intel~ 


sponding 
status 
signals 
such 
as 
SWEND 
*" 
and 
CADS *" have been 
synchronized 
by the 
MBC. 
Thus 
they will be stable when sampled: 


• 
MWB/WT*" 
, DRCTM*", 
MTHIT*" 
, MHITM*" 
(sampled 
when SWEND*") 


• 
RDYSRC, 
KLOCK 
*", 
CPLOCK 
*", 
CW /R *", 
CD/C*", 
CM/IO*", 
MCACHE*", 
BE7:*" (sampled 
when CADS*") 


Other 
signals do not cross clock boundaries, 
but remain 
within 
the MBC CLK 
logic: 


• 
CNA*", 
PALLC*", 
CACHE*", 
LEN, 
PCD, 
PWT, 


CTYP, 
PCYC, 
MFRZ*" 
... 


Synchronizer Delays 


To avoid lost time due to synchronizer 
delays, 
the fol- 
lowing options 
exist: 
. 


1. Pipeline 
the 
82495/MBC 
interface. 
This 
hides 
the 
delay in synchronizing 
CADS*" 
to its MCLK 
coun- 
terpart 
MADS *". 


2. Define 
the M-bus 
protocol 
so that 
MBRDY 
*" pre- 
cedes 
MDATA 
by 1 MCLK 
for reads. 
Thus 
the 2 
CLK 
delay in creating 
BRDY *" from MBRDY 
*" is 
hidden. 
Likewise 
defme 
MSWEND 
*" to 
precede 


x 


ID 
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Figure 7. Synchronlzer Hardware and Waveforms 
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MHITM # and MTHIT # by a CLK, by generating 
MSWEND# 
from SNPCYC#. 


3. Keep the snooping signals (SWEND#, 
MHITM#, 
MTHIT#, 
SNPINV, SNPCYC#) 
which flow be- 
tween 82495s on the same CLK, so that no synchro- 
nizers enter the snoop path. This is feasible only for a 
small number of physically proximate CPUs. 


4. Synchronize the snooping feedback signals from the 
M-bus (MSWEND#, 
etc...) only at the destination. 


They will be asynchronous to MCLK, transitioning 
with the individual CLK of their source. 


5. Avoid MCLK, using a strobed-only M·bus. Strobed 
buses appear in single-CPU systems with an un- 
clocked DRAM interface. 


6. Activate MEOC# 
to 82490 as soon as possible after 
the last MBRDY#. 
MEOC# 
allows 82490 to begin 


the next data transfer without waiting for CRDY # 
synchronization. 


BRDY # Generation 


Below are recommended sequences of the 82490 and 
CPU burst-transfer "Readys" for CPU reads, assuming 
the bus widths are equal. Sequences with more clocks of 
delay are acceptable but suboptimal. 


I) Synchronous M·bus (MCLK = CLK): MBRDY# 
precedes BRDY # by I or 2 CLKs, to allow propo- 
gation time for data through the 82490 and setup 
time at the CPU pins. 
2) "Divided 
Synchronous" 
M-bus 
(e.g., 
CLK= 50 
MHz, 
MCLK = 25 
MHz, 
skew 
controlled): 


MBRDY# 
precedes BRDY# 
by I or 2 CLKs. The 
BRDY # state machine must ignore MBRDY # in 
the CLK period after it was sampled active. 


3) Other 
Clocked 
M·hus 
(MCLK 
< 
CLK): 


MBRDY # must go through a dual-rank synchroni- 
zer latch (such as the TI 74AS4374) clocked by 
CLK to produce BRDY #. That means 2 CLK de- 
lays between MBRDY # and BRDY #. MBRDY # 
MUST remain active for at least I CLK period to 
assure that the synchronizer latched it active. To 
avoid one MBRDY # getting wrongly sampled ac- 
tive twice, the BRDY # state machine should ignore 
any second MBRDY # in the CLK period after it 
was sampled active. 


4) Strohed M·bus: here MISTB# 
must go through the 
synchronizer with 2 CLK delays to create BRDY #. 
An edge-sensitive strobed M-bus.avoids the problem 
of wrongly converting 
one M-bus transfer 
to 2 
BRDY#s, 
as a level-change marks each M-bus 
transfer: 
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When M-bus width is greater than CPUbus width, the 
above rule holds only for the first BRDY #. Successive 
BRDY # activations follow the rules below: 
• Msbus = 2'CPUbus: 
2 BRDY#s 
occur for each of 


the first 2 MBRDY#s. 
The second BRDY# 
should 


occur I CLK after the first. The third BRDY # can- 
not begin until after the second MBRDY #. 


• M·bus 
= 
4"CPUbus: 
4 BRDY#s 
occur for the 


MBRDY#. 
The last 3 BRDY#s 
can occur immedi- 


ately in the 3 CLKs after the first BRDY # . 


For asynchronous systems (MCLK < CLK), high per- 
formance design choices are: 


M-bus width = 2 • CPUbus width OR 
M-bus width = 4 • CPUbus width 


The wider M-bus allows each M-bus transfer to satisfy 
2 or 4 CPU transfers, so that the CPU is not starved for 
data during a line fill. The 82490 switches its CDATA 
outputs to the next value the CLK after BRDY # asser- 
tion by the MBC for the current value, so the MBC 
controls the provision of data to the CPU on linefills. 


A low-cost MBC can use M-bus width = CPUbus with 
a slower MCLK, by converting the first MBRDY # to 
BRDY# 
through a synchronizer. The last 3 BRDY#s 


can be asserted by MBC after completion of all the M- 
bus transfers. That will allow the CPU to proceed exe- 
cuting after receiving the first datum, which is the one 
it was waiting for in most cases. Alternatively, the M- 
bus protocol can be defined so that no idle clocks occur 
on M-bus after the first MBRDY # 
and the MBC 


knows by counting CLKs when to assert successive 
BRDY#s. 


Shown in the following timing diagrams are data trans- 
fers on both buses for CPU reads. Although they as- 
sume no dead clocks (wait states) during the M-bus 
burst, dead clocks are allowable. 


Writes are not shown in the diagrams because the MBC 
never supplies the CPU BRDY#s 
for burst writes. 


RDYSRC = 0 for most writes, and the 82495 controls 
the CPUbus transfers. The exception to this rule is I/O 
writes, which 82495 does not post; for I/O writes, the 
MBC supplies BRDY # to the CPU, but I/O accesses 
are always 1 non-bursting transfer. 
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Figure 8. Data Transfers, M-bus Width = CPUbus Width. MClK = ClK 
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Figure 9. Data Transfers, M-bus Width = CPUbus Width. ClK/2 
< MClK < ClK. 
Note the starvation on the CPUbus (extra wait state) 
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Figure 10. Data Transfers, M-bus Width = 2·CPUbus. ClK/2 
< MClK < ClK 
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Figure 11. Data Transfers, M-bus Width = 4'CPUbus 
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Plpellnlng 


Pipelining the MBC-to-82495 interface reduces latency 
by allowing the MBC to arbitrate for the next M-bus 
transaction while the f1I'Stis proceeding. If the M-bus is 
also pipelined, it allows the snoop for the next to begin 
during the data transfer for the first. 


Signals used in pipelining 
the 
82495 are CNA 11, 


BGTII, 
MALE, 
KWENDII, 
SWEND 11, 
and 
CDTSII. The 82495 will not listen to CNAII until the 
clock ofBGT# 
activation. Also, KWENDII 
activation 
sometimes allows the 82495 to create a next cycle, such 
as an allocation after a write miss. MALE deassertion 
allows the memory address to remain at the value for a 
previous request, even though the next request CADSII 
and other control signals have already occurred in re- 
sponse to CNA 11. The MBC must latch the 82495 out- 
put signals which change in response to CNA 11, until 
their status no longer matters to ongoing cycles. 


Note that 82495 and 82490 automatically pipeline the 
CPUbus interface to i860 XP CPU by activating NA 11 
and latching address and data. 


Pipelining the M-bus itself involves sending a next ad- 
dress for snooping and DRAM access while data trans- 
fer from the current address still remains incomplete. 
This increases bandwidth by overlapping slow DRAM 
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access with bus data and address transfers, as in the 
i860 XP CPU pipelined bus. 


While each 82495 allows only a one-stage deep pipeline, 
the M-bus can have a deeper pipe as requests from sev- 
eral different 82495s can be in progress. The number of 
stages in the M-bus pipe should match memory access 
latency. For example, use 3 stages for a 240 ns mem- 
ory with a 
120 ns bus MADSII-to-MNAII 
(and 


SWENDII) time, so that a second and third request get 
issued during the memory latency of the first. Pipelin- 
ing does not imply that multiple snoops are ongoing 
waiting for SWENDII; 
that is a split-transaction 
bus, 


defined in a later section. Thus a quick SWEND 11 
turnaround time speeds a new request onto the M-bus. 


The advantage of a pipelined bus using a 4-transfer 
burst is illustrated in Figures 12 and 13. Assumed is a 
fast memory access time of 4 MCLKs. With a slower 
access time, pipelining becomes more important 
for 
maintaining 
data 
bus 
bandwidth; 
even 
with 
the 


4-MCLK access, the unpipelined data bus is idle 50% 
of the time. 


M-bus Arbitration 


If the M-bus possesses more than one master, each 
MBC must arbitrate to gain control of the M-bus when- 
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Figure 12. Data Transfers for Non-Plpellned M-bus. Note low MDATA Bandwidth. 
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Figure 13. Data Transfers for Plpellned M-bus 
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ever its 82495 activates CADS #. No arbitration logic is 
included in 82495 nor 82490, except for the ability to 
float (Hi-Impedance) the 82495 and 82490 M-bus out- 
puts via the MAOE# 
and MDOE# 
signals. The 
BOT# 
and MAOE# 
inputs to 82495 are from MBC 
arbitration 
logic. The simplest 
systems can use a 
HOLD/HLDA/BREQ 
protocol like the i860 XP CPU 
and Intel486 DX CPUs themselves, which is central- 
ized arbitration. 


Expandible buses like Futurebus + and Multibus-II use 
distributed arbitration to allow a variable number of 
masters. Bus parking (retaining ownership of the M-bus 
until another master requests it) is advised to avoid un- 
necessary delay. 


The "restricted 
backoff protocol" 
of 82495 requires 
that it be granted the bus for a modified-line writeback 
after it activates MHITM #, before it will snoop or ini- 
tiate any other transactions. The snooping MBC must 
relinquish the M-bus immediately after the CRDY # of 
the M-line writeback so that the original owner can 
complete its work. 
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Sequencing 


A typical sequence of request and response signals be- 
tween the 82495 and MBC is shown in Figure 14. The 
"SL" entities (CPUSL, 82495SL, 82490sL, MBCsL> are 
for another CPU/Cache core, the SLave(s) who snoop 
when the master CPU owns the bus. No DMA (such as 
EISA or MCA) interaction is shown, but it will be simi- 
lar to the CPU responses, except that no writeback will 
be done by DMA. Time increases downward. A minus- 
sign prefix means deassertion. 


The arbitration for the M-bus shown in the diagram 
assumes a HOLD/HLDA 
protocol like the CPUs use. 


That is a primitive centralized scheme, suitable only for 
a small number of processors. 


The sequencing may vary from that shown; for exam- 
ple, 
MSEL# 
may 
precede 
CDTS#. 
MADS#, 


MW/R#, 
MA31:3, 
MM/IO#, 
MD/C#, 
and 
MBE7-0# would all be valid simultaneously. The sig- 
nals in parentheses would be asserted only in the case of 
a M-line hit in the snooper, and some signals for that 
writeback and possible cache-to-cache transfer are not 
shown. 


/ 
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CPU 
112<490112<495 
"-BUS 
- - 
-CLK- 
-CLKsL- - - 
- 
• 
• 
IICLK 
• 
• - 
ADS. 
> 
CADS. 
> 
..BREQ 
> 
••HOLD 
> 
-WADE. 
> 
< 
••HLDA 
< 
-"HOLD 
, 


< 
••AOE· 


< 
••ALE 
••ADS• 
> 
> 


"A31:3 
> 
> 
> 


IIW/R.,.tc 
> 
> 
SNPSTBIf > 


SNPINW' > 
<••KEN••• 
RO. 


< KWEND· 


< BGT. CNA.· 


<SNPBSY.· 
COTS.· 
> 


< 
••SEL.·, 
••OOE. 
<SNPCYC• 


< 
< 
••HITW. 


< 
••WB/WT. 
< 
< 
11THIT. 


DRCT••• • 


.. 
< 
(-"ADE.) 
< 
< 
(••ADS.) 
< 
(SNPADS.) 
•• 
•• 
< 
(-"OOE.) 
(••SEL.) 
•• 
•• 
(MBRDY.) > 
> 
•• 


< 
< 
••BRDY. 


< 
SWEND" 
< 
••SWEND.· 


< 
< 
< 
BRDY. 


< 
••EOC. 


< 
< 
CRDY. 


< 
- ••ADE. 


••BC 


CPU 
82<490 112<495 
••BC 
"-BUS 


82<4"5st. 112•• 0SL CPUSL 


112<4"5st.112<4110SL 
CPUSL 


82"SsL 
8Z•• 0SL CPUSL 


"-BUS 
82<485SL 8Z•• 0SL CPUSL 
••BC 


Figure 14. MBC Signals and Protocol Layers 


• = Signal 
might occur 
sooner 
or not at all. depending 
on the type of request 
and bus protocol. 


•• = These lines of the sequence occur only on a Hit·to·Modified (MHITM#) 


CPU 
82<490 82<495 
••BC 
"-BUS 


CPU 
82<490 82<495 
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Flowchart 
of MBC Algorithm 
(not applicable 
to all cases) 


Forward snoop rupon ••• 
to mast.r 
82.495 
!using SWENO., WWBjWY.,ORCT•••. 


Signal burst 
transfers 
of "-bus 
via ••BROY•. 


If ROYSRC=I,.cho 
burst 
transfer 
acknowledgments 
on BROY•. 


Compensate 
for LR<> 1 by stopping 
BROY. a•• ertion 
when CPU line filled. 
! 
Notify 82495 
and 82490 
of completion of transfer 
via ••EOC. and CROY•. 
! 


CAOS.! 
"-bus 
already 
owned by this ••BC ? 


YDN 
Arbitrate 
for bus. 


lable 
82495 
to drive address 
to bus (••AOE., ••ALE). 


Echo other 
r.cjuest 
parameters 
(WWjR., 
"CACHE., etc •.•) to the bus. 
! 
~A 
•• ert SGT•• 
! 
Determine cacheabllity, 
auert 
pins KWENO.,"KEN., ••RO•. 


Latch control signals 
(••W/R., 
etc ..•). 
i.. 
ert CNA. to invoke next 82495 
request. 


••HIT••• from other masters 
? 
NOY 
Abort "emory 
cycle. Do Cache-to-cache 
transfer. 


!ait 
for COTS. (befor. 
beginning 
data 
transfer). 
! 


New CAOS. ? 
Y 
N 


Relinquish 
bus ownership. 


Oea••••rt ••AOE. to re-.nable 
snooping 
by this 82495. 
240957-25 
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"" 
lL auowe 
0.1. •• ,,:> to cecioe wnat tne next CADS# 
should be-for 
example, to begin an allocation for a 
write miss, or to start another writethrough. 


A typical implementation would activate KWEND# 
2 
clocks after CADS#, 
using a PLD or fast SRAM to 
decode the upper 
bits of the address 
to generate 
MKEN# 
and MRO#. 


Note that KWEND#, 
SWEND#, 
and BGT# 
need 
not be asserted by the MBC for SNPADS # cycles 
(snoop writebacks), but it may be simpler to assert 
them always. 


Snooping 


Snoop handshaking (bus watching) is useful in a multi- 
processor system, and may be needed in a uniprocessor 
system where the 82495 and CPU caches must be kept 
consistent with DMA accesses. The 82495 must snoop 
all DMA accesses to memory. The MBC sees requests 
from DMA (or other processors) on M-bus and con- 
verts them to SNPSTB# 
activations to the 82495. The 
following scenarios are possible: 
• DMA 
(or 
other 
processor) 
read 
causes 
82495 
MHITM #: 82495/82490 must writeback the modi- 
fied line to memory before the first DMA data 
transfer occurs (unless the DMA controller is capa- 
ble of re-trying the read. If the DMA can retry, then 
the 82495 writeback must cause the initial DMA 
access 
to 
be 
aborted.) 
The 
MBC 
can 
assert 
SNPNCA (SNooP Non-CAcheable Access) to the 
82495 for a DMA read, so that the 82495 knows it 
can keep the block Exclusive upon a hit. 


• DMA (or other) read causes 82495 MTHIT# 
but 
not MHITM #: 
MBC must assert the "shared" 
status line of the M-bus, if the bus includes such a 
line. 


• DMA (or other) write causes 82495 MHITM#: 
82495/82490 must writeback the modified line to 
memory before the first DMA data transfer occurs. 
SNPINV should be activated to 82495 to invalidate 
the line. 


• DMA (or other) write causes 82495 MTHIT# 
but 
not MHITM #: SNPINV 
should be activated to 


when the BS8# (Inte1486 DX CPU) or CS8 (i860 XP 
CPU) pin is asserted. However, if a byte-wide ROM is 
used with an 82495/82490, adaptation of this byte in- 
terface is required from the MBC. 


If the ROM code is to be cacheable, the MBC must 
convert the 82495 line fetches at the ROM location to 
the appropriate 
number 
of byte-wide ROM 
reads. 
Latching transceivers must be employed at the 82490 
MDATA inputs or at the ROM output, to assemble the 
single-byte ROM reads into 4 (or 8) bus-width-wide 
transfers to the 82490s. 


If the 
particular 
M-bus protocol 
requires 
transfer 
widths shorter than the 82490 data width used, the ad- 
dress range requiring such transfers can be made non- 
cacheable to force 82495 and 82490 to use the width 
given in the request from the CPU. 


Bus size adaptation would also be needed to support a 
512kB cache on a 32-bit memory bus. In that case, the 
MBC must control transceivers and MBRDY#s 
to in- 
terface between the 64-bit 82490 MDAT A path and the 
32-bit M-bus. 


Bus Signal Levels 


Redriving 82495/82490 signals to the M-bus (such as 
MDA TA, addresses, and 82495 control outputs) can 
optionally be done by the MBC. If the M-bus signal 
levels are not TTL, like ECL or Futurebus + BTL 
(Backplane Transceiver Level), then appropriate trans- 
ceivers must lie between the M-bus and 82495/82490. 
Also M-buses with heavy capacitive loads should be 
redriven by transceivers, although 82495 and 82490 can 
tolerate loads of up to lOOpF. 


An additional advantage of butTering the 82495/82490 
signals with transceivers in a multiprocessor is that a 
"local M-bus" will exist between the chips and the 
main system M-bus. That allows some local traffic from 
the CPU module to attached peripherals to avoid tra- 
versing the M-bus. Such peripherals might include an 
MPIC/CCU 
(MultiProcessor 
Interrupt 
Controller/ 
Concurrency 
Control Unit), a JTAG boundary-scan 
controller, or a time-of-day clock, as in the Sequent 
Symmetry multiprocessor. 


• 


2-517 


infel~ 
AP-452 


through 
requires 
no 
feedback). 
These 
results 
(DRCfM#, 
MWB/WT#) 
are translations 
of the 
slaves' MHITM # and MTHIT # outputs. As shown in 
Figure 15, typically all MHITM# 
outputs would be 
wired-or via open-collector transceivers. Because slaves 
on the bus may be busy with CPU operations and back- 
invalidations, the snoop delay can vary. Thus a latched 
derivative of the SNPCYC# 
output 
of all 82495s 
would be wired-or to derive SWEND#. 
Alternatively, 
the MBC can count CLKs to generate SWEND#, 
us- 
ing the worst-case upper-bound of CLKs required for 
all 82495s to snoop, but that makes all snoop windows 
long. 


8.0 
MBC FUNCTIONS 
FOR 
MULTIPROCESSORS 


Multiprocessor cache designs have additional motiva- 
tions beyond the uniprocessor goal of reducing memory 
access latency. Reducing memory bus usage is especial- 
ly important because the sharing of the bus creates a 
bottleneck. Thus multi-82495 systems need to minimize 
the number of transactions and make each one as short 
as possible. Large caches (256k or 5I2k) are recom- 
mended for multis, to keep the miss rate as low as pos- 
sible. 


In addition to the uniprocessor functions, an MBC in a 
multiprocessor must handle consistency with caching 
agents other than its own 82495. The multiprocessor 
MBC may also for performance reasons implement 
snoop filtering, cache-to-cache transfers, read-for-own- 
ership, and split transactions. 


Because 82495 will tolerate SWEND # arrival up until 
CRDY #, the M-bus data transfer for reads can overlap 
the snooping delay. The transfers (MBRDY #s) can oc- 
cur during snoop latency, and an MHITM # activation 
would cause the MBC to restart the transfer using 
82490's MSEL# pin. 


Snooping results from listeners (slaves) on the bus must 
be fed back to the master 82495 by the time SWEND# 
is activated, if the system uses writeback policy (write- 
If a 82495 linefill or writethrough hits a dirty line in 
another cache, the MBC cannot BACKOFF the 82495. 
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Figure 15. Creating 
Snoop Results from MHITM#, 
MTHIT#, 
and SNPCYC# 
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Labeling that other cache "the dirty 82495," and the 
initiating 82495 "the master 82495". The master MBC 
must force a retry of the access after the dirty 82495 
dumps the line, but the master 82495 has no "Backoff 
and Retry" input pin. Rather, on a linefill the master 
82495must see the data transfer as if it had come from 
memory. On a write, the master 82495/82490 data 
write must wait until the modified line from the dirty 
82495has been dumped to memory. To do so, the mas- 
ter MBC can either: 
I) Delay the corresponding MBRDY#s to the master 
82490 until the modified line is completely written 
into memory and read out of memory. That implies 
the master MBC will remake the initial request to 
the memory controller after the writeback. 
OR 
2) Create a cache-to-cache transfer, so that the write- 
back data movements go directly into the master 
82490 over the M-bus. A later section describes 
cache-to-cache transfers. Such transfers are quicker 
than waiting for the entire modified line to be writ- 
ten back to memory. 


Note that the 82490 can restart the data transfer for 
reads or writes, in the case of MHITM # activation 
after the first MBRDY# but before MEOC# and be- 
fore CRDY #. To restart the 82490, the MBC must 
deassert MSEL# for at least I MCLK. 


Snoop Window Time (the delay from MADS# 
to 
SWEND#) limits address-bus bandwidth. In the inter- 
val from the address on M-bus until the acknowledge- 
ment (SNPCYC#) by all listeners, no more requests 
(addresses) can be on the bus. This restriction is im- 
plied by: 
I) A typical M-bus has only one MSWEND# 
wire, 
which cannot be identified with the proper request if 
several requsts are outstanding. 


2) 82495 
does 
not 
snoop 
between 
BGT# 
and 
SWEND#. 
3) 82495's "restricted backoff protocol". That protocol 
requires the M-line writeback to be the first transac- 
tion by any 82495 which generates MHITM #, and 
82495 cannot snoop anymore until it finishes the 
MHITM # writeback. 


Data for read-misses cannot be transferred on the 
CPUbus until SWEND#, 
because the MBC cannot 
abort a CPU transfer after giving the first BRDY#. 
Thus the snoop window length influences CPU per- 
formance. Depending on the number of processors, bus 
speed, and memory speed, two scenarios arise from 
snoop window length versus memory access latency: 
I) SWindow < Memory Latency: SWEND# precedes 
the MBRDY#s. If MHITM# 
occurs, the original 
memory access can be aborted and its MBRDY#s 
must be ignored. 
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2) SWindow > Memory Latency: data transfer on 
M-bus can proceed, with MBRDY#s causing 82490 
linefill buffers to advance. After SWEND #, the 
MBC can begin BRDY#s to the CPU and 82490 if 
MHITM # is inactive. If MHITM # is active, the 
MBC must restart the M-bus data transfers after (or 
during) the writeback from the modified snooper, 
and can begin BRDY#s immediately after the first 
MBRDY#. 


The typical snoop window in a multiprocessor using 
the hardware of Figure 15is about 7 CLKs total snoop 
turnaround delay, shown in Figure 16: 
CLK for propagation delay of master's 
MADS# (to slave 82495s' SNPSTB# in- 
puts) 
+ 0.5 to I CLK 
for 
82495 
to 
internally 
latch. 
SNPSTB# and synchronize it to CLK. 


+ I 
CLK for 82495tag lookup and SNPCYC# 
(or more, if 82495is busy with SNPBSY#) 


+ I 
CLK to latch SNPCYC# 
into the MBC 


Set/Reset 
flip-flop generating MSWEN- 


DA. 


+ I 
CLK for MSWENDA open-collector buff- 
er and settling time from all slaves. 


+ 2 
CLKs for MSWENDA to get through syn- 
chronizer (on the master MBC's CLK) and 
inverter to generate SWEND# to the mas- 
ter 82495. 


The window total assumes that the slave 82495s' one 
CLK delay from SNPCYC# until MHITM# 
is con- 
current 
with the synchronizer delay for creating 


SWEND# from MSWENDA at the master. Those 2 
CLKs can overlap with the next MADS# if it is asyn- 
chronously 
generated 
from 
MSWENDA. 
Shorter 


snoop window times can be obtained using duplicate 
external tags as explained later, but this is not trivial. 


Read for Ownership (RFO) protocols decrease bus traf- 
fic by avoiding the M-bus write which would occur 
upon a write-miss.That is, a write-misswould go to the 
bus, followedby a 82495 line allocation request for the 
missed area. With RFO, the MBC does not echo the 
82495 write request to the M-bus. Instead, it asserts 
MFRZ# to freeze the written data in the 82490memo- 
ry buffer, and allows the subsequent 82495allocate line 
request to go to the bus. When the line data returns on 
the M-bus, MBC asserts DRCTM # to cause the 82495 
to mark the line as Modified (the memory system and 
other caching agents do not know of the original write 
miss, so they have invalid copies of the line). 


Signals which the MBC must use to do RFO are: 
1) PALLC# 
(potential ALLoCate): from the 82495 


must be active on the write miss.lf not, RFO cannot 
be performed. 


2) MKEN # and CRDY #: must be activated by the 


MBC for the write, to trigger the 82495'ssubsequent 
allocation request 
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Figure 16. Snoop Waveforms 


3) MFRZ,*: 
must be activated by the MBC to the 
82490 at the time of the MEOC'* and CRDY,* for 
the write. 


4) INVAL (memory bus' Invalidate indication): must 
be asserted by the MBC during the allocate-read to 
force all other 82495s to invalidate their now-obso- 
lete copies of the line. Slave MBCs will assert 
SNPINV to 82495s. 


5) DRCTM'* 
(DiReCt To Modified): must be asserted 
by the MBC during the SWEND'* of the allocate, to 
make the 82495 put the line in M-state. 


6) MWB/WT'*: 
must 
be 
asserted 
during 
the 
SWEND'* of the allocate. 
7) CPLOCK'* 
(82495 Psuedo Lock in Intel486 DX 
CPU systems): if active, the MBC must NOT do 
RFO, because 82495 will activate PALLC'* only on 
the second of the 2 writes. If the MBC tried to RFO, 
it would merge only half of the data into the modi- 
fied line. 


See [82495/490DSl for RFO information. 


Cache-to-cache transfers (CTCT) optimize the speed of 
consistency actions in a multiprocessor. For a read line- 
fill by a master causing an MHITM'* from a slave, the 
writeback data movements go directly into the master 
82490 over the M-bus from the dirty 82490. For a 
write, Read-for-Ownership (RFO) is required for the 
CTCT. If RFO is not implemented, then the cache-to- 
cache option can be used only on linefill (read) misses. 
In fact, RFO makes every write-miss into a linefill. The 
82495/82490 do CTCT only on entire lines, not bytes 
or words. 
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For CTCT on a linefill causing MHITM '*, the MBC 
doing the writeback must initiate the writeback at the 
subline address of the initial read. Starting the write- 
back from the first word of the line is NOT acceptable. 


While CTCT is faster than re-reading the line after 
waiting for the dirty writeback, the latency will be long- 
er in most systems than for fetching lines from main 
memory. CTCT would actually waste time for such 
items as shared instruction pages. For non-written data, 
transferring from memory to a CPU is probably faster 
than tranferring from another cache. So 82495 supports 
only 
M-line 
CTCT 
(no 
writeback 
occurs 
unless 
MHITM'*). 


Signals involved in CTCT are DRCTM,*, 
MZBT'*, 


MHITM'*, 
MBAOE,*, 
and 
MSEL'*. 
See 
[82495/490DSl for CTCT information. 


Snoop filtering can be implemented by the MBC using 
the 82495 SMLN'* (SaMe LiNe) output to reduce the 
latency for snooping. That is, SWEND'* can be assert- 
ed immediately to the requesting 82495, if the 82495 
asserts SMLN'* to indicate the current request is to the 
same line as the previous request. In that case, other 
caches already have checked this line. SMLN'* must 
be ignored if the M-bus has been used by other agents 
between the 2 82495 requests. The M-bus protocol need 
not include a "non-snooped transfer type" for the use 
of this feature, as the MBC can simply ignore the snoop 
responses from other MBC/82495 modules. 


Split transaction (ST) memory-buses such as Future- 
bus + prove valuable in high performance systems. An 
ST 
(also 
called 
"connect/disconnect" 
or 
"packet 
switching") bus divides a single read request into a sep- 
arate address-transfer phase and a data-transfer phase. 
Thus the bus is not monopolized during the long laten- 
cy involved in accessing data across bus hierarchies. 
Writes typically are not split, as the data and address 
are available simultaneously from the writer. In a hier- 
archical bus, requests must be forwarded across bridges 
for the purposes of snooping and memory access at re- 
mote nodes, and the snoop latency may be long. Thus 
the bus should be freed between initial request and 
snoop-response for use in other transactions. 


The 82495 does not support ST directly. That would 
require snooping current cache contents and queue-up 
possible writebacks, for the accesses from other bus 
agents between the time of the BOT 11 (the address 
phase) and SWENDII 
(end of the address phase or 
later). Also 82495 cannot writeback dirty data between 
SWENDII and CRDYII (end of the data phase) of an 
ongoing cycle; it cannot suspend a transfer for later 
resumption after a snoop writeback. 


CADS# 
BGT# 
SWEND# 
CRDY# 


1-----1 
NNNNNNNNNNN 1000000000001 


NN = No snooping by 82495 willoccur in this area 
DD= Delayed response by 82495 to snoop requests 
here. MTHIT'" 
and MHITM# 
asserted immedi- 
ately, but writebackof dirtydata delayed untilaf- 
ter CRDY'"for ongoingcycle. 


82495's inability to snoop during the NN period comes 
from the need to keep 2 addresses into the tags active-- 
one for the outstanding 82495 request, whose tag must 
be updated at SWEND 11 based on MWB/WT 11 and 
DRCTM 11, and one for the snoop inquiry. Further- 
more, any MHITMII 
on the M-bus could not be easily 
linked to the request causing the snoop if 2 snoops are 
outstanding. 


To support 
split transactions 
by snooping between 
BOTII and SWENDII, 
a set of tags external to the 
82495 can be implemented in the MBC. Those tags 
would replicate the contents of the 82495 internal tags, 
listening to all memory bus requests and responding 
with snoop results. Only when a 82495 state change (to 
I or S) is needed will the 82495 be informed of snooping 
action=only 
then will the external tags relay the snoop 
request to it. 


Duplicate tags provide quicker snoop turnaround 
be- 


cause no SNPCLK-to-CLK 
synchronization 
is re- 
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quired; the duplicate tags are in the SNPCLKlMCLK 
logic. While they are a high-performance option, they 
are costly and complex. 


Memory cycle ahort is required in multiprocessors 
when a snooping 82495 activates MHITM 11 to signal 
that the memory's copy of the data requested byanoth- 
er 82495 is obsolete. As explained above, the memory 
read or write must be INHIBITED 
until the writeback 
is done. Depending on implementation, the original ac- 
cess may need to be retried or abandoned. If CTCT and 
RFO are implemented, then abandonment is probably 
adequate. Although the complexity of aborting could 
be avoided 
by delaying 
all memory 
action 
until 
SWEND 11, 
that 
would decrease 
performance. 
An 


M-bus signal such as "SIV" (System InterVene) or 
"MBOFFII" 
(M-bus Back OFF) allows the MBC of 


the snooper to tell memory to abort. 


If the M-bus is pipelined, there may be constraints on 
when the MBC can assert the "abort" signal to avoid 
cancelling the access in progress for the transfer preced- 
ing the one causing MHITM 11. 


Locking 


Locking of the M-bus using the 82495's KLOCK 11 
output is required to ensure atomic accesses for CPU 
locks. For example, memory variables called sema- 
phores in a multitasking airline-reservation system pre- 
vent two processes from trying to update the same list 
of flight reservations simultaneously. A task would read 
the value of the semaphore in an uninterrupted 
read- 


modify-write (RMW) sequence, asserting the CPU's 
LOCKII signal during the RMW to block interrupts! 
(and block locked accesses by other processors to the 
same semaphore in a multiprocessor). If interrupts or 
other accesses were allowed during the sequence, two 
processes (or processors) might both read the sema- 
phore as "available" (zero) and both assume ownership, 
setting it to "unavailable" (nonzero). Then both might 
find the same empty seat and write their individual pas- 
senger's name in the same seat location. In the end, 101 
passengers would have tickets for a lOO-seat plane 
flight. 


The 82495 and i860 XP CPU implement locks in a 
sequentially 
consistent, 
or serializing, manner. That is, 


all data loads and stores within the locked sequence 
occur on the external bus in the same order as they 
appear in the program. Also, all accesses in the pro- 
gram before the LOCK instruction are completed be- 
fore the first locked read or write, and all the locked 
reads/writes 
complete before other accesses after the 


locked sequence. This sequentiality is required by the 
semaphore example above, to prevent the CPU from 
updating the reservation list before it has obtained own- 
ership using the semaphore. 


1The CPU automatically 
blocks 
interrupts 
during 
the 
LOCKed 
sequence. 
The bus arbiter 
is responsible 
for blocking 
other 
accesses. 
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KLOCK 11 activation. Most existing "standard" buses, 
such as Multibus-II, have lock protocols which do such 
an exclusive lock. 82495 snoop behavior during asser- 
tion of its own KLOCK 11 is undefined, since it expects 
no other requests will be permitted then. The' 82495's 
KLOCK 11 can remain asserted for multiple cycles 
when used with the i860 XP CPU, because the proces- 
sor allows up to 32 instructions inside a LOCKed se- 
quence. 


The 32-instruction i860 XP CPU LOCKed intervals 
may exceed 32 CLKs, as each. instruction could take 
several clocks and cause a TLB miss (the intervals 
would be even longer if the i860 XP CPU did data 
cache line fills and line writebacks during LOCK 11, but 
the 82495 prevents that by making KEN 11 = I). Unfor- 
tunately, this limits bus concurrency. 
When several 
82495s share a bus or interconnection network, per- 
formance would improve if a LOCK 11 from one proc- 
essor did not block all others from accessing memory 
and I/O. 
Multiprocessors based on the Intel486 DX 
CPU are not affected as severely by LOCK 11, because 
its lock endures ouly a few clocks-two 
memory ac- 
cesses at most. 


To improve performance of locks in a multiprocessor, a 
scheme of ADDRESS LOCKING may be implement- 
ed. This non-blocking protocol allows other accesses to 
the bus and memory in spite of LOCK 11 activation, 
and requires only that no other CPU tries to access the 
same LOCKed address. If another CPU does try to 
access the same location, that second CPU must be 
stalled until the first LOCK is de-asserted. To ensure 
that the second CPU continues to snoop accesses while 
stalled, BGTII 
to it for its request must be delayed 
until the lock is obtained, as signalled by the bus arbi- 
ter. Semaphore integrity is preserved if all CPUs follow 
the software convention of locking their RMW (Read- 
Modify-Write) semaphore accesses. Also by conven- 
tion, the address corresponding to the first access with 
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owner writes it with a new value. 


KLOCK 11 de-assertion for back-to-back Intel486 DX 
CPU locked accesses is required of the MBC if it uses 
address-based locking, so that the lock-manager knows 
the correct address. The i860 XP CPU always deacti- 
vates LOCK 11 for at least one clock between separate 
locked regions, by virtue of its deactivation in the clock 
after the last locked ADS 11 • However, the Intel486 DX 
CPU deactivates LOCK 11 only in the clock after the 
last BRDY 11 of the last locked access. Thus LOCK 11 
and KLOCKII 
may not deactivate when two XCHG 
instructions occur in succession. The MBC can insert a 
deactivation of the M-bus MLOCK 11 signal by know- 
ing all Intel486 DX CPU locked accesses are Read- 
Modify-Write sequences. The MBC should deassert 
MLOCK 11 regardless of KLOCK 11's value, after the 
write. 


Deassertion of KLOCK 11 by the MBC hardware may 
be required in any Intel486 DX CPU system, to avoid 
bus timeout and starvation of other bus masters when a 
continuous stream of locked accesses occurs in one 
processor's program. Without it, one processor could 
monopolize the bus and prevent re-arbitration. 


CPLOCK# 


CPLOCK 11 has a purpose similar to KLOCK 11 in In- 
tel486 DX CPU systems, but is unused in i860 XP 
CPU systems. PLOCK 11 (Psuedo-LOCK) indicates an 
atomic 8-byte 2-transfer write for floating-point data 
which should not be interrupted. The 4-byte bus of the 
Intel486 DX CPU requires 2 transfers for an 8-byte 
datum, and if only half the transfer gets done before 
another bus master reads memory, half-wrong data 
could be read. 


Thus the MBC should not relinquish the bus nor re- 
quire snoops of its 82495 from the time of the BGT# 
for the first write (when CPLOCK # was asserted by 
82495) through the BGT # of the second write. This 
increases the worst-case delay of writeback for a 82495- 
snoop-hit to a modified line; to avoid the delay, the 
MBC can tie the CPLOCK # [PLOCKENJ pin low to 
disable PLOCK functionality. 


t 


9.0 MORE ALTERNATIVES 


In addition to the options discussed above, several oth- 
er choices affect Memory Bus Controller design. 


M-bus clocking should be chosen to allow future ver- 
sions of 82495 and 82490 at higher clock speeds. Up- 
grading the CPU module performance by replacing the 
processor and 82495/82490 will be possible. While 
some redesign of the CPU-side MBC state machines 
may be needed for faster clocks, the memory bus can 
remain the same. Thus an asynchronous interface with 
either a strobed unclocked M-bus or a clocked M-bus at 
less than 50 MHz is advised. A fully synchronous 
M-bus/CPU MBC would be difficult to move to higher 
clock speed. 


One convenient way to design the MBC is with the 
M-bus MCLK = O.s-CLK. Probably it will be possi- 
ble to keep the M-bus at half the CPU CLK rate, even 
with faster CPUs. The big advantage of this half-speed 
link is that no synchronizers are needed within the 
MBC if the MCLK and CLK edges are skew-con- 
trolled. The MBC can be totally on CLK, as in the 
design example of Appendix B. 
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The choice between a Strobed or Clocked M·bus is of- 
ten determined 
by existing bus protocols in which 
82495/82490 will be used. Most existing buses are 
clocked; however, Futurebus+ 
requires all bus entities 
to use strobed tranfers, but allows an optional clocked 
mode for high-speed packet transfers [Fbus9OJ. The 
tradeoffs are shown in Table 2. 


Line size and M·bus width also determine upgradabil- 
ity to possible future versions of 82490 on the same 
M-bus, with more than 32kB per chip. If a higher-den- 
sity 82490 becomes available, the fact that 82495 has 8k 
tags requires: 


128 data bytes per tag (128 byte line, or sectored 
64-byte lines) 
AND 
8-byte or 16-byte memory bus width • 


to allow a 1 MByte or 2 MByte 82490 configuration. If 
a smaller bus is used, a larger 82490 is possible, but 
the bus-size multiplexing described earlier would be 
needed. 


Writeback (WB) cache policy is advised for high-per- 
formance (multi)processors to limit bus traffic. Howev- 
er, a writethru (WT) design is simpler for the MBC 
because there never is a need to backoff the 82495 due 
to MHITM #. In fact, the snoop window in a WT sys- 
tem becomes unnecessary and SWEND# 
can be acti- 


vated simultaneous with KWEND#. 
In such a system, 


the only states of cache lines are S or I. Snooping has 
no effect during reads and only causes invalidations (in 
the slaves) for writes in a WT design. Cache-to-cache 
transfers and RFO are irrelevant. 


Table 2. Clocked vs. Strobed 
MBUS Tradeoffs 


CLOCKED 
MBUS Advantages 


Design techniques 
for clocked 
systems are well 
known. 


Fast arbitration 
using MCLK state machines. 


Burst transfers 
proceed 
at one datum per MCLK 


CLOCKED 
MBUS Disadvantages 


Must round-up delays to MCLK period quanta EG., 
33 ns delay means two 30 ns MCLKs needed. 


Some 82495-to-82495 
signals must be twice 


synchronized: 
once at sender, once at receiver. 


Backplane 
length limited. 


MCLK skew must be controlled. 


Requires assumptions 
on CLK vS.MCLK speed 


ratio: for example, 
CLK > MCLK > CLK/2. 


STROBED 
MBUS Disadvantages 


MBC design may require delay lines and non- 
conventional 
design techniques. 


Arbitration 
slow because 
signal must be 
synchronized 
at arbiter and at modules. 


Burst throughput 
slowed if each transfer 
requires 


acknowledgement 
from receiver. 


STROBED 
MBUS Advantages 


Delays determined 
by device speed and physics, 


not by MCLK quanta. 


Each signal goes through sychronizer 
once, only at 
receiver, 
so less time is lost at synchronizers. 


Fewer limits on backplane 
length or capacitance 


or number of boards. 


No clock skew worries. 


Any ClK 
frequency 
will work. 
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10.0 
MBC DIFFERENCES 
FOR i860 
XP CPU VERSUS 
Intel486 
OX 
CPU 


The same MBC design can be used for either i860 XP 
CPU or Intel486 DX CPU if the MBC supersets the 
requirements of the two. A "CPU_TYPE" 
configura- 


tion pin can be included in the MBC to modify its be- 
havior. First, make the features as common as possible: 
• Choose a configuration acceptable for both CPUs: 


a) 256 kBytes, 4 transfers/line, 64-bit M-bus, 32-byte 
line. 
b) 512 kBytes, 
4 
transfers/line, 
128-bit M-bus, 


64-byte line. 
c) 256 kBytes, 8 transfers/line, 64-bit M-bus, 64-byte 
line. 


d) 512 kBytes, 
8 transfers/line, 
128-bit M-bus, 


128-byte line. 


• i860 XP CPU-pfld data is cached in 82490--no opti- 


mizations are included for pfld. 


• Assume that LOCK # duration does not matter (lE, 


that 
back-to-back 
LOCK#ed 
requests from In- 
tel486 DX CPUs and long LOCK # cycles in i860 
XP CPU do not cause bus ownership timeout). 


Features Strictly for the Intel486 DX CPU: 
• BE7-4# 
for M-bus must be synthesized by the 
MBC from A2 and BE3-0#. 


• CPLOCK # protection. 
• WRMRST (warm reset) can be included for both 
CPUs, but is optional. 


Features Strictly for the i860 XP CPU: 


• Burst 
writes 
from 
the 
CPU 
(Length = 2 
and 
Length=4). 


• A second 74F377 BE#-latch 
is needed, for i860 XP 
CPU pins BE7#-BE4#, 
LEN, and CACHE#. 


PCYC and CTYP can also be latched for debug pur- 
poses. 


• PCHK # output from i860 XP CPU must be ig- 
nored except during the CLK after BRDY # comes 
from the MBC. PCHK # from Intel486 DX CPU is 
always valid. 


Differences between the MBCs: 


• Configuration pin strapping of 82495 inputs. 
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• Decoding 
CPU 
request 
burst 
length 
from 
CLEN1:0(82495 pins in Intel486 DX CPU systems) 
or LEN and CACHE#(i860 
XP CPU). 


• CPU Line length-16 
bytes vs. 32 bytes (i860 XP 
CPU) means that the Intel486 DX CPU MBC will 
give 2 BRDY#s 
for every 1 BRDY# 
of the i860 
XPCPU 
MBC. 


Differences between Intel486 DX CPU and i860 XP 
CPUs which have no impact on MBC: 


• Intel486 DX CPU FLUSH # input pin. 
• i860 XP CPU writeback caching, HITM #, 
and 
BOFF#. 


• i860 XP CPU CS8 vs. Intel486 DX CPU BS8# , 
BSI6# 
(none are really useable). 


• Intel486 DX CPU RDY # pin and interruptable 
bursts (not useable with 82495). 
• i860 
XP 
CPU 
acknowledges 
HOLD 
during 


LOCK#. 


• EADS# 
duty cycle (50% maximum for i860 XP 


CPU and 100% for Intel486 DX CPU, but handled 
by 82495). 


• KEN# 
pin sampling interval by the CPU. 


• Behavior of CPU in response to BOFF # assertion. 
• i860 XP CPU BERR (Bus ERRor) pin versus In- 


tel486 DX CPU NMI (Non Maskable Interrupt). 


11.0 SUMMARY 


The interface between a CPU/82495/82490 
chip set 
and a system memory bus allows much flexibility and a 
wide range of performance options. The simplest MBC 
can be a few PALs, while a top-performance multipro- 
cessing version may take thousands of gates on an 
ASIC. Signal pin counts for the MBC can range from 
70 to 120, varying with the memory bus definition im- 
plemented by the MBC. 


While beyond the scope of this document, topics for 
consideration include detailed timing diagrams, critical 
path analysis, simulation of bus traffic, and hit rates. 
Useful also are simulations of performance impact of 
the number of CPUs, WB versus WT policy, memory 
latency, CTCT, RFO, and duplicate tags. Also at issue 
are interrupt controller hardware, PAX concurrency 
control, boundary scan and selftest, PC-compatibility- 
implications, i860 XP CPU pfld options, and high- 
speed design issues of impedance, termination, 
and 
noise. 
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Q: 
How 
does PAX 
multiprocessing 
work with 
82495 and an MBe? 
ANS: A CCU chip must be included on the M-bus side 
of 82495 and 82490 for each i860 XP CPU in a 
PAX multiprocessor. Refer to [MPIC90]. 


Q: 
Can the i860 XR CPU use a 82495/82490 
cache? 
ANS: No, 
the bus protocol 
of 82495 and 
82490 
matches Intel486 DX CPU and i860 XP CPUs, 
but not i860 XR CPU. 


Q: 
Can 2 CPUs plug into one 82495, getting effi- 
ciency from shared cache? 
ANS: No, the protocol and physical capacitance of the 
interface do not allow it. 


Q: 
Should the same MBC be used for Uni & Multi? 
(i.e., how much extra logic is added to make a 
multiprocessor MBe?) 
ANS: It is possible, and the extra logic is reasonable 
for a Uni which could be upgraded to multi by 
adding another CPU + cache module. 


Q: 
Are software models of 82495/82490 available 
for simulation of MBCs? What simulators are 
supported? 
ANS: As of September 1990, beta versions of models 


will be available Q4 1990from Silicon West, Inc. 
Phone 
= 
(213)597-5995, FAX 
= 
(213)494- 
4588. Contact Silicon West for-information on 
simulators supported (currently Workview, Ver- 
ilog, Zycad VHDL, Mentor Graphics). 


Q: 
What is the fastest possible transfer of data from 
Mdata to Cdata? (i.e., how many CPU elks are 
spent?) 
ANS: The initial timings are listed in [82495/490DS]. 


They are about 1.5 CLK periods including set- 
up-time at the CPU data pins. The connection 
from CDATA to MDATA is essentially a flow- 
through path. 
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Q: 
What are pin-counts for an MBC (i.e., will it fit 
in my ASIC)? 
ANS: 70 to 120 signal pins, depending on the bus pro- 
tocol and MBC features. 


. Q: 
How long is a reasonable cacheability window, 
in MCLKs? 
ANS: KWENDI< 
is activated when MKENI< 
and 
MROI< are stable. MKENI< and MROI< can 
come from address decoders in the MBC or on 
the MBUS. Thus KWENDI< could be 2 CLKs 
after CADS I< if the MBC itself determines 
cacheability, or as much as 5 MCLKs if the M- 
bus 
must 
see 
the 
request 
and 
determine 


MKENI<. 


Q: 
How long is a reasonable snooping window, in 
CLKs? 
ANS: MWB/WTI< 
and 
DRCTMI< 
are 
generated 
from the snoopers' MTHIT I< and MHITM I< 
signals. Thus SWEND I<is activated when those 
signals (MWB/WTI<, 
DRCTMI<) 
are stable. 


That would be at least 7 CLKs, not counting the 
possible delay between CADSI< and its M-bus 
counterpart 
MADS 1<. (see the discussion of 


snoop window above). 


Q: 
Is the SWEND I< window length deterministic, 
or must SNPBSY I< determine it? 


ANS: It is deterministic, but may be long when the 
82495 is busy. Yes, the SNPCYCI<· signal is re- 
quired to determine SWENDI<. If SNPCYCI< 
is not used, then the worst-case 82495 delay 
must be imbedded into the MBC logic, making 
the window longer than necessary most of the 
time. 


Q: 
How long can 
82495 be "busy", 
activating 
SNPBSY # and ignoring subsequent SNPSTB # 
activations? 
ANS: 82495 busy-ness is not due to CPU requests, be- 
cause 82495 gives higher priority to the snoops. 
But for snoops to M-state 82495 lines, 82495 
must do inquiries to the i86QXP CPU and get 
the more-recently modified data from i860 XP 
CPU before 82495 can writeback. A 82495 con- 
nected to an Intel486 DX CPU does not need to 
get modified data, as the Intel486 DX CPU has 
only S-state lines in the CPU cache. However, if 
SNPINV was active, 82495 must back-invali- 
date either CPU for S, E, or M state lines. The 
82495 must do multiple inquires or invalidates 
when the line ratio is 2 or 4. 
Q: 
What is the synchronization penalty in snooping 
(ie, how long from M-bus request to MHITM # 
validity)? 


ANS: About 3 CLKs. See the discussion of "snoop 
window" above. 


Q: 
What 
is 
optimal 
82495 
cache-line 
length 
(32,64,128)? 
ANS: This is TBD from simulations or measurements. 
It depends on the behavior of SW applications 
the HW is intended for. 
Q: 
Can Futurebus+ 
be used as the M-bus for a 
82495/82490 system? 
ANS: Yes. The Futurebus + spec is compatible with 
the 82495/82490. 
If supports 
MESI, strobed 
data transfer, address pipelining, cache to cache 
transfers, Read For Ownership, and many other 
features. 82490 would be used in strobed mode 
for Futurebus + . 
Q: 
Can 82495 do a split-transaction bus (if not, why 
not?)? 
ANS: Maybe. 82495 implements a restricted-backoff 
protocol to eliminate potential deadlock condi- 
tions in a shared bus multiprocessor environ- 
ment. Because of that protocol, and the fact that 
82495 will not 
snoop between 
BGT # 
and 
SWEND #, 
it is difficult to implement split 
transactions. It may be possible, using an addi- 
tional set of tags which replicate 82495's and 
allow snoops to continue between BGT # and 
SWEND#. 
Q: 
Can another 82495 be used for the "duplicate 
tags" for split transaction snooping? 
ANS: No, the 82495 signal definitions and protocols 
make that very difficult. 
Q: 
Why do the KWEND# 
and SWEND# 
signals 
exist? 
ANS: SWEND#, 
by gating 82490-to-CPU-data-trans- 
fer, allows the M-bus data transfer simultaneous 
with snooping. In the usual case, no modified 
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copy will be found by the snoopers, so that 
transfer was not wasted. The alternative (that 
data cannot be transfered from memory until 
snoops complete) costs performance or requires 
a central tag directory. SWEND# 
triggers the 
82495 to update its tags. 
KWEND # allows a variety of cacheability de- 
termination schemes-a 
long delay to determine 
MKEN # and MRO # might be needed if a pro- 
grammable RAM or EEPROM decodes cachea- 
bility based on address. If not, KWEND# 
can 
be activated quickly if there is a local MBC de- 
code of A31:A28 to determine MKEN#, 
for ex- 
ample. 
Why not just one WEND signal? 
Performance. 
KWEND# 
can be determined 
quicker than line-status in most implementa- 
tions. The early knowledge of cacheability to the 
82495 allows it to begin line replacements and 
allocations, and activate the next CADS # to 
MBC. 
Q: 
How to connect 8-bit (or 16-bit) devices such as 
ROM and serial ports to 82490? 
ANS: If the devices are made non-cacheable, they can 
be tied to the MDATA pins of the least-signifi- 
cant 82490s. However, if fetches from them 
must be cacheable, then byte assembly logic 
(latching transceivers) must exist to allow 82490 
to transfer from them 4 or 8 bytes at a time 
(I M-bus width per transfer). 82495 and 82490 
require all cacheable locations to do burst trans- 
fers an M-bus-width of data per transfer. 


Q: 
Does the 82495 have a CS8 mode? Does 82495 
support i860 XP CPU in CS8 mode? 
ANS: To support i860 XP CPU CS8 mode with 82495, 
the 8-bit ROM must be marked non-cacheable. 
This means that code being fetched in CS8 mode 
won't be cacheable in the 82495 or the i860 XP 
CPU. For an 8-byte M-bus, the ROM data pins 
must be wired to the M-bus (MDATA of 82490) 
bits 7:0. For a 16-byte M-bus, the ROM must 
attach to M-bus bits 7:0 AND bits 71:64, which 
would require an 8- bit transceiver at the ROM. 
Q: 
Should the DRAM 
controller be part of the 
MBC? 


Q: 
ANS: 
•• 


ANS: For a simple uniprocessor, perhaps. Multipro- 
cessors would have a DRAM 
controller 
for 
(each bank of) main memory, separate from the 
MBCs. 


Q: 
How can the system implement retry upon an 
M-bus parity error? 
ANS: The MBC must re-issue the initial request, and 
reset the 82490 transfer logic using the MSEL# 
signal.. 
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Q: 
ANS: 
Can 82490 use an ECC corrected-bus? 
ECC (Error Correcting Code) can be used on 
the main memory bus, but the ECC check bits 
must be converted to parity or discarded before 
feeding the 82490. ECC would have to be gener- 
ated at the 82490 MDA TA pins for writes to 
memory. 
Can the MBC implement cache-to-cache trans- 
fer on a write? 
No, the 82490 cannot "snarf" write data. That 
is, it does not merge a write (partial line) from 
the M-bus with existing cached lines. It can do 
Read-For-Ownership, 
merging write-miss data 
with an incoming line writeback from another 
cache. 
Can semaphores be cached in 82495/8249O? 
Yes, but all read/writes 
which are locked are 
forced onto M-bus. So the semaphore would be 
read 
repeatedly 
without 
locking, 
until 
it is 
"free". Then SW would re-read it in locked fash- 
ion to obtain ownership. 
Is there any advantage to making semaphores 
cacheable, if all locked accesses go to M-bus? 
Yes, SW can repeatedly read the semaphore 
without LOCKing it, and no bus traffic thus is 
generated, waiting for the release of the sema- 
phore by any other master. 
Can a single multiplexed address + data bus (like 
Multibus-II) be used for M-bus? 
Yes, but transceivers external to the 82495 and 
82490 are required. 
How does the MBC implement a "BACKOFF" 
when another 82495 activates MHITM#? 
If the data requested from a master 82495 is 
Modified in a snooper 82495, the master BC 
must postpone CRDY # until the modified line 
is deposited 
in the master 
82490, after the 
snooper flushes the modified line to M-bus. 
Can MBC duplicate the CPU cache tags, to 
avoid unnecessary inquire cycles? 
Yes, but the performance benefit may not war- 
rant the extra hardware. 


Q: 


ANS: 


Q: 
ANS: 


Q: 


ANS: 


Q: 


ANS: 


Q: 


ANS: 


Q: 


ANS: 


2-528 


Q: 
Can i860 XP CPU Late-Backoff mode be used 
with 82495? 


ANS: No. 


Q: 
What are the advantages and disadvantages of 
doing an asynchronous system (where MCLK is 
not the same as CLK)? 
ANS: Designers can easily upgrade the CPU side to 
higher frequencies (above 50 MHz) by faster 
PLDs in the CPU side of the MBC. The M-bus 
interface and all modules on the M-bus will not 
need to be changed. It easier to design a board 
when most parts run at a lower frequency. 


Q: 
If the 82490 is reading information from the 
memory 
bus 
and 
the 
MBC 
is 
generating 
BRDY#'s 
(RDYSRC= 
I), can the MBC abort 
the cycle by giving a premature CRDY #, and 
restart it? . 


ANS: The MBC can abort a memory bus cycle but 
cannot abort a CPUbus cycle. Once the first 
BRDY # is generated the cycle must complete. 
On the memory bus, a cycle is not aborted by 
giving an early CRDY #. In fact the 82495 does 
not understand that a cycle has been aborted. 
Only the MBC and 82490 are involved. The 
82490 allows its buffer to be reset using the 
MSEL# 
signal. 


Q: 
Whlj.tis the purpose of 82490 having a separate 
MOCLK 
for output data, in addition to the 


MCLK for input signals? 
ANS: MOCLK 
allows greater hold time for writes 


from 82495, if it is skewed slightly from the 
MCLK which M-bus receivers use. MOCLK 
and MCLK must be exactly the same frequency. 
If the skew is not needed, MOCLK can be tied 
low. 


Q: 
How many levels of pipelining can the 82495 use 
on the external memory bus? 
ANS: Each 82495 can use one level of pipeline on the 


memory bus, so the bus pipe depth can be great- 
er in a multiprocessor. A uniprocessor allows 
just one level of M-bus pipeline. 


intel~ 


APPENDIX B: 
Intel486 OX CPU Uniprocessor MBC Design 


Please refer to Application Note AP-458, Designing 
a 
Memory 
Bus Controller for a 50 MHz 
Inte/486 
DX Mi- 
croprocessor Based System. 
(Intel order #241166). 
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memory bus utiuzmg a bus protocol similar to that of 
the i860 XP CPU. 


.er systems based on this design. 


The memory bus is 64 bits data plus 8 bits parity. 


The design presented here features an i860 XP CPU 
and 256 KB of 82495/82490 cache running at 50 MHz 
in each core. The clocked 64 bit (+ 8 parity) memory 
bus is asynchronous to the CPU and cache clock, al- 
lowing memory to run at lower speeds for more eco- 
nomical and convenient memory design. The MBC fea- 
tures snooping and pipelining to the memory, as well as 
advanced 82495 processes like write allocation, read for 
ownership and cache-to-cache transfers. 


The MBC design allows the processor to run at a high- 
er clock frequency than the memory bus. The frequen- 
cies are constrained such that the ratio of the frequency 
of the processor CLK and the frequency of the memory 
bus MCLK is between I and 2: 
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Figure C-1. Pinout Environment of MBC 
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This constraint ensures proper synchronization of sig- 
nals which cross between the MCLK portion of the 
MBC and the CLK portion. The prototype was de- 
signed and simulated with a CPU speed of 50 MHz and 
a memory bus speed of 33 MHz. 


Snooping mode can be independently set to strobed, or 
clocked in each core. 


The main memory is responsible for returning 
the 
MKEN 11 attribute to the memory bus controller in the 
MCLK following MADSII assertion. 


To save synchronization clocks, the MBRDY 11 signal 
of the protocol is defined to be asserted one MCLK 
before data is actually available. 


. The 82495 operates with 32 bytes/line, 
I line/sector, 
and requires 4 memory bus transfers per line fill. 


OPTIONS 


With modifications the 82495 can operate in a mode 
with 64 bytes/line, 
I line/sector, requiring 8 memory 
bus transfers per line fill. 


The design here utilizes the 82490's clocked memory 
bus mode. The strobed mode can also be utilized by 
making modificaVon to the design. 


Support for various 82495 PFLD modes can be added 
to the design. 


Operation with either write-through or write-once pro- 
tocol can be performed. 


MEMORY 
BUS PROTOCOL 


M-bus Signals 


The system M-bus resembles the i860 XP CPU bus. It 
allows CPU modules with or without external cache on 
the same M-bus, so that balance between high perform- 
ance and low cost can be achieved. The signal specifica- 
tions below indicate Input (I), Output (0), or bidirec- 
tional (I/O) 
from the MBC's point of view. Output 
signals to the memory bus such as MADSII, MLEN, 
and MA31:MA3 are floated by all MBCs except the 
one currently owning the M-bus. 


Signals whose names begin with Y (as in YBGTII) are 
in the MCLK side of the MBC, while an X prefixed 
name is in the CPU CLK side of the MBC. The X and 
Y signals are internal to the MBC. 
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MRESET 
(I) - Memory 
bus RESET 


This signal forces the CPU to begin execution in a 
known state. It resets all MBC machines which are 
driven by MCLK. It is also synchronized (via a 2-stage 
synchronizer) to CLK and fed to the RESET inputs of 
the CPU, 82495, 82490s and all MBC machines which 
are driven by CLK. 


MADSII 
(1/0) 
- Memory 
bus ADdress 
Strobe 


This signal indicates that a new valid bus cycle is cur- 
rently being driven. The cycle address (A31:A3) and 
cycle specifications 
are 
valid 
in the 
MCLK 
that 
MADSII is asserted. A pipelined MADSII 
will be is- 
sued only after the MBC knows that the current cycle 
is guaranteed not to be aborted. For most memory ac- 
cesses, the master will assert MSNPSTB 11 to snoop 
other caches on the bus. When MSNPSTB 11 has been 
asserted, MNAII 
will cause a new MADSII to be is- 
sued after MSWENDIII 
signifies snooping has com- 
pleted. Furthermore, 
if MHITMIII 
was asserted with 
MSWENDIII 
in this case, the new MADSII cannot be 
issued until after the current cycle (now a snoop write- 
back) has been completed. When MHITMIII 
is not 
asserted with MSWENDIII, 
MADSII 
can be asserted 
immediately following MSWENDIII. 
If MSNPSTBII 
was not asserted for the current cycle, then MADSII 
could be issued immediately after MNA 11, without 
waiting for MSWENDI 11 . 


For read cycles MADSII is issued after CADSII, re- 
gardless of CDTS 11 state. Requesting the memory bus, 
via MBREQ, is also done immediately after CADS 11 . 
This is due to the fact that CDTS 11 in a read cycle does 
not affect the memory bus, but indicates when the first 
BRDYII can be issued to the CPU. 


For 
memory 
writes 
MADSII 
is issued only after 
CDTS 11. Requesting the memory bus, via MBREQ, is 
also done after CDTS 11. This guarantees that for write 
cycles the memory bus data is valid I MCLK after 
MADSII (similar to the CPU). 


MNA 11 (I) - Memory 
bus Next Address 
Acknowledgement 


This is the memory bus next address signal, driven by 
the memory controller. It indicates to the MBC that 
the memory bus is ready to accept a new bus cycle, 
although the previous one has not been completed yet. 
If the MBC has a new cycle pending and the current 
cycle is guaranteed not to be aborted (see MADS 11 
above), then a new MADSII will be issued. Note that 
the maximum level of pipelining on the memory bus is 
I. 
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MBRDY# (1/0) 
- Memory bus Burst ReaDY# 


This is the burst 
ready signal. For 
read cycles, 
MBRDY # indicates that in the following MCLK the 
memory bus will present valid data on the 82490 
MDATA pins. For writes, MBRDY# indicates that in 
the following MCLK the memory bus will accept the 
data from the 82490 MDATA pins. Note that this sig- 
nal is active I MCLK before the data is availableon the 
memory data bus. This reduces the synchronization 
penalty between the M-bus and CPUbus by I MCLK 
period. 


For a clocked-asynchronous MBC, MBRDY # is de- 
layed by the MBC I MCLK and passed to the 82490 
MBRDY # pin. For a strobed-asynchronous MBC, the 
82490 MISTB and MOSTB will change value in re- 
sponse to MBRDY #. 


For Cache to Cache Transfers, the MBC with the Mod- 
ified line drives MBRDY # active once per MCLK 
without wait states for the duration of the line burst. 


MSNPSTB# (1/0)· 
Memory bus SNPSTB# 


This is the memory bus snoop strobe signal. It is assert- 
ed I MCLK after MADS# by the MBC which asserted 
MADS#, 
for all cycles that could be M-state in the 
other 
MBC. 
In 
writebacks 
and 
I/O 
cycles, 
MSNPSTB# is not asserted. The MSNPSTB# output 
of each MBC is connected to the 82495 SNPSTB# in- 
put of the other MBC, in this two processor design. 


MSWENDO# (0)· 
Memory bus SWEND# 
Output 


This is the memory bus snoop window end indication 
which is driven by the snooping MBe. It is connected 
to the master MBC's SWENDI # input, indicating that 
snooping is finished and the snoop attributes are valid. 


MSWENDO# 
is an asynchronous signal which is 
triggered by the 82495 SNPCYC# 
falling edge, and 
is 
negated 
after 
sampling 
an 
active SNPSTB#. 
MSWENDO# of one MBC is connected directly to the 
MSWENDI # input of the other MBC. 


MSWENDI# (I)· 
Memory bus SWEND# Input 


MSWENDI # is connected directly to the other core's 
MSWENDO# output. It is internally sent to two syn- 
chronizers: synchronized to CLK to generate 82495 
SWEND#, and synchronized to MCLK for MBC state 
machines which determine whether the current bus cy- 
cle should be aborted. 
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MSWENDI # indicates the end of the snoop window 
and that the snoop results MHITMO# and MTHIT# 
are valid. An active MHITMI # indicates a snoop hit 
to a modified line, and causes the master MBC to dis- 
card any data which has arrived from main memory, so 
that new data, which is being written out as the snoop- 
ing core performs a snoop write back, can be accepted. 
MTHIT # 
of 
each 
core 
is 
connected 
to 
the 
MWB/WT # input of the other core, to generate the 
WB/WT # signal to the 82495. 


MHITMO# (0) • Memory bus HITM# Output 


This indicates a snoop hit to a modified line. In the two 
processor implementation of this MBC, it is connected 
directly to the other MBC's MHITMI # input. 


MHITMI# (I)· 
Memory bus HITM# Input 


MHITMI# 
is connected to the MHITMO# output of 
the other MBC, and determines if MBOFF# 
and 
MABORT# 
will be asserted. 
It 
is sampled on 
MSWENDI # activation. 


MTHIT # (0) . Memory bus Snoop Hit Indication 


This snoop hit indication is based on the 82495 
MTHIT # output. The MTHIT # ouput of the snoop- 
ing core is used by the master core to determine the 
WB/WT # 
state for the accessed line. The 82495 
MTHIT # signal is passed directly onto the memory 
bus when the SNPINV signal is inactive for the snoop. 
On snoops with SNPINV active, the memory bus 
MTHIT # line is driven low, regardless of the value at 
the 82495 MTHIT # pin. 


The MTHIT # signals from the memory bus control- 
lers on the bus are wire-anded together. Because the 
82495 MTHIT # output only changes state with each 
new snoop, the master memory bus controller must 
float its MTHIT # . 


MBOFF# (0) • Memory bus BOFF# 


This is the memory bus back-off signal which is driven 
by the master MBC. The master MBC floats its bus 
concurrent 
with MBOFF# 
activation. When 
the 
snooper MBC samples an active MBOFF# and it has a 
pending snoop write-back cycle, it issues the cycle to 
the memory bus. Note that the snooper issues the cycle 
even though it is still in a bus hold state (MHLDA 
asserted). If MHITMI # 
is sampled active during 
MSWENDI # and the previous cycle has completed, 
then MBOFF# 
will be asserted immediately after 


MSWENDI #. If the previous cycle has not completed 
and the pipelined cycle hits a modified line, then 
MBOFF# 
will be asserted only after the previous cycle 
completes. The snooping MBC floats its bus only after 
the snoop write-back cycle has completed. Note that 
from the arbiter's viewpoint the bus is still granted to 
the master MBC. 


MABORT # (0) - Memory bus Abort 


This is the memory bus abortion signal which is driven 
by the master MBC. When the main memory samples 
an active MABORT# 
it aborts any cycle that is cur- 
rently being serviced. The memory aborts the cycle re- 
gardless of the number of MBRDYs that have been 
issued. Thus MBRDY # of the aborted cycle will not 
be issued after MABORT#. 
A new cycle could be serv- 
iced immediately after MABORT#. 


If MHITMI # is sampled active during MSWENDI # 
and the previous cycle has been completed, then MA- 
BORT# 
is asserted immediately after MSWENDI#. 
If the previous cycle has not been completed and the 
pipelined cycle hits a modified line, then MABORT# 
is asserted only after the current cycle has completed. 


MABORT# 
can also be asserted during read for own- 
ership with a hidden write (allocation after a non-com- 
pleted write in the main memory). In this case if the 
master MBC samples an active MKEN# 
(1 MCLK 
after MADS #) during a potentially allocatable write 
cycle, 'it asserts 
MABORT# 
immediately, 
i.e. 
2 
MCLKs after MADS#. 


Note that MABORT# 
is always guaranteed to be a 1 
MCLK width pulse. " 


MLOCK # (I/O) - Memory bus LOCK 


This signal does not exist in the current implementa- 
tion. Instead, the MBC simply refuses to give up the 
M-bus to the arbiter when it is running locked accesses. 


MHOLD (I) - Memory bus Hold Request 


When this input to the MBC is asserted, the MBC as- 
serts MHLDA and floats all inputs and outputs except 
MBREQ, MHLDA, MSWENDO#, 
and MBOFF#. 
If 
the 
MBC 
has outstanding 
bus cycles in progress 
(MADS# 
has been asserted), they are completed be- 
fore the MBC relinquishes the bus. MHOLD is recog- 
nized during MRESET assertion. 
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MHLDA (0) - Memory bus Hold Acknowledge 


The memory bus hold acknowledge signal goes active 
when an MBC relinquishes the bus in response to an 
MHOLD request. The memory bus controller floats its 
bus in the same MCLK that it issues the MHLDA. 
When the MBC leaves bus hold, MHLDA is negated 
and the core resumes driving the bus. If a cycle is pend- 
ing when leaving bus hold, the MADS# 
will be issued 
in the same MCLK that MHLDA is negated. 


MINT (I) - Memory bus Interrupt 


This interrupt signal is connected directly to the i860 
XP CPU in the core. 


MKEN# (I) - Memory bus KEN# • 


This is the memory bus cache enable signal. It is used 
by the MBC to determine the length of the current bus 
cycle, and is also connected directly to the 82495 
MKEN # input. 


In potentially cacheable read cycles, it determines cycle 
length. In potentially allocatable write cycles, it deter- 
mines whether read for ownership with hidden write 
will be performed. 


In the current implementation, MKEN # must be driv- 
en by the memory controller 
in the MCLK 
after 
MADS# 
was issued. 


MRO# (I) - Memory bus Read Only 


Assertion of this signal causes an access to be treated as 
read only by the core. This signal is connected directly 
to the 82495 MRO# 
input, as well as to the MBC. 


MWB/WT # (I) - Memory bus WB/WT # 


This is the write-back/write-through. 
input connected 
to the memory bus. It is connected through MBC logic 
to the 82495 MWB/WT # input. 


MDRCTM (I) - Memory bus Dlrect-to-M 


This is the memory bus DRCTM # signal which forces 
a line entering the cache to be placed directly in the 
[M) (modified) state. In addition to this signal which is 
connected from the memory bus to the 82495, the MBC 
can interually drive the 82495's DRCTM # pin during 
read-for-ownership cycles. 
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MFLUSH#, MSYNC# (I) • Memory bus 
FLUSH#, SYNC# 


These signals cause the core to flush or sync its cache, 
by asserting FLUSH# 
or SYNC# 
to the 82495, re- 
spectively. The signals are driven by the main memory 
controller upon detecting a Core flush or sync corn. 
mand, which consists ofa 
special cycle with either 
MBE 1# or MBE3 # active, respectively. 


MBREQ (0) • Memory bus Request 


The MBREQ# 
signal is asserted by an MBC to indi- 
cate to the memory bus arbiter that the MBC needs the 
memory bus. An MBC will generate this signal regard- 
less of whether or not the MBC is currently driving the 
bus. 


MBREQ # is not issued for snoop write-back cycles. If 
the snooping core already had its MBREQ# 
pin assert- 
ed, the pending cycle which caused the MBREQ# 
is 
aborted by the snoop write-back, according to 82495 
protocol. The MBC state machines of the snooper, 
however, continue to assert MBREQ# 
until an internal 
time-out period has elapsed, allowing the snooping 
82495 to reissue the aborted cycle after the snoop write- 
back has completed. Therefore a core which is waiting 
for the bus can service a snoop write-back without los- 
ing its request for the bus. 


MLEN (0) • Memory bus LEN 


This signal together with MCACHE#, 
MW/R# 
and 
MKEN # determine the memory bus cycle length ac- 
cording to the following table: 


MW/R# 
MLEN 
MCACHE# 
MKEN# 
length 
Notes 


x 
0 
1 
x 
1 
1 
x 
1 
1 
x 
2 
1 
0 
0 
0 
1 
1 
2 
0 
1 
0 
1 
2 
2 
0 
x 
0 
0 
4 
1 
x 
0 
x 
4 


NOTES: 
1. Locked 
i860 XP CPU write-back 
cycles 
(length=4), 
caused 
by the i860 XP CPU executing 
a FLUSH 
instruction 
during 
a 
LOCKed 
sequence, 
are treated 
as normal 
write cycles 
(length = 1 or 2 according 
to LEN). This is allowed 
since i860 XP CPU 
write-back 
cycles 
always 
access 
a 82495 
modified 
line (in [M] 
state) 
and are only written 
into the 82490, 
without 
updating 
memory. 
2. MKEN# 
must be driven valid the clock 
following 
MADS# 
by the memory 
controller. 


MMI/O#, MD/C# (0) • Memory bus 1/0# and 
D/C# 


These signals, together with MW/R #, define the mem- 
ory bus cycle, according to the i860 XP CPU Data 
Sheet. 
They 
are 
driven 
in 
the 
same 
MCLK 
as 
MADS#. 


MBE[7:0]# (0) • Memory bus BE[7:0]# 


The byte enable signals to the memory bus identify 
which bytes are being accessed. They are identical to 
the CPU byte enables on CPU generated cycles. For 
82495 generated cycles (write-backs and allocations) all 
MBE # s are asserted. 


MCACHE# (I/O)· 
Memory bus CACHE# 


In a master core MCACHE# 
is an output; in a snoop- 
ing core it is an input. As an output, it indicates poten- 
tially 
cacheable 
reads 
or 
a 
82495 
write-back. 
MCACHE# 
is used by the system memory together 
with MLEN, MW/R # and MKEN # to determine cy- 
cle length. As an input, MCACHE# 
is connected to 
the 82495 SNPNCA pin. 
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MW/Ff# (I/O) • Memory bus W/R# ' 


This signal is an output for a master core, an input for a 
snooping core. As an output, it indicates whether the 
memory access is a read ar a write, and is used by the 
system memory along with MMI/O# 
and MD/C# 
to 
determine the cycle type, according to the i860 XP 
CPU Data Sheet. As an input, the signal is connected 
directly to the 82495 SNPINV pin. 


MA[31:3] (I/O) • Memory bus Address 


These are the memory bus address lines of the MBC. 
Along with the byte enable signals, they define the 
physical area of memory or I/O accesses. In a master 
MBC they are driven by the 82495 onto the memory 
bus together with MADS# 
(same MCLK). In a snoop- 
ing MBC, these lines are inputs to the 82495 which are 
latched by the MSNPSTB # signal. 
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MD[63:0], MDP[7:0] (I/O) • Memory bus Data 
and Data Parity 


64 bits of data, 8 bits of parity, connected through 
transceivers to the i860 XP CPU and 82490s. When an 
MBC does not own the bus, these pins are tristated. 


XAS# /XSA$# 
• X Unit Address Strobe 


XAS# is generated in the X-unit (sync to CLK), and is 
synchronized and sent to the Y-unit as XSAS#. 


XAS # indicates the start of a memory bus cycle from 
the X-unit (CLK side). XAS# is generated as a result 
of a CADS # 
from the 82495 on a read cycle or 
CDTS# 
from the 82495 on a write cycle. XAS# 
is 
held active until the X-unit receives YSBGT#. 


YBGT# /YSBGT # • Memory bus Guarent~ed 
Transfer 


YBGT# 
is generated in the Y-unit, and is synchroniz- 
ed and sent as YSBGT# 
to the X-unit. 


This signal is generated in the Y-unit after MADS# 
(the cycle has been issued on the memory bus). When 
YSBGT # arrives at the X-unit, the signal causes asser- 
tion of the 82495's BGT # input, and one clock later 
(non-pipelined 
cycle) the 
assertion 
of KWEND #. 
YSBGT # of a pipelined cycle (which is sampled during 
the initial cycle, i.e. before its CRDY#) 
causes the 
BGT# 
and KWEND# 
of the pipelined cycle to be 
issued immediately after CRDY # of the initial cycle. 


YBGT# 
of a I?ipelined cycle cannot be issued before 
the MSWEND # of the previous cycle. This is guaran- 
teed by the M-bus protocol, which ensures that a pipe- 
lined 
MADS # 
is 
not 
issued 
until 
after 
the 
MSWEND# 
of the previous cycle. 


BGT#, KWEND# (0)· Bus Guaranteed 
Transfer, Cache Window End to 82495 


BGT # and KWEND # are generated for every cycle 
(including snoop write-backs). In a non-pipelined cycle 
BGT# 
is issued immediately after sampling YSBGT# 
active, and KWEND# 
is issued 1 clock later. In pipe- 
lined 
cycles, 
these 
signals 
are 
asserted 
after 
the 
CRDY # of the initial cycle. 


I~OO~IJ..DIMlDOO~OOW 
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YMEOC#/YSMEOC# 
• (0) MBC Memory End 
Of Cycle 


YMEOC# 
is generated in the Y unit, and is synchro- 
nous to MCLK, and sent to the X-unit as YSMEOC# 
. 


It indicates the M-bus transfer has finished, based on 
the MBC's tranfer length count. YMEOC# 
directly 
drives the 82490s' MEOC# 
inputs. YSMEOC# 
causes 
generation of the CRDY # signal to the 82495 and 
82490s. For non-pipelined cycles CRDY # is issued im- 
mediately after an active YSMEOC# 
(if CDTS# 
was 
issued). For pipelined cycles CRDY # is issued after 
the CRDY# 
of the previous cycle (if'YMEOC#, 
CDTS# 
of the pipelined cycle were issued). 


YMEOC# 
is issued at least 2 MCLKs after YBGT# 
(for every cycle). 


YCEOC#/YSCEOC# 
• MBC CPU End Of Cycle 


This signal is internal to the MBC: YCEOC is generat- 
ed synchronous to MCLK, 
and is synchronized 
to 
CLK to produce YSCEOC#. 
It indicates that the 
CPUbus transfer has finished, based on the MBC's 
tranfer length count. It generates the BRDY#s 
to the 
82495, 82490, CPU, and to other MBC machines. For 
non-pipelined cycles all BRDY#s 
except the first are 
issued 
immediately 
after 
an 
active 
YCEOC# 
(if 
CDTS# was issued). For pipelined cycles all BRDY#s 
except the first are issued after the CRDY # or the last 
BRDY# 
(BRDY# 
• CLENl) of the previous cycle. 


YCEOC# 
can be issued before, with, or 1 clock after 
YMEOC#. 
When the line ratio is 2 or 4, YCEOC# 
precedes YMEOC# 
by a significant time, allowing 
CPU linefills to complete long before the M-bus tranfer 
completes. 


YCEOC# 
is asserted 
only if RDYSRC 
is active 
(High). 
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The i860 XP CPU microprocessor issues a read cycle in 
CLK 0, as indicated by the assertion of ADS #. The 
82495 performs the tag lookup, and finds the request a 
cache miss. In CLK 2, the 82495 issues CADS# 
and 
the cycle control signals, alerting the memory bus con- 
troller that a 4 transfer 82495 read is requested. 


The X side state machines, which run on the processor 
CLK, issue an XAS# on the CLK after CADS# for a 
82495 read cycle (CW/R# 
= 0). The XAS# 
signal 
passes through the synchronizer running on MCLK to 
become synchronized in two MCLKs. The synchroniz- 
ed XAS # signal, called XSAS#, is sent to the Y side of 
the MBC in MCLK 4. 


In MCLK 
5, XSAS# 
has initiated the assertion of 
MBREQ to request the memory bus from the memory 
bus arbiter. If the bus is already owned (or once it is 
owned) by this MBC, XSAS# causes the assertion of 
MADS# 
to the memory bus, MAOE# 
to the 82495, 
and the internal YBGT # signal. The assertion of the 
82495's MAOE signal allows the 82495 to drive its ad- 
dress lines to the memory bus. YBGT # indicates that 
the memory bus is owned by this MBC, and is sent to 
the synchronizer for the X side of the MBC as well as 
many Y side state machines. 


On the Y side, YBGT# is used to deassert MBREQ#, 
to 
sample 
YALLOC# 
on writes, 
and 
to 
initiate 
MSNPSTB#. 
MSNPSTB# 
is asserted in MCLK 6 to 
request a snoop in the other MBC. YBGT # is also 
synchronized 
to CLK, 
appearing as YSBGT #, 
by 
CLK 9. YSBGT# causes the assertion ofBGT# 
to the 
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ing MCLKs. The YMBRDY # output of the MBC is 
the MBRDY # signal delayed one clock, and drives the 
MBRDY # input on the 82490s to read in the incoming 
data. 


While the data transfer is occurring, the second memo- 
ry bus controller responds to the snoop request for this 
memory access in MCLK 8. Because the data is not 
present in the cache of the other core, that MBC will 
assert its MSWENDO# 
output 
with 
MHITMO# 
driven high. These outputs of the snooping core are tied 
directly to the MSWENDI # and MHITMI # inputs, 
respectively, of the master core in this two core imple- 
mentation. Both of these signals are passed to the 82495 
(MSWENDI # is synchronized first) as well as to the 
state machines of both sides of the MBC. The arrival of 
these signals allow the core to accept the data as valid, 
and conclude with the read operation when all of the 
data has been transferred. 


The arrival of the fourth MBRDY # 
generates the 
YMEOC# 
and 
YCEOC# 
signals in 
MCLK 
10. 


YMEOC# 
drives the MEOC# 
input on the 82490s. In 
addition, both signals are synchronized and sent to the 
X side of the MBC. Upon the arrival of YSCEOC#, 
the X state machines begin generating BRDY#s 
to the 
i860 XP CPU. Upon arrival of YSMEOC#, 
CRDY# 
is driven to the 82495, indicating the end of the cycle. 
YMEOC# 
and YCEOC# 
are used to reset many of 
the Y side state machines, including cycle type and 
length indicators, and the drivers of 82490 signals such 
as YMALE # and YMSEL #. On the X side, the reset 
functions 
are triggered 
by CRDY # 
and 
the 
last 
BRDY#. 
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Figure C-2. Non-Aborted Read Cycles 
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Figure C-2. Non-Aborted 
Read Cycles (Continued) 
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Aborted 
Non-Pipelined 
Cycles 


Figure C-3 illustrates an aborted non-pipelined cycle. 
MHITMI # 
is sampled active during MSWENDI # 


(clock 4) indicating a snoop hit to a modified line. Since 
the cycle is non-pipelined, MABORT# 
is issued imme- 
diately and the core floats its bus (clock 5). Although 
the bus is floated by the master core, the master still 
owns the bus (MHLDA remains inactive). 


MABORT# 
in clock 5 causes the main memory to 
abort its cycle regardless the number of MBRDYs that 
have been issued. MBOFF # is also asserted in clock 5 
to indicate to the snooping core that the master is float- 
ing its signals and the write-back may begin. The main 
memory floats its data bus in clock 6 in response to 
MABORT#. 
In the following clocks a snoop write- 
back cycle is performed by the snooper. The snooper 
will release the bus at the end of the write-back. 


Note that 
MSNPSTB# 
is not asserted during the 
write-back cycle since it obviously will not hit any 
cache. 


Aborted 
Plpellned Cycles 


Figure C-4 illustrates an aborted pipelined cycle. Al- 
though 
MHITMI # 
is 
sampled 
active 
during 
MSWENDI# 
(clock 7) MABORT# 
will not be issued 


immediately since the previous cycle has not been com- 
pleted yet. MABORT# 
is issued in clock 9 after 
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the last data slice was read into the core. The core floats 
its bus 
and 
asserts 
MBOFF# 
concurrently 
with 


MABORT#. 
Upon sampling MBOFF#, 
the snooping 


MBC begins the snoop write-back in clock 10. 


Write Allocate 


Figure C-5 illustrates a write cycle which is potentially 
allocatable. This write is performed on the bus only in 
order to sample the MKEN #, since the allocation cy- 
cle will only be guaranteed if MKEN # is active. 


MKEN # 
is sampled active in clock 2 causing the 


MABORT# 
to be issued immediately. The reason to 


abort the write cycle, even before MSWEND #, is due 
to the fact that a read for ownership cycle is guaranteed 
to be performed after the aborted write. 


In clock 4 the MADS# 
of the allocation cycle, which 
becomes the MADS # of the read for ownership cycle, 
is issued. This MADS # is issued only if MSWEND # 
has not been issued yet, or if MSWEND # was issued 
and MHITMI # was negated. If MHITMI # is asserted 
during the MSWEND# 
that was issued, MADS# 
will 


not be issued (since the snooper issues its MADS#). 


A second MABORT# 
is issued in clock 8 indicating 
the memory to abort the allocation, and the snooper to 
start flushing the modified line. Note that a second 
MABORT# 
will be issued regardless if MADS# 
of 
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Figure C-4. Aborted 
Plpellned 
Cycles 


the allocation was issued or not. The first MABORT# 
(clock 3) aborts the write cycle in the memory module 
and 
does 
not 
affect 
the 
snooper. 
The 
second 
MABORT# 
(clock 8) indicates to the snooper to start 
its write-back cycle (and if MADS# 
of an allocation 
was issued to also abort it in the memory module). 


MSNPSTB# 
is not issued for the allocation cycle since 
write and allocation cycles access the same line. 


If MKEN # had been negated in clock 2 then an alloca- 
tion would not have been performed and the write cycle 
would have continued as a non-allocatable write cycle 
(see figure C-6). 


Non-Allocatable 
Write 


Figure C-6 illustrates a write cycle without an alloca- 
tion. It can be either a non-potentially allocatable write 
cycle or a potentially allocatable write with inactive 
MKEN# 
(clock I). 


The write cycle is aborted (MABORT# 
in clock 3) 
after sampling active MHITM# 
during MSWEND# 


(clock 2). In clock 11 the master core re-issues the 
MADS# 
of the aborted write cycle (after the snoop 
write-back has been completed). MSNPSTB# 
will not 
be issued again since the updated data had been written 
into the main memory and the snooper has gone to the 
invalid state. 
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LIMITATIONS 
OF DESIGN 


The primary limitation of the implementation as it has 
been presented so far is that it includes only two proces- 
sors. The protocol set up in the desigu is not limited to 
two processors. The next section outlines the imple- 
mentation details which must be modified to extend the 
design to more than two processors. 


The design has no support for CS8 mode, so the proces- 
sors cannot be booted from 8 bit EPROMS. Instead, 
both processors boot in 64 bit mode, which may com- 
plicate the use of the design in stand-alone systems. 


The i860 XP CPU's BERR, or Bus ERRor, input is not 
utilized in this design. The pin could be used simply as 
a non-maskable interrupt pin, but the memory bus con- 
troller as designed makes no provision to use BERR to 
correct a faulty bus access. Likewise, the parity check 
results from the i860 XP CPU's PCHK # pin are of 
little value in this design outside of testing the i860 XP 
CPU's parity functions. The MBC itself does not check 
the PCHK # output, and has no means of reissuing an 
access in case of parity error. 


The memory bus controller design here does not decode 
and utilize the i860 XP CPU INT A cycles. The INT 
pin itself is connected directly to the i860 XP CPU, 
without affecting MBC operation. 
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Figure CoS. Potentially 
Allocatable 
Write 
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Figure C-6. Non-Allocatable 
Write 
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The MultiProcessor Interrupt Controller (MPIC) cur- 
rently being designed by Intel is not utilized in or sup- 
ported by this memory bus controller. 


The memory bus controller's treatment of LOCKed cy- 
cles is simple but straightfoward: when the 82495 issues 
a memory access which is LOCKed (KLOCK # ac- 
tive), the MBC will not relinquish the bus until a cycle 
which is not LOCKed is issued. While this is adequate 
for simple systems, it will not suffice for dual ported 
memories, where a given block of memory can be ac- 
cessed through more than one bus. In such systems, a 
LOCK signal must be introduced to alert all possible 
simultaneous users of memory that a LOCKed access is 
in progress. 


EXTENSION 
OF DESIGN TO THREE 
OR MORE CPUs 


Two Processor 
Implementation 
Overview 


Figure C-7 presents a simplified view of the multipro- 
cessing signals for the two processor implementation. 
The basic address, data, and memory cycle control lines 
are attached to a common bus. Only the core which 
controls the bus will drive these signals, with all other 
cores floating these lines and asserting MHLDA #. 


When 
the 
bus 
master 
MBC 
issues a 
cycle, the 
MCACHE# 
and MW/R# 
cycle attributes also serve 
to drive the 82495s' SNPINV and SNPNCA inputs of 
both cores. SNPSTB # is issued by the master in the 
clock following MADS #. In reality, both cores have a 
SNPSTB# 
output at their Y-side state machines driv- 
ing a common line which connects to the SNPSTB # 
input of both 82495s. The core which does not own the 
bus floats its state machine driver on MHLDA, so the 
signal acts only as an input in that core. The master 
drives the SNPSTB# line, but the action ofSNPSTB# 
is blocked in its own 82495 because its MAOE# 
signal 
is asserted. 


The results of the snoop are driven out on the snooping 
core's 
MTHIT# 
and 
MHITMO# 
outputs, 
and 
MSWENDO# 
is asserted. These signals are connected 
directly 
to 
the 
MHITMI #, 
MWB/WT #, 
and 
MSWENDI # inputs in the master core, respectively. 


The MBOFF# 
signals of the two MBCs are also con- 
nected 
together. 
During 
MHLDA 
(in a snooping 
MBC) MBOFF# 
is an input, and in the master it is an 
output. If the master asserts MBOFF, control of the 
data and control busses is given to the snooping MBC 
so that a snoop write-back can be performed. 


Three or More Processors 


This section gives one method of extending the design 
given here to three or more processors. The solution 
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presented here assumes that no changes are made to the 
state machines as they are written for the two processor 
system. Instead, some minor glue logic is added to three 
of the signals to make the core an element in a scalable 
multiprocessing system. However, modifying the state 
machines is also a plausible solution. 


In an implementation with three or more processors, 
the primary address, data, and cycle control lines are 
still connected to a common bus, as in the two proces- 
sor version. MCACHE# 
and MW/R# 
are also uti- 
lized in the same way as the two processor version: the 
outputs of the cores drive a common line which in turn 
also drives the 82495 SNPNCA and SNPINV inputs of 
all cores. 


The SNPSTB# 
signal connects directly from core to 
core in a two processor version. In an implementation 
with three or more processors, the SNPSTB # line is 
simply extended to all the processors in the system. 
Only the bus master will actually drive the line, and 
snoopers will be floating the SNPSTB# 
output from 
their state machines. Again, the snoop request is ig- 
nored in the master because its MAOE# 
is asserted. 
Similarly, the MBOFF# 
signal becomes a common line 
which only the master will drive and which all other 
cores will sample. 


The six signals in the upper portion of diagram C-7, 
which communicate MSWEND and the snoop results 
MHITMO# 
and MTHIT#, 
will require more glue 
logic to extend the design to three or more processors. 
The snoop results MHITMO# 
and MTHIT# 
must 
now be considered for multiple cores when a snoop has 
been issued, and the master MBC must not sample 
these results until all snooping cores have issued their 
MSWENDO#. 


To resolve these issues, common bus lines carrying 
these signals are introduced, where all cores have out- 
puts driving these lines, and inputs to sample them. The 
characteristics of such MTHIT # and MHITM # lines 
are straightforward: the line should default to I, and if 
any core drives one of these outputs 
low, the line 
should be pulled low. The MTHIT# 
line has the sim- 
plest solution. As shown in figure C-8, by passing the 
signal which is produced by the core through an open 
collector buffer, the buffered MTHIT#s 
can be tied to 
a single line which is sampled directly by all cores' 
MWB/WT# 
pins. The open collector buffer sinks cur- 
rent like a normal gate output to drive a logic 0, but 
instead of driving current for a logic I, the open collec- 
tor device assumes a high impedance state for logic 1. 
Thus, if all of the cores outputs MTHIT# 
as I, the 
MTHIT # line remains at a logic 1 level because of the 
pull-up resistor. If one or more cores outputs a logic 0, 
the MTHIT # line will be pulled to the logic 0 level. 
This 
precisely 
matches 
the 
desired 
behavior 
of 
MTHIT# 
for the system: if any I or more core(s) has 
the snooped data cached, the master MWB/WT# 
in- 
put must be asserted low. It is important to note that 
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Figure C-7. Interprocessor 
Communications 
In Two Processor System 
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the MTHIT 11 output of the master is floated: because 
the 82495 MTHIT 11 output only changes on each new 
snoop, the value of the master MTHIT 11 output for the 
previous snoop would erroneously be included in decid- 
ing the level of the MTHIT 11 line. 


The MHITM 11 line follows the same principle as the 
MTHIT 11 line. The MHITM 11 signal is not floated in 
the master core, and poses the problem which floating 
MTHITII avoids: the value of the master's last MHIT- 
MOll 
output is still present when the new access is 
being made. To resolve this, the inverted value of 
MHLDA 
is ORed with MHITMOII 
before going to 
the open collector buffer. The master's MHLDA is al- 
ways a 0, so the OR gate will always guarantee a 1 
being passed from the master to the MHITM 11 line. 
Again, if one or more of the snooping MBCs outputs a 
logic 0, the MHIT.M 11 line will properly assume a 0 
level. 


The open collector buffer presents an easy way to add 
new MBCs to the shared lines. The desired behavior of 
a shared MSWENDA (MSWEND All) line is different 
from the attribute 
lines, MTHIT 11 and MHITM 11. 


Where the master core should sample a 0 if anyone or 
more snooping core(s) drives a 0 on these attribute 
lines, the master core must not receive its MSWEN- 
DIll 
indication until all cores in the system have as- 
serted their MSWENDOII 
output. The answer is to 
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invert the MSWENDO output of each snooper, so that 
a zero is driven onto the MSWENDA line when the 
snoop is being performed, and a one is output if the 
snoop has completed. From the MSWENDIII 
perspec- 
tive, MSWENDIII should not be asserted at the master 
core if any snooping core is still driving a zero on the 
MSWENDA line (is not done snooping). Therefore, the 
MSWENDA line is the opposite logic polarity of the 
actual 
MSWENDOII 
signal. 
The 
master 
samples 
MSWENDA after the signal passes through an invert- 
et, to recorrect the logic level. The output of each core 
is passed through inverter before going to the open col- 
lector buffer. The inverting device is a NAND gate be- 
cause the SWENDOII 
signal shares the problem of 
MHITM 11, and must be "faked" by the master. In this 
case, instead of the last snoop's results causing the 
problem, the master's SWENDOII 
signal is reset to 1 
(still snooping) when the SNPSTBII line is asserted. 


Again, these simple adaptations can be implemented in 
a similar manner in the logic of the state machines. The 
MHITMOII 
line can be forced to a logic one or floated 
when the core is a master (after YBGT, for example). 
The MSWEND signal might be implemented as an as- 
serted-high system signal, if open collector buffers are 
used to attach new cores to the shared system bus. 
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Figure c-8. Extension Glue 
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STATE 
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AND SCHEMATICS 
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RESET 
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ELSE 


CRDY.WCPL8 
+CRDY• WCPL8' 
• PBGT' • YSBGT' 


CRDY' 
+(PBGT + YSBGT) • WCPL8' 
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LBRDY = CL EN 1.BRDY 


LBRDY 
• YSCEOC 


LBRDY" 


XCTRCK 
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COTS" 
• SHPADS" 
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WRESET 
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WBRO.WLEN 1.WBRDY.WWSWND.WABORT".(CLEN 
12+CLEN4) 
+.,BR1 •.,LEN 1.WWSWND.WABORT".(CLEN 
12+CLEN4) 
+.,BR 1.WLEH2 •.,BRDY .WWSWND•.,ABORT" .(CLEH 12+CLEN4) 
+WBR2.WLEN2. WWSWHD.WABORT" .(CLEN 12+CLEN4) 
+.,BR 1.WLEH4.WBRDY .WWSWND.WABORT" .CLEN 12 
+(WBR2+WBR3).WLEN4.WWSWHD.WABORT" 
.CLEN 12 
+WBR3.WLEN4.WBRDY.WWSWND.WABORT".CLEN4 
+(WBR4+WBR5+ 
WBR6+W BR7+WBR8) .WLEN4. WWSWN0 
.WABORT".(CLEH 
12+CLEH4) 
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YIWSWEHO = WSWEHOI.YALLOC".OISWHO" 


YIMSWND 
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WBRDY 


YMBRDY 
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Y•• EOC' 
.(YNOPIPE 
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YMEMEOC 
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+IIBR •• IILEN •• WIISWND.IIABORTlf. 
TR. 
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+YALLOC.IIAIlORT 
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PLD CODES 


---------------------------------- 
Declaration 
Segment 
------------ 
TITLE AYMBTRCK 
PATTERN 
A 
REVISION 
2.0 
AUTHOR 
ISIC SILAS 
COMPANY 
INTEL 
DATE 
2/4/91 
CHIP 
xOl 
85C22vlO 


This 
PLD contains 
the YMBTRCK 
state machine. 


;---------------------------------- 
PIN Declarations 
--------------- 
PIN 
1 
MCLK 
COMBINATORIAL 
; 
PIN 
2 
MRESET 
COMBINATORIAL 
; 


PIN 
3 
jWMSWND 
COMBINATORIAL 
; 


PIN 
4 
/MBOFFI 
COMBINATORIAL 
; 
PIN 
5 
/PXSAS 
COMBINATORIAL 
; 


PIN 
6 
/PSWBAS 
COMBINATORIAL 
PIN 
7 
MHOLD 
COMBINATORIAL 
; 


PIN 
8 
/MNA 
COMBINATORIAL 
; 
PIN 
9 
/WMNA 
COMBINATORIAL 
; 


PIN 
10 
/YMLOCK 
COMBINATORIAL 
PIN 
11 
/ySWEHITM 
COMBINATORIAL 
PIN 
12 
GND 
PIN 
13 
/PCTCXFR 
PIN 
14 
/RSTRT 
PIN 
15 
/YMEOC 
PIN 
16 
UNUSED 
PIN 
17 
/YBGT 
PIN 
18 
/yMADS 
PIN 
19 
/MAOE 
PIN 
20 
/YNOPIPE 
PIN 
21 
/YMSTR 
PIN 
22 
/YPIPE 
PIN 
23 
/YMSEL 
PIN 
24 
VCC 
;----------------------------------- 
Boo1ean 
Equation 
Segment 
EQUATIONS 


COMBINATORIAL 
COMBINATORIAL 
COMBINATORIAL 
registered 
; 


regis tered 
; 


registered 
; 


registered 
; 
registered 
registered 
; 
registered 
; 
registered 
; 


YNOPIPE 
:- 
/MRESET * PXSAS * /MHOLD * YMEOC * YNOPIPE 
+ /MRESET * PXSAS * XMLOCK * YMEOC * YNOPIPE 
+ /MRESET * /PXSAS * /YMEOC * /YSWEHITM * YNOPIPE 
+ /MRESET * /YMEOC * /WMSWND * /YSWEHITM * YNOPIPE 
+ /MRESET * YMEOC * /YSWEHITM * /PCTCXFR * YPIPE 
+ /MRESET * /PCTCXFR * RSTRT * YMSTR * /MAOE 
+ /MRESET * /MNA * /WMNA * /YMEOC * /YSWEHITM * YNOPIPE 
+ /MRESET * MHOLD * /YMLOCK * /YMEOC * /ySWEHITM * YNOPIPE 
+ /MRESET * PXSAS * /MHOLD * /PCTCXFR * YMSTR * /MAOE 
+ /MRESET * PXSAS * YMLOCK * /PCTCXFR * YMSTR * /MAOE 
+ /MRESET * PXSAS * /MHOLD * /MBOFFI * /YMSTR * /MAOE 
+ /MRESET * PXSAS * /MHOLD * /YSWEHITM * /PCTCXFR * jYNOPIPE * /YPIPE 
* YMSTR 
+ /MRESET * PXSAS * YMLOCK * /YSWEHITM * /PCTCXFR * /YNOPIPE * /YPIPE 
* YMSTR 


YPIPE 
:- 
/MRESET * /YMEOC * YPIPE 
+ /MRESET * PXSAS * /MHOLD * MNA * /YMEOC * WMSWND * /YSWEHITM 
* YNOPIPE 
+ /MRESET * PXSAS * /MHOLD * WMNA * /YMEOC * WMSWND * /YSWEHITM 
* YNOPIPE 
+ /MRESET * PXSAS * MNA * YMLOCK * /YMEOC * WMSWND * /YSWEHITM 
* YNOPIPE 
+ /MRESET * PXSAS * WMNA * YMLOCK * /YMEOC * WMSWND * /YSWEHITM 
* YNOPIPE 
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YMSTR 
/MRESET * YPIPE 
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MAOE 


+ /MRESET * /YMEOC * YNOPIPE 
+ /MRESET * YSWEHITM * YNOPIPE 
+ /MRESET * jMHOLD * YMSTR 
+ /MRESET * YMLOCK * YMSTR 
+ /MRESET * /MHOLD * /MBOFFI * /MAOE 
+ /MRESET * PCTCXFR * YMSTR * /MAOE 
+ /MRESET * RSTRT * YMSTR * /MAOE 


:= 
/MRESET * /PCTCXFR * RSTRT * YMSTR * /MAOE 
+ /MRESET * /YMEOC * YPIPE 
+ /MRESET * YMLOCK * YMEOC * YNOPIPE 
+ /MRESET * /YMEOC * /ySWEHITM 
* YNOPIPE 
+ /MRESET * /YSWEHITM 
* /PCTCXFR * YPIPE 
+ /MRESET * /YMEOC * /YMSTR * MAOE 
+ /MRESET * YMLOCK * /ySWEHITM 
* /PCTCXFR * YMSTR 
+ /MRESET * /MHOLD * /PCTCXFR * YMSTR * /MAOE 
+ /MRESET * YMLOCK * /PCTCXFR * YMSTR * /MAOE 
+ /MRESET * /MHOLD * /MBOFFI * /YMSTR * /MAOE 
+ /MRESET * PSWBAS * MBOFFI * /YMSTR * /MAOE 
+ /MRESET * /MHOLD * /YMLOCK * /YMEOC * /YNOPIPE * MAOE 
+ /MRESET * /MHOLD * /YMLOCK * YMEOC * /yPIPE * YMSTR * MAOE 
+ /MRESET * /PXSAS * YMLOCK * /YNOPIPE * /YPIPE * oYMSTR * MAOE 


:- 
/MRESET * PXSAS * /MHOLD * YMEOC * YNOPIPE 
+ /MRESET * PXSAS * YMLOCK * YMEOC * YNOPIPE 
+ /MRESET * /PCTCXFR * RSTRT * YMSTR * /MAOE 
+ /MRESET * PXSAS * /MHOLD * /PCTCXFR * YMSTR * /MAOE 
+ /MRESET * PXSAS * YMLOCK * /PCTCXFR * YMSTR * /MAOE 
+ /MRESET * PXSAS * /MHOLD * /MBOFFI * /YMSTR * /MAOE 
+ /MRESET * PXSAS * /MHOLD * /YSWEHITM 
* /PCTCXFR * /YNOPIPE 
* /YPIPE 
* YMSTR 
+ /MRESET * PXSAS * YMLOCK * /ySWEHITM * /PCTCXFR * /YNOPIPE 
* /YPIPE 
* YMSTR 
+ /MRESET * PSWBAS * MBOFFI * /YMSTR * /MAOE 
+ /MRESET * PXSAS * /MHOLD * MNA * WMSWND * /YSWEHITM 
* YNOPIPE 
+ /MRESET * PXSAS * /MHOLD * WMNA * WMSWND * /ySWEHITM 
* YNOPIPE 
+ /MRESET * PXSAS * MNA * YMLOCK * WMSWND * /YSWEHITM 
* YNOPIPE 
+ /MRESET * PXSAS * WMNA * YMLOCK * WMSWND * /YSWEHITM 
* YNOPIPE 


YMADS 


YBGT 
:- 
/MRESET * PXSAS * /MHOLD * YMEOC * YNOPIPE 
+ /MRESET * PXSAS * YMLOCK * YMEOC * YNOPIPE 
+ /MRESET * PXSAS * /MHOLD * /MBOFFI * /YMSTR * /MAOE 
+ /MRESET * PSWBAS * MBOFFI * /YMSTR * /MAOE 
+ /MRESET * PXSAS * /MHOLD * MNA * WMSWND * /YSWEHITM 
* YNOPIPE 
+ /MRESET * PXSAS * /MHOLD * WMNA * WMSWND * /ySWEHITM * YNOPIPE 
+ /MRESET * PXSAS * MNA * YMLOCK * WMSWND * /YSWEHITM 
* YNOPIPE 
+ /MRESET * PXSAS * WMNA * YMLOCK * WMSWND * /YSWEHITM * YNOPIPE 
+ /MRESET * PXSAS * YMLOCK * /YNOPIPE * /YPIPE * YMSTR * MAOE 
+ /MRESET * PXSAS * /MHOLD * /PCTCXFR * /RSTRT * YMSTR * /MAOE 
+ /MRESET * PXSAS * YMLOCK * /PCTCXFR * /RSTRT * YMSTR * /MAOE 
+ /MRESET * PXSAS * /MHOLD * /YSWEHITM 
* /PCTCXFR * /YNOPIPE 
* /YPIPE 
* YMSTR * MAOE 


YMSEL 
:- 
/MRESET * /YMEOC * YPIPE 
+ /MRESET * /YMEOC * /YSWEHITM * YNOPIPE 
+ /MRESET * /YSWEHITM * /PCTCXFR * YPIPE 
+ /MRESET * /YMEOC * WMSWND * PCTCXFR * /RSTRT * YMSTR * /MAOE 


UNUSED 
0= VCC 
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Declaration Segment ------------ 
TITLE AYMEMLEN 
PATTERN 
A 
REVISION 2.0 
AUTHOR 
ISIC SILAS 
COMPANY 
INTEL 
DATE 
2/4/91 
CHIP 
xOl 
85C22VI0 


This PLD contains the YMEMLEN and YCPUEOC state machines 


;----------------------------------PIN Declarations --------------- 
PIN 
1 
MCLK 
COMBINATORIAL ; 
PIN 
2 
MRESET 
COMBINATORIAL ; 
PIN 
3 
/YMSEL 
COMBINATORIAL ; 
PIN 
4 
/yPIPE 
COMBINATORIAL ; 
PIN 
5 
/MABORT 
COMBINATORIAL ; 
PIN 
6 
/MBRDY 
COMBINATORIAL ; 


PIN 
7 
jWMSWND 
COMBINATORIAL ; 
PIN 
8 
XLRDYSRC 
COMBINATORIAL ; 
PIN 
9 
/XLKCACHE 
COMBINATORIAL 
PIN 
10 
/MKEN 
COMBINATORIAL ; 
PIN 
11 
LEN 
COMBINATORIAL ; 
PIN 
12 
GND 
PIN 
13 
/CACHE 
PIN 
14 
/YMEOC 
PIN 
15 
/SVRO 
PIN 
16 
/SVRl 
PIN 
17 
/SVR2 
PIN 
18 
/SVR3 
PIN 
19 
/yCEOC 
PIN 
20 
/SVLO 
PIN 
21 
/SVLl 
PIN 
22 
/SVCO 
PIN 
23 
/SVCl 
PIN 
24 
VCC 
;---------------------------------BooleanEquation Segment ------ 
EQUATIONS 


COMBINATORIAL ; INPUT 
COMBINATORIAL ; INPUT 
COMBINATORIAL 
INPUT 
COMBINATORIAL 
INPUT 
COMBINATORIAL 
INPUT 
COMBINATORIAL 
INPUT 
registered ; 
registered ; 
registered ; 
registered ; 
registered ; 


SVLl 
:- 
/YMEOC * /MRESET * SVLl 
+ YPIPE * LEN * /XLKCACHE * /MRESET * SVLl 
+ YMSEL * LEN * /MRESET * /SVLl * /SVLO 
+ YPIPE * LEN * XLRDYSRC * /MKEN * /MRESET * SVLl 
+ YPIPE * YMEOC * LEN * /XLKCACHE * /MRESET * SVLO 
+ YMSEL * /XLRDYSRC * XLKCACHE * /MRESET * /SVLl * /SVLO 
+ YMSEL * XLKCACHE * MKEN * /MRESET * /SVLl * /SVLO 
+ YPIPE * YMEOC * LEN * XLRDYSR~ * /MKEN * /MRESET * SVLO 


SVLO 
:- 
YMSEL * /XLRDYSRC * XLKCACHE * /MRESET * /SVLl * /SVLO 
+ YMSEL * XLKCACHE * MKEN * /MRESET * /SVLl * /SVLO 
+ /YMEOC * /MRESET * SVLO 
+ YPIPE * /LEN * /XLKCACHE * /MRESET * SVLO 
+ YMSEL * /LEN * /MRESET * /SVLl * /SVLO 
+ YPIPE * YMEOC * /LEN * /XLKCACHE * /MRESET * SVLl 
+ YPIPE * /LEN * XLRDYSRC * /MKEN * /MRESET * SVLO 
+ YPIPE * YMEOC * /LEN * XLRDYSRC * /MKEN * /MRESET * SVLl 


SVC1 
:- 
/YMEOC * /yCEOC * /MRESET * SVC1 
+ YMSEL * XLRDYSRC * /MRESET * /SVC1 * /SVCO 
+ YPIPE * YMEOC * /CACHE * XLRDYSRC * /MRESET * SVCl 
+ YPIPE * YMEOC * /CACHE * XLRDYSRC * /MRESET * SVCO 


SVCO 
:- 
/YMEOC * /MRESET * SVCO 
+ /YMEOC * YCEOC * /MRESET * SVCl 
+ YPIPE * /XLRDYSRC * /MRESET * SVCO 
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+ YPIPE * YMEOC * /XLRDYSRC 
* /MRESET * SVCl 
+ YMSEL * CACHE * /MRESET * /SVCl * /SVCO 
+ YMSEL * /XLRDYSRC 
* /MRESET * /SVCl * /SVCO 


YCEOC 
'- 
SVR3 * /SVR2 * /SVRl * SVLl * SVLO * SVCl * WMSWND 
* /MABORT * /MRESET * /yCEOC 
+ SVR3 * /SVRl * /SVRO * SVLl * SVLO * SVCl * WMSWND 
* /MABORT * /MRESET 
* /yCEOC 
+ /SVR3 * /SVR2 * SVRl * /SVRO * SVLl * /SVLO * SVCl 
* WMSWND * /MABORT * /MRESET * /yCEOC 
+ /SVR3 * /SVR2 * /SVRl * SVRO * /SVLl * SVLO * SVCl 
* WMSWND * /MABORT * /MRESET * /yCEOC 
+ /SVR3 * SVR2 * SVRl * /SVRO * SVLl * SVLO * SVCl 
* WMSWND * /MABORT 
* /MRESET * /yCEOC 
+ /SVR3 * /SVR2 * SVRl * SVRO * SVLl * SVLO * SVCl 
* WMSWND * /MABORT * /MRESET * /yCEOC 
+ /SVR3 * /SVR2 * SVRl * /SVRO * SVLl * SVCl * /SVCO 
* WMSWND * /MABORT * /MRESET * /yCEOC 
+ /SVR3 * SVR2 * /SVRO * SVLl * SVLO * SVCl * /SVCO 
* WMSWND * /MABORT * /MRESET * /YCEOC 
+ /SVR3 * /SVR2 * /SVRl * /SVLl * SVLO * SVCl * MBRDY 
* WMSWND * /MABORT * /MRESET * /yCEOC 
+ /SVR3 * SVR2 * /SVRO * SVLl * SVLO * SVCl * MBRDY 
* WMSWND * /MABORT * /MRESET * /yCEOC 
+ /SVR3 * /SVR2 * /SVRl * SVRO * SVLl * /SVLO * SVCl 
* MBRDY * WMSWND * /MABORT * /MRESET * /yCEOC 
+ /SVR3 * /SVR2 * /SVRl * SVRO * SVLl * SVCl * /SVCO 
* MBRDY * WMSWND * /MABORT * /MRESET * /yCEOC 
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COMBINATORIAL 
; 
COMBINATORIAL 
; 
COMBINATORIAL 
; 
COMBINATORIAL 
; 


MCLK 
MRESET 
TR4 
/yALLOC 


/MABORT 
/MBRDY 
/WMSWND 
/MBOFFI 
jYMSEL 
/yPIPE 
CND 
/SVLl 
/SVLO 
/SVR3 
/SVR2 
/SVRl 
/SVRO 
jYMEOCl 
jYMEOC 
jYMFRZ 
/CTCEND 
/SV 
VCC 


---------------------------------- 
Declaration Segment ------------ 
TITLE BYRDYSTR 
PATTERN 
A 
REVISION 2.0 
AUTHOR 
ISIC SILAS 
COMPANY 
INTEL 


DATE 
2/4/91 
CHIP 
xOl 85C22VlO 


This PLO contains the YRDYSTR. YRDYSTR. and YMEMEOC state machines. 


;---------------------------------- 
PIN'Declarations 
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COMBINATORIAL 
; 
COMBINATORIAL 
; 


COMBINATORIAL 
; 


COMBINATORIAL 
; 


COMBINATORIAL 
; 


COMBINATORIAL 


COMBINATORIAL 
; INPUT 
COMBINATORIAL 
; INPUT 
registered 
registered 
registered 
registered 
registered 
registered 
; 


registered 
; 


registered 
registered 
; 


EQUATIONS 


----------------------------------- 
Boolean Equation Segment ------ 
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SVR3 
- 
/MRESET * jYMEOC * /MBRDY * /MABORT * SVR3 
+ /MRESET * MBRDY * /MABORT * /MBOFFI * SVR2 
+ /MRESET * /MBRDY * /MABORT * SVR3 * SVR2 
+ /MRESET * MBRDY * /MABORT * SVR2 * SVRl 
+ /MRESET * jYMEOC * /MABORT * SVR3 * SVRO 
+ /MRESET * MBRDY * /MABORT * /TR4 * /SVR3 * SVR2 


SVR2 
- 
/MRESET * /MBRDY * /MABORT * SVR2 
+ /MRESET * /MABORT * SVR2 * SVRl 
+ /MRESET * jYMEOC * MBRDY * /MABORT * SVRl 
+ /MRESET * MBRDY * /MABORT * MBOFFI * SVRl 
+ /MRESET * MBRDY * /MABORT * SVRl * SVRO 


SVRl 
- 
/MRESET * /MABORT * SVRl * SVRO 
+ /MRESET * jYMEOC * /MBRDY * /MABORT * SVRl 
+ /MRESET * /MBRDY * /MABORT * MBOFFI * SVRl 
+ /MRESET * /MBRDY * /MABORT * SVR2 * SVRl 
+ /MRESET * jYMEOC * MBRDY * /MABORT * /SVR3 * SVRO 
+ /MRESET * MBRDY * /MABORT * MBOFFI * /SVR3 * SVRO 
+ /MRESET * jYMEOC * MBRDY * /MABORT * SVR3 * /SVR2 * /SVRO 


SVRO 
- 
/MRESET * MBRDY * /MABORT * /MBOFFI * SVR3 
+ /MRESET * MBRDY * /MABORT * SVR3 * /SVR2 
+ /MRESET * jYMEOC * /MBRDY * /MABORT * SVRO 


PIN 
1 
PIN 
2 
PIN 
3 
PIN 
4 


;PIN 
5 #### 
PIN 
6 
PIN 
7 
PIN 
8 
PIN 
9 
PIN 
10 
PIN 
11 
PIN 
12 
PIN 
13 
PIN 
14 
PIN 
15 
PIN 
16 
PIN 
17 
PIN 
18 
PIN 
19 
PIN 
20 
PIN 
21 
PIN 
22 
PIN 
23 
PIN 
24 
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+ /MRESET * /MBRDY * /MABORT * SVRl * SVRO 
+ /MRESET * /MBRDY * /MABORT * MBOFFI * /SVR3 * SVRO 
+ /MRESET * YPIPE * YMEOC * MBRDY * /MABORT * /SVRl * SVRO 
+ /MRESET * YMSEL * MBRDY * /MABORT * /SVR2 * /SVRl * /SVRO 
+ /MRESET * MBRDY * /MABORT * MBOFFI * /SVR2 * /SVRl * /SVRO 
+ /MRESET * YPIPE * YMEOC * MBRDY * /MABORT * /SVR2 * SVRl 
* /SVRO 
' 


- 
/MRESET * MBRDY * /MABORT * MBOFFI * SVR3 * SVR2 
+ /MRESET * MBRDY * /MABORT * MBOFFI * TR4 * SVR2 * /SVRl 


YMEOC 
- 
/MRESET * MABORT * YALLOC * /YMEOC * /SV 
+ /MRESET * /SVR3 * /SVR2 * SVRl * /SVRO * SVLl * /SVLO 
* WMSWND * /MABORT * jYMEOC * /SV 
+ /MRESET * /SVR3 * /SVR2 * /SVRl * SVRO * /SVLl * SVLO 
* WMSWND * /MABORT * jYMEOC * /SV 
+ /MRESET * SVR3 * /SVR2 * /SVRl * SVRO * SVLl * SVLO 
* WMSWND * /MABORT * jYMEOC * /SV 
+ /MRESET * SVR3 * /SVR2 * /SVRl * SVLl * SVLO * TR4 * WMSWND 
* /MABORT * /YMEOC * /SV 
+ /MRESET * /SVR3 * /SVR2 * /SVRl * /SVLl * SVLO * MBRDY 
* WMSWND * /MABORT * /YMEOC * /SV 
+ /MRESET * /SVR3 * /SVR2 * /SVRl * SVRO * SVLl * /SVLO 
* MBRDY * WMSWND * /MABORT * jYMEOC * /SV 
+ /MRESET * SVR3 * SVR2 * /SVRl * /SVRO * SVLl * SVLO 
* MBRDY * WMSWND * /MABORT * jYMEOC * /SV 
+ /MRESET * SVR2 * /SVRl * /SVRO * SVLl * SVLO * TR4 * MBRDY 
* WMSWND * /MABORT * /YMEOC * /SV 


CTCEND 


SV 
= 
/MRESET * YMEOC 


/YMFRZ 
- 
MRESET 
+ /MABORT 
+ jYALLOC 
+ YMEOC 
+ SV 


YMEOCl 
- 
/MRESET * MABORT * YALLOC * /YMEOCl * /SV 
+ /MRESET * /SVR3 * /SVR2 * SVRl * /SVRO * SVLl * /SVLO 
* WMSWND * /MABORT * /YMEOCl * /SV 
+ /MRESET * /SVR3 * /SVR2 * /SVRl * SVRO * /SVLl * SVLO 
* WMSWND * /MABORT * /YMEOCl * /SV 
+ /MRESET * SVR3 * /SVR2 * /SVRl * SVRO * SVLl * SVLO 
* WMSWND * /MABORT * jYMEOCl * /SV 
+ /MRESET * SVR3 * /SVR2 * /SVRl * SVLl * SVLO * TR4 * WMSWND 
* /MABORT * jYMEOCl * /SV 
+ /MRESET * /SVR3 * /SVR2 * /SVRl * /SVLl * SVLO * MBRDY 
* WMSWND * /MABORT * jYMEOCl * /SV 
+ /MRESET * /SVR3 * /SVR2 * /SVRl * SVRO * SVLl * /SVLO 
* MBRDY * WMSWND * /MABORT * /YMEOCl * /SV 
+ /MRESET * SVR3 * SVR2 * /SVRl * /SVRO * SVLl * SVLO 
* MBRDY * WMSWND 
~ /MABORT * /YMEOCl * /SV 
+ /MRESET * SVR2 * /SVRl * /SVRO * SVLl * SVLO * TR4 * MBRDY 
* WMSWND * /MABORT * /YMEOCl * /SV 
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Declaration Segment ------------ 
TITLE EABORT 
PATTERN 
A 
REVISION 2.0 
AUTHOR 
ISIC SILAS 
COMPANY 
INTEL 
DATE 
2/5/91 


CHIP 
x01 85C224 


This PLD contains the YABORT, YRSTRT, and YMEMDOE state machines. 


Pin Definitions 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
13 
14 
15 
16 
23 


18 
19 
20 
21 
22 
17 


PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 


MCLK 
MRESET 
WMSWND 
YSWEHITM 
YALLOC 
YPIPE 
YNOPIPE 
YMEOC 
MHITMI 
MAOE 
CTCEND 
MKEN 
MHLDA 
YWR 
YMADS 
CTCDIS 


PIN 
PIN 
PIN 
PIN 
PIN 
PIN 


RSTRT 
YMDOE 
PCTCXFR 
TRIABORT 
MABORT 
SV 
;Swapped pins 23 and 17 to fit 85C224 


EQUATIONS 


/RSTRT.D 
:- /MRESET * /PCTCXFR * /CTCDIS 
+ /MRESET * /PCTCXFR * /RSTRT 
+ /MRESET * /ySWEHITM * /CTCDIS * RSTRT 
+ /MRESET * /ySWEHITM * YWR * YALLOC * RSTRT 
RSTRT.CLKF - MCLK 
RSTRT.RSTF - GND 
RSTRT.SETF - GND 
RSTRT.TRST - VCC 


/YMDOE.D 
'- /MRESET * YWR * /YPIPE * /YMEOC 
+ /MRESET * /YNOPIPE * YMEOC * /YMDOE 
+ /MRESET * MHLDA * YMEOC * /YMDOE 
+ /MRESET * /yPIPE * YMEOC * /YMDOE 
+ /MRESET * /YMADS * YWR * /YNOPIPE * YMDOE 
+ /MRESET * /YMADS * YWR * MHLDA * YMDOE 
YMDOE.CLKF - MCLK 
YMDOE.RSTF - GND 
YMDOE.SETF - GND 
YMDOE.TRST - VCC 


/PCTCXFR.D 
:- /MRESET * YALLOC * /MABORT 
+ /MRESET * /ySWEHITM * /MAOE * PCTCXFR 
+ /MRESET * /MHITMI * /WMSWND * /MABORT 
+ /MRESET * CTCEND * /PCTCXFR * MABORT 
+ /MRESET * /PCTCXFR * MABORT * /SV 
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+ /MRESET * /YSWEHITM 
* jYPIPE * YMEOC * PCTCXFR 
+ /MRESET * jYPIPE * /YMEOC * /YALLOC * PCTCXFR 
+ /MRESET * /YNOPIPE * YMEOC * /YALLOC * /MKEN * PCTCXFR 
PCTCXFR.CLKF 
- MCLK 
PCTCXFR.RSTF 
- GND 
PCTCXFR.SETF 
- GND 
PCTCXFR.TRST 
- VCC 


/TRIABORT.D 
:- /MRESET * /YPIPE * jYMEOC * /YALLOC * PCTCXFR * /MHLDA 
+ /MRESET * /YNOPIPE * YMEOC * jYALLOC * /MKEN * PCTCXFR 
* /MHLDA 
+ /MRESET * /YSWEHITM * /MAOE * YPIPE * PCTCXFR * /MHLDA 
+ /MRESET * /YSWEHITM * /MAOE * jYMEOC * PCTCXFR * /MHLDA 
+ /MRESET * jYMEOC * /PCTCXFR * MABORT * /SV * /MHLDA 
TRIABORT.CLKF 
- MCLK 
TRIABORT.RSTF 
= GND 
TRIABORT.SETF 
= GND 
TRIABORT.TRST 
= /MHLDA 


/MABORT.D 
'- /MRESET * /YPIPE * jYMEOC * jYALLOC * PCTCXFR 
+ /MRESET * jYNOPIPE * YMEOC * /YALLOC * /MKEN * PCTCXFR 
+ /MRESET * /YSWEHITM * /MAOE * YPIPE * PCTCXFR 
+ /MRESET * jYSWEHITM 
* /MAOE * jYMEOC * PCTCXFR 
+ /MRESET * jYMEOC * /PCTCXFR * MABORT * /SV 
MABORT.CLKF 
- MCLK 
MABORT.RSTF 
- GND 
MABORT.SETF 
- GND 
MABORT.TRST 
- VCC 


/SV.D 
:- MABORT * /SV 
+ PCTCXFR 
SV .CLKF - MCLK 
SV.RSTF 
= GND 
SV.SETF 
- GND 
SV.TRST 
= VCC 
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PIN 
2 
RESET 
PIN 
3 
CADS 
PIN 
4 
CDTS 
PIN 
5 
SNPADS 


PIN 
6 
CIJR 


PIN 
7 
YSBGT 
PIN 
8 
CRDY 


PIN 
9 
CAHOLD 
PIN 
10 
FSIOUT 
PIN 
11 
SLFTST 
PIN 
13 
OEx 


PIN 
14 
RDYSRC 


PIN 
16 
LRDYSRC 
PIN 
17 
SV2 
PIN 
18 
STFAIL 
PIN 
19 
SV1 
PIN 
20 
XSNPWB 


PIN 
21 
WSDTS 
PIN 
22 
XAS 


OE control inverted during design conversion. 


EQUATIONS 


LRDYSRC.D 
:- RDYSRC 
LRDYSRC .CLKF - CLK 
LRDYSRC.RSTF - GND 
LRDYSRC.SETF - GND 
/LRDYSRC.TRST - OEx 


/SV2.D 
:- /RESET * FSIOUT * /SLFTST * STFAIL * SV2 
SV2.CLKF - CLK 
SV2.RSTF - GND 
SV2.SETF - GND 
/SV2.TRST - OEx 


/STFAIL.D 
:- /RESET * FSIOUT * /CAHOLD * /SLFTST * STFAIL * SV2 


STFAIL.CLKF - CLK 
STFAIL.RSTF - GND 
STFAIL.SETF - GND 
/STFAIL.TRST - OEx 


/SV1.D 
:- /RESET * CDTS * CRDY * /SV1 
+ /RESET * CDTS * WSDTS * /SVl 
+ /RESET * /SNPADS * XSNPWB * SV1 
+ /RESET * /CDTS * CRDY * /WSDTS * XSNPWB 
SVl .CLKF - CLK 
SV1.RSTF - GND 
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SVl. SETF 
- CND 


/SVl.TRST 
- OEx 


/XSNPWB.D 
:- /RESET * CRDY * /XSNPWB 
+ /RESET * /CDTS * WSDTS * /SVl 


XSNPWB.CLKF 
- CLK 


XSNPWB.RSTF 
- CND 


XSNPWB.SETF 
- CND 


/XSNPWB.TRST 
- OEx 


jWSDTS.D 
:- /RESET * CRDY * /XSNPWB 
+ /RESET * /CDTS * XSNPWB 
+ /RESET * jWSDTS * /SVl 
+ /RESET * SNPADS * CRDY * /WSDTS 


WSDTS.CLKF 
- CLK 


WSDTS.RSTF 
- CND 


WSDTS.SETF 
- CND 


/WSDTS.TRST 
- OEx 


/XAS.D 
:- /RESET * SNPADS * YSBCT * /XAS 
+ /RESET * /CDTS * CWR * XAS 
+ /RESET * /CADS * /CWR * XAS 


XAS .CLKF 
- CLK 


XAS.RSTF 
- CND 


XAS.SETF 
- CND 


/XAS.TRST 
- OEx 
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;-----------------------------------Declaration 
Segment -,----------- 
TITLE 
EBGTKWN 
PATTERN 
REVISION 
1.0 
AUTHOR 
COMPANY 
INTEL 
DATE 


CHIP 
INTEL 85C224 


This PLO contains the XBGTKWND, XCNA and XENBCT state machines 


;---------------------------------Pin 
Declarations-------------------- 
PIN 
1 
CLK 
PIN 
2 
RESET 
PIN 
3 
YSBCT 
PIN 
4 
CRDY 
PIN 
5 
C8LORV 
PIN 
6 
TR4 
PIN 
7 
NC5 
PIN 
8 
NC6 
PIN 
9 
WCPLB 
PIN 
10 
CNADIS 
PIN 
11 
NCl 
PIN 
13 
OE 
PIN 
14 
NC2 
PIN 
15 
NC3 
PIN 
23 
NC4 


PIN 
16 
CKENLC 
PIN 
17 
ENBCT 
PIN 
18 
CNA 
PIN 
19 
PBCT 
PIN 
20 
KWEND 
PIN 
21 
C5BCT. 


PIN 
22 
BGT 


EQUATIONS 


/CKENLC.D 
:- /RESET * YSBGT * CRDY * jBGT * jKWEND 
+ jRESET * CRDY * ENBCT * JBCT * jKWEND 
CKENLC.CLKF - CLK 
CKENLC.RSTF - CND 
CKENLC.SETF - CND 
/CKENLC.TRST - OE 


ENBGT.D :- /RESET * jYSBCT 
ENBGT.CLKF - CLK 
ENBCT.RSTF - CND 
ENBCT.SETF - CND 
jENBGT.TRST - OE 


/CNA.D 
:- jRESET * CRDY * jCNA 
+ jRESET * jYSBCT * WCPLB * CNADIS 
+ jRESET * jYSBCT * WCPLB * jCNA 
+ jRESET * jPBCT * WCPLB * jCNA 
+ /RESET * JBCT * WCPLB * CNADIS * CNA 
CNA.CLKF - CLK 
CNA.RSTF - CND 
CNA.SETF - GND 
/CNA,TRST - OE 


/PBGT.D :- jRESET * CRDY * jPBGT 
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+ /RESET * jYS.BGT * CRDY * /ENBGT * /BGT * /KWEND 
PBGT.CLKF 
- CLK 
PBGT.RSTF 
- GND 
PBGT. SETF 
- GND 


/PBGT.TRST 
- OE 


/KWEND.D 
:- /RESET * /BGT * KWEND 
+ /RESET * /CRDY * /PBGT 
+ /RESET * YSBGT * CRDY * /BGT 
+ /RESET * CRDY * ENBGT * /BGT 
+ RESET * TR4 
KWEND .CLKF 
- CLK 
KWEND.RSTF 
- GND 
KWEND.SETF 
- GND 


/KWEND.TRST 
- OE 


/CSBGT.D 
:- /RESET * /BGT * KWEND 
+ /RESET * /CRDY * /PBGT 
+ /RESET * YSBGT * CRDY * /BGT 
+ /RESET * CRDY * ENBGT * /BGT 
+ /RESET * jYSBGT * /CRDY * /ENBGT * /BGT 
+ /RESET * jYSBGT * /ENBGT * CSBGT * KWEND * PBGT 
+ RESET * /C8LDRV 
CSBGT.CLKF 
- CLK 
CSBGT.RSTF 
- GND 
CSBGT. SETF 
- GND 


/CSBGT.TRST 
- OE 


/BGT.D 
:- /RESET * /BGT * KWEND 
+ /RESET * /CRDY * /PBGT 
+ /RESET * YSBGT * CRDY * /BGT 
+ /RESET * CRDY * ENBGT * /BGT 
+ /RESET * jYSBGT * /CRDY * /ENBGT * /BGT 
+ /RESET * /YSBGT * /ENBGT * CSBGT * KWEND * PBGT 
BGT.CLKF 
- CLK 
BGT.RSTF 
- GND 
BGT.SETF 
- GND 
/BGT.TRST 
- OE 
240957-76 
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Declaration Segment ------------ 
TITLE EBRDY 
PATTERN 
A 
REVISION 2.0 
AUTHOR 
ISIC SILAS 
COMPANY 
INTEL 
DATE 
2/4/91 


CHIP 
x01 85C224 


This PLD contains the XBRDY, XMSWNDO, and XCTRCK state machines. 


;----------------------------------- 
Pin Declarations ---------------- 
PIN 
1 
CLK 
PIN 
2 
RESET 
PIN 
3 
CLENl 
PIN 
4 
WSDTS 
PIN 
5 
YSCEOC 
PIN 
6 
SNPCYC 
PIN 
7 
MSNPSTB 
PIN 
8 
CKENLC 
PIN 
9 
MKEN 
PIN 
13 
OEx 


PIN 
15 
CKEN 
PIN 
17 
SV 
PIN 
18 
PNDCEOC 
PIN 
19 
ENBRDY 
PIN 
20 
BRDY 
PIN 
21 
BRDYl 
PIN 
22 
MSWNDO 


EQUATIONS 


/CKEN - CKENLC * /MKEN 
+ /CKENLC * /CKEN 
+ /MKEN * /CKEN 
CKEN.TRST - VCC 


/SV.D :- /RESET * BRDY * /SV 
+ /RESET * CLENl * /SV 
+ /RESET * /ySCEOC * BRDY * ENBRDY * /PNDCEOC 
+ /RESET * /ySCEOC * CLENl * ENBRDY * /PNDCEOC 
SV .CLKF - CLK 
SV.RSTF - GND 
SV.SETF - GND 
/SV.TRST - OEx 


/PNDCEOC.D 
:- /RESET * /SV 
+ /RESET * BRDY * /PNDCEOC 
+ /RESET * CLENl * /PNDCEOC 
+ /RESET * /ySCEOC * ENBRDY * /PNDCEOC 
+ /RESET * /ySCEOC * /ENBRDY * PNDCEOC 
PNDCEOC.CLKF - CLK 
PNDCEOC.RSTF - GND 
PNDCEOC.SETF - GND 
/PNDCEOC.TRST - OEx 


/ENBRDY.D :- YSCEOC * PNDCEOC 
+ jENBRDY * PNDCEOC 
+ YSCEOC * /BRDY * /CLEN1 
+ /ySCEOC * BRDY * /ENBRDY 
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+ jYSCEOC * CLENl * /ENBRDY 
+ /BRDY * /CLENl * ENBRDY * /PNDCEOC 
+ RESET 
ENBRDY. CLKF 
- CLK 
ENBRDY.RSTF 
- GND 
ENBRDY.SETF 
- GND 
/ENBRDY.TRST 
- OEx 


/BRDY.D 
:- /RESET * CLENl * /BRDY 
+ /RESET * /PNDCEOC 
* jWSDTS * BRDY 
+ /RESET * jYSCEOC 
* /ENBRDY 
* jWSDTS * BRDY 
BRDY .CLKF 
- CLK 
BRDY.RSTF 
- GND 
BRDY.SETF 
- GND 
/BRDY.TRST 
- OEx 
11II 


/BRDY1.D 
:- /RESET * CLENl * /BRDYl 
+ /RESET * /PNDCEOC 
* jWSDTS * BRDYl 
+ /RESET * jYSCEOC 
* /ENBRDY 
* jWSDTS * BRDYl 
BRDYl .CLKF 
- CLK 
BRDY1.RSTF 
- GND 
BRDY1. SETF 
- GND 


/BRDY1.TRST 
- OEx 


/MSWNDO 
- /RESET * /SNPCYC 
+ /RESET * MSNPSTB * /MSWNDO 
MSWNDO.TRST 
- VCC 
240957-78 
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PIN 
"1. 
YMAOE 
PIN 
3 
MHLDA 
PIN 
4 
KCACHE 
PIN 
5 
CWR 
PIN 
6 
CMIO 
PIN 
7 
CDC 
PIN 
8 
LLEN 
PIN 
9 
C5MTHIT 
PIN 
10 
YSNPDIS 
PIN 
11 
NC1 


PIN 
13 
NC2 
PIN 
14 
YNOSWNDI 
PIN 
23 
YWRI 


PIN 
15 
YWR 
PIN 
16 
MTHIT 


PIN 
17 
MLEN 
PIN 
18 
MDC 
PIN 
19 
MMIO 
PIN 
20 
MWR 
PIN 
21 
MCACHE 
PIN 
22 
YNOSWND 


240957-79 


EQUATIONS 


fYWR - YMALE * /CWR 
+ /YMALE * IYWRI 
+ /CWR * IYWRI 


YWR.TRST - VCC 


/MTHIT - /C5MTHIT * /MWR * MHLDA 
MTHIT.TRST - MHLDA 


/MLEN - YMALE * /LLEN * /YMAOE 
+ /YMALE * /MLEN * /YMAOE 
+ /LLEN * /MLEN * /YMAOE 
MLEN.TRST - /YMAOE 


/MDC - YMALE * /CDC * /YMAOE 
+ /YMALE * /MDC * /YMAOE 
+ /CDC * /MDC * /YMAOE 


MDC.TRST - /YMAOE 


/MMIO - YMALE * /CMIO * /YMAOE 
+ /YMALE * /MMIO * /yMAOE 
+ /CMIO * /MMIO * /YMAOE 
MMIO.TRST - /YMAOE 


/MWR - YMALE * /CWR * /YMAOE 
+ /YMALE * /MWR * /YMAOE 
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+ /CWR * /MWR * jYMAOE 
MWR.TRST 
- jYMAOE 


/MCACHE 
- YMALE * /KCACHE * jYMAOE 
+ jYMALE * /MCACHE * jYMAOE 
+ /KCACHE * /MCACHE * jYMAOE 
MCACHE.TRST 
- jYMAOE 
. 


/YNOSWND 
- YMALE * /ySNPDIS 
+ YMALE * /CMIO 
+ YMALE * CWR * /KCACHE 
+ jYMALE * /YNOSWNDI 
+ /ySNPDIS * /YNOSWNDI 
+ /CMIO * /YNOSWNDI 
+ CWR * jKCACHE * /YNOSWNDI 
YNOSWND.TRST 
- VCC 
240957-80 
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Declaration Segment 
. 
TITLE EMBE 
PATTERN 
A 
REVISION 2.0 
AUTHOR 
ISlC SILAS 
COMPANY 
INTEL 
DATE 
2/5/91 


CHIP 
xOl 85C224 


This PLO generates the memory bus byte enables (MBEs) 


;·······························PinDeclarations···················· 


PIN 
1 
LBEO 
PIN 
2 
LBEl 
PIN 
3 
LBE2 
PIN 
4 
LBE3 
PIN 
5 
LBE4 
PIN 
6 
LBE5 
PIN 
7 
LBE6 
PIN 
8 
LBE7 
PIN 
9 
RDYSRC 
PIN 
10 
KCACHE 
PIN 
11 
YMALE 
PIN 
13 
YMAOE 
PIN 
14 
MBE61 
PIN 
23 
MBE7l 


PIN 
15 
MBE7 
PIN 
16 
MBE5 
PIN 
17 
MBE4 
PIN 
18 
MBE3 
PIN 
19 
MBE2 
PIN 
20 
MBE1 
PIN 
21 
MBEO 
PIN 
22 
MBE6 


EQUATIONS 


/MBE7 - /YMALE * /LBE7 * /YMAOE 
+ /YMALE * /KCACHE * jRDYSRC * /YMAOE 
+ YMALE * /MBE71 * /YMAOE 
+ /LBE7 * /MBE7l * /YMAOE 
+ /KCACHE * /RDYSRC * /MBE7I * /YMAOE 
MBE7.TRST - /YMAOE 


/MBE5 - /YMALE * /LBE5 * /YMAOE 
+ /YMALE * /KCACHE * /RDYSRC * /YMAOE 
+ YMALE * /MBE5 * /YMAOE 
+ /LBE5 * /MBE5 * /YMAOE 
+ /KCACHE * /RDYSRC * /MBE5 * /YMAOE 
MBE5.TRST - /YMAOE 


/MBE4 - /YMALE * /LBE4 * /YMAOE 
+ /YMALE * /KCACHE * /RDYSRC * /YMAOE 
+ YMALE * /MBE4 * /YMAOE 
+ /LBE4 * /MBE4 * /YMAOE 
+ /KCACHE * /RDYSRC * /MBE4 * /yMAOE 
MBE4.TRST - /YMAOE 


/MBE3 - /YMALE * /LBE3 * /YMAOE 
+ /YMALE * /KCACHE * /RDYSRC * /YMAOE 
+ YMALE * /MBE3 * /YMAOE 
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+ /LBE3 * /MBE3 * jYMAOE 
+ /KCACHE * /RDYSRC * /MBE3 * jYMAOE 
MBE3.TRST 
- jYMAOE 


/MBE2 
- jYMALE * /LBE2 * jYMAOE 
+ jYMALE * /KCACHE * /RDYSRC * jYMAOE 
+ YMALE * /MBE2 * jYMAOE 
+ /LBE2 * /MBE2 * jYMAOE 
+ /KCACHE 
* /RDYSRC 
* /MBE2 * jYMAOE 
MBE2.TRST 
- jYMAOE 


/MBEl 
- jYMALE * /LBEl * jYMAOE 
+ /YMALE * /KCACHE 
* /RDYSRC * /YMAOE 
+ YMALE * /MBEl * jYMAOE 
+ /LBEl * /MBEl * jYMAOE 
+ /KCACHE * /RDYSRC * /MBEl * jYMAOE 
MBEl.TRST 
- jYMAOE 


/MBEO 
- jYMALE * /LBEO * jYMAOE 
+ jYMALE * /KCACHE 
* /RDYSRC * jYMAOE 
+ YMALE * /MBEO * /YMAOE 
+ /LBEO * /MBEO * jYMAOE 
+ /KCACHE * /RDYSRC * /MBEO * /YMAOE 
MBEO.TRST 
- jYMAOE 


/MBE6 
- /YMALE * /LBE6 * /YMAOE 
+ /YMALE * /KCACHE 
* /RDYSRC * jYMAOE 
+ YMALE * /MBE61 * jYMAOE 
+ /LBE6 * /MBE61 * jYMAOE 
+ /KCACHE * /RDYSRC * /MBE61 * jYMAOE 
MBE6.TRST 
= jYMAOE 
240957-82 
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Pin Declarations 
00000000000000000 


PIN 
1 
MCLK 
PIN 
2 
MRESET 
PIN 
3 
YRGT 
PIN 
4 
YNOPIPE 
PIN 
5 
YPIPE 
PIN 
6 
MNA 
PIN 
7 
WMSWND 
PIN 
8 
YMEOC 
PIN 
9 
PXSAS 
PIN 
10 
YALLOC 
PIN 
11 
MSWNDI 
PIN 
13 
OE 
PIN 
14 
MBRDY 
PIN 
23 
YMADS 


PIN 
15 
YIMSWND 
PIN 
16 
YDRCTM 
PIN 
17 
NC1 
PIN 
18 
YMALE 
PIN 
19 
DISWND 
PIN 
20 
WMNA 
PIN 
21 
YMBRDY 
PIN 
22 
NC2 


EQUATIONS 


/YIMSWND - /MSWNDI * YALLOC * DISWND 
YIMSWND.TRST 
- VCC 


/yDRCTM.D 
;- /MRESET * /DISWND 
+ /MRESET * YMEOC * YPIPE * /yDRCTM 
YDRCTM.CLKF - MCLK 
YDRCTM.RSTF - GND 
YDRCTM.SETF - GND 
/yDRCTM.TRST - OE 


NCl.D 
;- VCC 
NC1.CLKF - MCLK 
NC1.RSTF - GND 
NC1.SETF - GND 
/NC1.TRST - OE 


/YMALE.D 
;- /MRESET * /yPIPE * /YMALE 
+ /MRESET * /YRGT * YMALE 
+ /MRESET * YNOPIPE * YMEOC * /YMALE 
+ /MRESET * WMSWND * YMEOC * /YMALE 
+ /MRESET * /YMADS * WMNA * YMEOC * /YMALE 
+ /MRESET * MNA * WMNA * YMEOC * /YMALE 
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YMALE.CLKF 
- MCLK 
YMALE.RSTF 
- CND 
YMALE.SETF 
- CND 
/YMALE.TRST 
- OE 


/DISWND.D 
:- /MRESET * /DISWND * YDRCTM 
+ jMRESET * jPXSAS * /YALLOC * YDRCTM 
DISWND. CLKF 
- MCLK 
DISWND.RSTF 
- CND 
DISWND.SETF 
- CND 
/DISWND.TRST 
- OE 


/WMNA.D 
:- jMRESET * /YNOPIPE 
* YMADS * YMEOC * fWMNA 
+ /MRESET * /YNOPIPE 
* YMADS *,/MNA * WMNA 
+ /MRESET * /YPIPE * YMADS * /MNA * jYMEOC * WMNA 
WMNA .CLKF 
= MCLK 
WMNA.RSTF 
- CND 
WMNA.SETF 
- CND 


jWMNA.TRST 
- OE 


/YMBRDY.D 
:- /MRESET * /MBRDY 
YMBRDY.CLKF 
- MCLK 


YMBRDY.RSTF 
- CND 


YMBRDY.SETF 
- CND 


/YMBRDY.TRST 
- OE 


NC2 - VCC 
NC2.TRST 
- VCC 
240957-84 
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Declaration Segment ------------ 
TITLE EMSNPST 
PATTERN 
A 
REVISION 2.0 
AUTHOR 
ISIC SILAS 
COMPANY 
INTEL 
DATE 
2/4/91 
"~, 


CHIP 
xOl 85C224 


This PLD contains the YALLC, YKEMLOCK, YSNPSTB, 
and YKBREQ state machines. 


;--------------------------------Pin 
Declarations------------------- 


PIN 
1 
MCLK 
PIN 
2 
KRESET 
PIN 
3 
KKEN 
PIN 
4 
KHLDA 
PIN 
5 
YWR 
PIN 
6 
YNOSWND 
PIN 
7 
YBGT 
PIN 
8 
XLRDYSRC 
PIN 
9 
RFO 
PIN 
10 
SNPDIS 
PIN 
11 
PALLC 
PIN 
13 
KLOCK 
PIN 
23 
PXSAS 


PIN 
16 
YKLOCK 
PIN 
17 
SV2 
PIN 
18 
HBASWB 
PIN 
19 
KBREQ 
PIN 
20 
MSNPSTB 
PIN 
21 
SVl 
PIN 
22 
YALLOC 


EQUATIONS 


/YKLOCK.D 
:- /KRESET * /yALLOC * YKLOCK 
+ /KRESET * /HBASWB * YKLOCK 
+ /KRESET * YBGT * /yALLOC * /HBASWB 
+ /KRESET * /KLOCK * /yALLOC * /HBASWB 
+ /KRESET * /YBGT * /KLOCK * YKLOCK 
YMLOCK.CLKF - MCLK 
YMLOCK.RSTF - GND 
YKLOCK.SETF - GND 
YKLOCK.TRST - VCC 


/SV2.D 
:- PXSAS * /SV2 
+ PXSAS * /KHLDA * /HBASWB 
SV2.CLKF - MCLK 
SV2.RSTF - GND 
SV2.SETF - GND 
SV2.TRST - VCC 


/HBASWB.D 
:- /KRESET * PXSAS * /HBASWB * SV2 
+ /KRESET * PXSAS * /YBGT * MBREQ * SV2 
HBASWB.CLKF - MCLK 
HBASWB.RSTF - GND 
HBASWB.SETF - GND 
HBASWB.TRST - VCC 


/KBREQ.D 
'- PXSAS * /KBREQ 
+ PXSAS * HBASWB * /SV2 
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+ jPXSAS 
* 'jYBCT * MBREQ * HBASWB * SV2 
+ MRESET 
MBREQ.CLKF 
- MCLK 
MBREQ.RSTF 
- CND 
MBREQ.SETF 
- CND 
MBREQ.TRST 
- VCC 


jMSNPSTB.D 
:- jMRESET 
* jYBCT * YNOSWND 
* YWR * MSNPSTB 
+ jMRESET 
* jYBCT * YNOSWND 
* XLRDYSRC 
* MSNPSTB 
+ jMRESET 
* jYBCT * YNOSWND 
* RFO * MSNPSTB 
MSNPSTB.CLKF 
- MCLK 
MSNPSTB.RSTF 
- CND 
MSNPSTB.SETF 
- CND 
MSNPSTB.TRST 
- VCC 


jSVl.D 
:- jYALLOC 
SVl .CLKF 
- MCLK 
2 


SVl .RSTF 
- CND 
SVl.SETF 
- CND 
SVl.TRST 
- VCC 


jYALLOC.D 
:- jMRESET 
* PXSAS * jYALLOC 
* jSVl 
+ jMRESET 
* jMKEN * jYALLOC 
* SVl 
+ jMRESET 
* jYBCT * jPALLC * SNPDIS * jRFO * YALLOC 
YALLOC. CLKF 
- MCLK 
YALLOC.RSTF 
- CND 
YALLOC.SETF 
- CND 
YALLOC.TRST 
- VCC 


.' 
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PIN 
1 
HCLK 
PIN 
2 
MRESET 
PIN 
3 
MAOE 
PIN 
4 
MHLDA 
PIN 
5 
YNOPIPE 
PIN 
6 
YPIPE 
PIN 
7 
HCACHE 
PIN 
8 
YMEOC 
PIN 
9 
HEMZBTEN 
PIN 
10 
SYNC 
PIN 
11 
MALDRV 
PIN 
13 
FLUSH 
PIN 
14 
NCPFLD 
PIN 
15 
FPFLDEN 
PIN 
23 
NC4 


PIN 
16 
NC1 
PIN 
17 
NC2 
PIN 
18 
NC3 
PIN 
19 
YMZBT 
PIN 
20 
FPFLD 
PIN 
21 
YFLUSH 
PIN 
22 
YSYNC 


EQUATIONS 


NC1 - VCC 
NC1.TRST 
- VCC 


NC2 - VCC 
NC2.TRST 
- VCC 


NC3 - VCC 
NC3.TRST 
- VCC 


/YMZBT.D 
:- /MRESET * YPIPE * YMEOC * jYMZBT 
+ /MRESET * /MCACHE * YMEOC * jYMZBT 
+ /MRESET * YNOPIPE * jYPIPE * /MCACHE * /MEMZBTEN 
+ /MRESET * YNOPIPE * /YPIPE * /MCACHE 
* jYMZBT 
+ /MRESET * MHLDA * /MAOE * /MEMZBTEN 
* YMZBT 
+ /MRESET * jYNOPIPE.* 
/MCACHE 
* /MEMZBTEN 
* YMZBT 
+ MRESET 
YMZBT. CLKF - MCLK 
YMZBT.RSTF 
= GND 
YMZBT.SETF 
- GND 
YMZBT.TRST 
- VCC 
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jFPFLD 
- FPFLDEN * HRESET 
FPFLD.TRST 
- HRESET 


jYFLUSH 
- HRESET * jNCPFLD 
+ jHRESET * jFLUSH 
YFLUSH.TRST 
- VCC 


jYSYNC 
- HRESET * jMALDRV 
+ jHRESET * jSYNC 


YSYNC.TRST 
- VCC 
240957-88 
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;-----------------------------Declaration 
Segment--------------------- 
TITLE 
ESIGGEN 
PATTERN 
REVISION 
1.0 
AUTHOR 
COMPANY 
INTEL 
DATE 
CHIP 
INTEL 85C224 


This PLD drives memory bus and core signals based on the states 
of other state machines 


;-----------------------------Pin 
Declarations------------------------- 


PIN 
1 
YDRCTM 
PIN 
2 
YMADS 
PIN 
3 
YMAOE 
PIN 
4 
MHLDA 
PIN 
5 
NCl 
PIN 
6 
MWBWT 
PIN 
7 
MDRCTM 
PIN 
8 
SNPDIS 
PIN 
9 
UNI 
PIN 
10 
YMSEL 
PIN 
11 
TR4 
PIN 
13 
YMFRZ 
PIN 
14 
MDLDRV 
PIN 
23 
LMRST 


15 
16 
17 
18 
19 
20 
21 
22 


C8MSEL 
NC2 
CDRCTM 
CWBWT 
MBOFF 
MADS 
YSNPDIS 
C8MFRZ 


PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 
PIN 


EQUATIONS 


/C8MSEL - LMRST * jTR4 
+ /LMRST * jYMSEL 
C8MSEL.TRST - VCC 


NC2 - vcc 
NC2.TRSr - VCC 


CDRCTM - MDRCTM * YDRCTM 
CDRCTM.TRST - VCC 


/CWBWT - jMWBWT * YDRCTM 
CWBWT.TRST - VCC 


/MBOFF - YMAOE * /MHLDA 
MBOFF.TRST - jMHLDA 


/MADS - jYMADS * jYMAOE 
MADS.TRST - jYMAOE 


YSNPDIS - SNPDIS * UNI 
YSNPDIS.TRST - VCC 


/C8MFRZ - LMRST * /MDLDRV 
+ /LMRST * jYMFRZ 
C8MFRZ.TRST - VCC 
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;--------------------------Dec1aration 
Segment-------------------------- 
TITLE ESWND 
PATTERN 
A 
REVISION 2.0 
AUTHOR 
ISIC SILAS 
COMPANY 
INTEL 
DATE 
2/4/91 


CHIP 
x01 85C224 


This PLO contains the XCRDY. XSWND. and XENSWND state machines. 


;---------------------------Pin 
Dec1arations----------------------------- 
PIN 
1 
CLK 
PIN 
2 
RESET 
PIN 
3 
WSDTS 
• 


PIN 
4 
BGT 
PIN 
5 
PBGT 
PIN 
6 
TR4 
PIN 
7 
YSMSWND 
PIN 
8 
SNPDIS 
PIN 
9 
YSMEOC 
PIN 
10 
SLFTST 
PIN 
13 
OEx 


PIN 
16 
ENSWND 
PIN 
17 
SV3 
PIN 
18 
SWEND 
PIN 
19 
SV2 
PIN 
20 
SV1 
PIN 
21 
CRDY 
PIN 
22 
CRDYl 


EQUATIONS 


ENSWND.D :- /RESET * /ySMSWND 
ENSWND.CLKF - CLK 
ENSWND.RSTF - GND 
ENSWND.SETF - GND 
/ENSWND.TRST,- OEx 


/SV3.D 
:- RESET * TR4 
+ /RESET * CRDY * SWEND * /SV3 
+ /RESET * /PBGT * /ENSWND * /ySMSWND * CRDY * SV3 
+ /RESET * /PBGT * CRDY * /SNPDIS * /SWEND * SV3 
+ /RESET * /PBGT * /ENSWND * /YSMSWND * SWEND * SV3 
SV3 .CLKF - CLK 
SV3.RSTF - GND 
SV3 .SETF - GND 
/SV3.TRST - OEx 


/8WEND.D 
:- RESET * TR4 
+ /RESET * /CRDY * SWEND * /SV3 
+ /RESET * PBGT * CRDY * /SWEND * SV3 
+ /RESET * /BGT * /SNPDIS * SWEND * SV3 
+ /RESET * ENSWND * CRDY * SNPDIS * /SWEND * SV3 
+ /RESET * YSMSWND * CRDY * SNPDIS * /SWEND * SV3 
+ /RESET * /BGT * /ENSWND * /YSMSWND * SWEND * SV3 
+ /RESET * /PBGT * /ENSWND * /ySMSWND * /CRDY * /SWEND * SV3 
SWEND.CLKF - CLK 
SWEND.RSTF - GND 
SWEND.SETF - GND 
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jSWEND.TRST 
- OEx 


/SV2.D 
:- RESET * jSLFTST 
+ jRESET * jYSMEOC 
* CRDY * jSV2 
+ jRESET * jYSMEOC 
* jCRDY * SV2 
SV2.CLKF 
- CLK 
SV2.RSTF 
= GND 
SV2.SETF 
- GND 
jSV2.TRST 
- OEx 


/SV1.D 
:= jRESET * jYSMEOC * CRDY 
+ jRESET * CRDY * jSVl * SV2 
SV1. CLKF 
- CLK 
SV1. RSTF 
- GND 
SVl. SETF 
- GND 


/SV1.TRST 
- OEx 


jCRDY.D 
:- RESET * jSLFTST 
+ jRESET * jYSMEOC 
* jWSDTS * CRDY * SV2 
+ jRESET * jWSDTS * CRDY * jSVl * SV2 
CRDY. CLKF 
- CLK 
CRDY.RSTF 
- GND 
CRDY. SETF 
= GND 


/CRDY.TRST 
- OEx 


/CRDY1.D 
:- RESET * jSLFTST 
+ jRESET * jYSMEOC 
* jWSDTS * CRDYl * SV2 
+ jRESET * jWSDTS * CRDYl * jSVl * SV2 
CRDY1. CLKF 
- CLK 
CRDY1.RSTF 
- GND 
CRDYl. SETF 
- GND 


/CRDY1.TRST 
- OEx 
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;-------------------------------Declaration 
TITLE EWCPLB 
PATTERN 
A 
REVISION 
2.0 
AUTHOR 
ISIC SILAS 


COMPANY 
INTEL 
DATE 
2/4/9l 


Segment---------------- 


CHIP 
xOl 85C224 


This PLD contains the XWCPLB and YCPULEN state machines. 


;-----------------------------Pin 
Declaration----------------------- 
PIN 
1 
CLK 
PIN 
2 
RESET 
PIN 
3 
CRDY 
PIN 
4 
RDYSRC 
• 


PIN 
5 
BGT 
PIN 
6 
PBGT 
PIN 
7 
KCACHE 
PIN 
8 
LEN 
PIN 
9 
CACHE 
PIN 
10 
CKEN 
PIN 
11 
BRDY 
PIN 
13 
OEx 


PIN 
16 
CLEN4 
PIN 
17 
CLEN2 
PIN 
18 
CLENl 
PIN 
19 
LKCACHE 
PIN 
20 
SV 
PIN 
21 
CPUEN 
PIN 
22 
WCPLB 


EQUATIONS 


/CLEN4.D 
'- CPUEN * /CLEN2 * /CLEN4 
+ /BRDY * /CLENl 
+ BRDY * CLEN2 * ICLEN4 
+ /CACHE * /CKEN * /LKCACHE * /CLEN2 * /CLEN4 
+ RESET 
CLEN4.CLKF - CLK 
CLEN4.RSTF - GND 
CLEN4.SETF - GND 
/CLEN4.TRST - OEx 


/CLEN2.D 
:- CPUEN * /CLEN2 * /CLEN4 
+ BRDY * /CLEN2 * CLEN4 
+ /BRDY * CLEN2 * CLEN4 
+ LEN * CACHE * /CLEN2 * /CLEN4 
+ LEN * CKEN * /CLEN2 * /CLEN4 
+ LEN * LKCACHE * /CLEN2 * /CLEN4 
+ RESET 
CLEN2.CLKF - CLK 
CLEN2.RSTF - GND 
CLEN2.SETF - GND 
/CLEN2.TRST - OEx 


/CLEN1.D 
:- /RESET * BRDY * /CLENl 
+ /RESET * /BRDY * /CLEN2 * CLEN4 
+ /RESET * /CPUEN * /LEN * CACHE * /CLEN2 * /CLEN4 
+ /RESET * /CPUEN * /LEN * CKEN * /CLEN2 * /CLEN4 
+ /RESET * /CPUEN * /LEN * LKCACHE * /CLEN2 * /CLEN4 


CLEN1.CLKF - CLK 
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CLENl.RSTF 
- CND 
CLENl.SETF 
- CND 


/CLENl.TRST 
- OEx 


/LKCACHE.D 
:- /KCACHE 
LKCACHE .CLKF 
- CLK 
LKCACHE.RSTF 
- CND 
LKCACHE.SETF 
- CND 
/LKCACHE.TRST 
- OEx 


/SV.D 
:- /RESET * CRDY * /SV 
+ /RESET * /RDYSRC 
* /BCT * CPUEN * SV 
+ /RESET * CRDY * /BRDY * /CLENl * WCPLB * /CPUEN 
SV. CLKF 
- CLK 
SV.RSTF 
- GND 
SV.SETF 
- GND 


/SV.TRST 
- OEx 


/CPUEN:D 
:- /RESET * BRDY * /CPUEN 
+ /RESET * CLENl * /CPUEN 
+ /RESET * RDYSRC * /BGT * /WCPLB 
+ /RESET 
* RDYSRC * /BGT * CPUEN * SV 
CPUEN.CLKF 
- CLK 
CPUEN.RSTF 
- GND 
CPUEN.SETF 
-·GND 


/CPUEN.TRST 
- OEx 


/WCPLB.D 
:- /RESET * BRDY * /WCPLB 
+ /RESET * CLENl * /WCPLB 
+ /RESET * /CRDY * BRDY * /CPUEN 
+ /RESET * /CRDY * CLEN! * /CPUEN 
WCPLB. CLKF 
- CLK 
WCPLB.RSTF 
- GND 
WCPLB.SETF 
- GND 
/WCPLB.TRST 
- OEx 
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;-----------------------------Declaration 
Segment------------------------ 
TITLE EWMSWND 
PATTERN 
A 
REVISION 2.0 
AUTHOR 
ISIC SILAS 
COMPANY 
INTEL 
DATE 
2/4/91 


CHIP 
xOl 85C224 


This PLO contains the YENMSWND. YWMSWND. and YENXSAS state machines. 


;----------------------------Pin 
Declarations------------------------------ 
PIN 
1 
MCLl< 
PIN 
2 
MRESET 
PIN 
3 
XSAS 
PIN 
4 
XSNPWB 
PIN 
5 
YPIPE 
PIN 
6 
YNOPIPE 
PIN 
7 
YMEOC 
PIN 
8 
MHITMI 
PIN 
9 
YMSWEND 
PIN 
10 
YNOSWND 
PIN 
11 
YBCT 
PIN 
13 
OEx 
PIN 
14 
YALLOC 
PIN 
23 
PCTCXFR 


PIN 
15 
UNUSED 
PIN 
16 
PSWBAS 
PIN 
17 
SV 
PIN 
18 
WMSWND 
PIN 
19 
ENMSWND 
PIN 
20 
ENXSAS 
PIN 
21 
PXSAS 
PIN 
22 
YSWEHITM 


EQUATIONS 


UNUSED - VCC 
UNUSED.TRST - VCC 


/PSWBAS - /XSAS * /XSNPWB * /ENXSAS 
PSWBAS.TRST - VCC 


/SV.D 
:- YMEOC * /WMSWND * /SV 
+ /YNOSWND * jYPIPE * YMEOC * /WMSWND 
+ jYPIPE * /YMSWEND * /ENMSWND * YMEOC * /WMSWND 
SV.CLl<F- MCLl< 
SV.RSTF - CND 
SV.SETF - CND 
/SV.TRST - OEx 


I /WMSWND.D :- /MRESET * YMEOC * /WMSWND 
+ /MRESET * /WMSWND * /SV 
+ /MRESET * /YNOSWND * jYPIPE * /WMSWND 
+ /MRESET * /YNOSWND * jYBCT * WMSWND 
+ /MRESET * jYPIPE * /YMSWEND * /ENMSWND * /WMSWND 
+ /MRESET * YPIPE * /PCTCXFR * /YALLOC * /WMSWND 
+ /MRESET * /YMSWEND * /ENMSWND * /YALLOC * WMSWND 
+ /MRESET * YNOSWND * /YNOPIPE * /YMSWEND * /ENMSWND * WMSWND 
WMSWND.CLl<F- MCLl< 
WMSWND .RS.TF- CND 
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WMSWND.SETF 
- GND 


(WMSWND.TRST 
- OEx 


jENMSWND.D 
:- YMSWEND 
ENMSWND.CLKF 
- MCLK 
ENMSWND.RSTF 
- GND 


ENMSWND.SETF 
- GND 


jENMSWND.TRST 
- OEx 


jENXSAS.D 
:- YBGT * jENXSAS 
+ XSAS * ENXSAS 
+ MRESET 
ENXSAS.CLKF 
- MCLK 
ENXSAS.RSTF 
- GND 
ENXSAS.SETF 
- GND 


jENXSAS.TRST 
- OEx 


jPXSAS 
- jXSAS * XSNPWB * jENXSAS 
PXSAS.TRST 
- VCC 


jYSWEHITM 
- /YMSWEND * jENMSWND * jMHITMI * YALLOC 
+ /YMSWEND * jENMSWND * jMHITMI * YNOPIPE * YPIPE 


YSWEHITM.TRST 
= VCC 
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Embedded 
Architecture 
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20 MIPS· 
Burst Execution 
at 20 MHz 
- 
7.5 MIPS Sustained 
Execution 
at 
20 MHz 


• 
512-Byte 
On-Chip 
Instruction 
Cache 
- 
Direct Mapped 
- 
Parallel Load/Decode 
for Uncached 
Instructions 


• 
Multiple 
Register 
Sets 
- 
Sixteen 
Global 32-Blt Registers 
- 
Sixteen 
Local 32-Blt Registers 
- 
Four Local Register 
Sets Stored 
On- 
Chip 
- 
Register 
Scoreboarding 


• 
Software 
Compatible 
with S0960KA/ 


KB/CA/CF 
Processors 


• 
Pin Compatible 
with S0960SB 


• 
Built-In Interrupt 
Controller 
- 
4 Direct Interrupt 
Pins 
- 
31 Priority 
Levels, 256 Vectors 


• 
Easy to Use, High Bandwidth 
16-Blt Bus 


- 
32 Mbytes/s 
Burst 
- 
Up to 16 Bytes Transferred 
per Burst 


• 
32-Bit Address 
Space, 4 Glgabytes 


• 
SO-Lead Quad Flat Pack (EIAJ QFP) 


• 
S4-Lead Plastic Leaded 
Chip Carrier 


(PLCC) 


The 80960SA is a member of Intel's i960@32-bit processor family,'which is designed especially for low cost 
embedded applications. It includes a 512·byte instruction cache and a built-in interrupt controller. The 
80960SA has a large register set, multiple parallel execution units and a 16-bit burst bus. Using advanced 
Rise technology, this high performance processor is capable of execution rates in excess of 7.5 million 
instructions per second". The 80960SA is well-suited for a wide range of cost sensitive embedded applications 
including non-impact printers, network adapters and 1/0 controllers. 
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1.0 
THE i960@ PROCESSOR 


The 80960SA is a member of the 32-bit architecture 
from Intel known as the i960 processor family. 
These microprocessors were especially designed to 
serve the needs of embedded applications. The em- 
bedded market includes applications as diverse as 
industrial automation, avionics, image processing, 
graphics and networking. These types of applica- 
tions require high integration, low power consump- 
tion, quick interrupt response times and high per- 
formance. Since time to market is critical, embedded 
microprocessors need to be easy to use in both 
hardware and software designs. 


intel~ 


All members of the i960 processor family share a 
common core architecture which utilizes RiSe tech- 
nology so that, except for special functions, the fam- 
ily members are object-code compatible. Each new 
processor in the family adds its own special set of 
functions to the core to satisfy the needs of a specif- 
ic application or range of applications in the embed- 
ded market. 


Software written for the 80960SA will run without 
modification on any other member of the 80960 
Family. This processor is pin-compatible with the 
80960S8 which includes an integrated floating-point 
unit. 


0000 
OOOOH 


ADDRESS 
SPACE 
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ARCHITECTURALLY 
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STRUCTURES 


FETCH 
STORE 


INSTRUCTION 
STREAM 


INSTRUCTION 
EXECUTION 


LOAD 


16 
32-BIT 
GLOBAL 
REGISTERS 


CONTROL 
REGISTERS 


Figure 2. 80960SA 
Programming 
Environment 


intel~ 


1.1 Key Performance Features 


The 809~OSAarchitecture is based on the most re- 
cent advances in microprocessor technology and is 
grounded in Intel's long experience in the design 
and manufacture of embedded microprocessors. 
Many features contribute to the 80960SA's excep- 
tional performance: 
1. Large 
Register 
Set. Having a large number of 
registers reduces the number of times that a 
processor needs to access memory. Modem 
compilers can take advantage of this feature to 
optimize execution speed. For maximum flexibili- 
ty, the 80960SA provides thirty-two 32-bit regis- 
ters. (See Figure 2.) 
2. Fast 
Instruction 
Execution. 
Simple functions 
make up the bulk of instructions in most pro- 
grams so that execution speed can be improved 
by ensuring that these core instructions are exe- 
cuted as quickly as possible. The most frequently 
executed instructions-such 
as register-register 
moves, 
add/subtract, 
logical 
operations 
and 
shifts-execute 
in one to two cycles. (Table 1 
contains a list of instructions.) 
3. Load/Store 
Architecture. 
One way to improve 
execution speed is to reduce the number of times 
that the processor must access memory to per- 
form an operation. As with other processors 
based on RiSe technology, the 80960SA has a 
Load/Store architecture. As such, only the LOAD 
and STORE instructions reference memory; all 
other instructions operate on registers. This type 
of architecture 
simplifies instruction decoding 
and is used in combination with other techniques 
to increase parallelism. 
4. Simple 
Instruction 
Formats. 
All instructions in 
the 80960SA are 32 bits long and must be 
aligned on word 
boundaries. This alignment 
makes it possible to eliminate the instruction 
alignment stage in the pipeline. To simplify the 
instruction decoder, there are only five instruction 
formats; each instruction uses only one format. 
(See Figure 3.) 


80960SA 


5. Overlapped 
Instruction 
Execution. 
Load oper- 


ations allow execution of subsequent instructions 
to continue before the data has been returned 
from memory, so that these instructions can 
overlap the load. The 80960SA manages this 
process transparently to software through the 
use of a register scoreboard. Conditional instruc- 
tions also make use of a scoreboard so that sub- 
sequent unrelated instructions may be executed 
while the conditional instruction is pending. 


6. Integer 
Execution 
Optimization. 
When the re- 
sult of an arithmetic execution is used as an oper- 
and in a subsequent calculation, the value is sent 
immediately to its destination register. At the 
same time, the value is put on a bypass path to 
the ALU, thereby saving the time that otherwise 
would be required to retrieve the value for the 
next operation. 


7. Bandwidth 
Optlmlzatlons. 
The 80960SA gets 
optimal use of its memory bus bandwidth be- 
cause the bus is tuned for use with the on-chip 
instruction cache: instruction cache 
line size 
matches the maximum burst size for instruction 
fetches. The 80960SA automatically fetches four 
words in a burst and stores them directly in the 
cache. Due to the size of the cache and the fact 
that it is continually filled in anticipation of need- 
ed instructions in the program flow, the 80960SA 
is relatively insensitive to memory wait states. 
The benefit is that the 80960SA delivers out- 
standing performance even with a low cost mem- 
ory system. 


8. Cache Bypass. If a cache miss occurs, the proc- 


essor fetches the needed instruction then sends 
it on to the instruction decoder at the same time it 
updates the cache. Thus, no extra time is spent 
to load and read the cache. 
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Table 
1. 80960SA 
Instruction 
Set 


Data Movement 
Arithmetic 
Logical 
Bit and Bit Field 


Load 
Add 
And 
Set Bit 
Store 
Subtract 
Not And 
Clear Bit 
Move 
Multiply 
And Not 
Not Bit 
Load Address 
Divide 
Or 
Check Bit 
Remainder 
Exclusive Or 
Alter Bit 
Modulo 
Not Or 
Scan For Bit 
Shift 
Or Not 
Scan Over Bit 
Extended 
Multiply 
Nor 
Extract 
Extended 
Divide 
Exclusive 
Nor 
Modify 
Not 
Nand 
Rotate 


Comparison 
Branch 
Call/Return 
Fault 


Compare 
Unconditional 
Branch 
Call 
Conditional 
Fault 
Conditional 
Compare 
Conditional 
Branch 
Call Extended 
Synchronize 
Faults 


Compare 
and Increment 
Compare 
and Branch 
Call System 
Compare 
and Decrement 
Return 
Branch and Link 


Debug 
Miscellaneous 
Decimal 


Modify Trace Controls 
Atomic Add 
Move 
Mark 
Atomic 
Modify 
Add and Carry 


Force Mark 
Flush Local Registers 
Subtract 
with Carry 
Modify Arithmetic 
Controls 
Scan Byte for Equal 
Test Condition 
Code 


Synchronous 


Synchronous 
Load 
Synchronous 
Move 
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Control 
Displacement 
r=I 


I_O_P_C_O_d_e..LI_R_e_g_I,-_B_a_s_e_ 
,-_M_I~,- 
O_ff_s_e_t--, 


I_O_P_C_O_d_e..LI_R_e_g_,-I_B_a_s_e_1 
Mode 


Compare 
and 
Branch 
I Opcode 
I Reg/Lit 
I 
Reg 
I 


Register 
to 
Register 


Memory Aycess- 


Short 


Memory Access- 


Long 


M 
Displacement 


EB 
Offset 


Displacement 


Figure 3. Instruction 
Formats 


1.1.1 
MEMORY 
SPACE 
AND ADDRESSING 
MODES 


The 60960SA 
offers 
a linear programming 
environ- 
ment so that all programs 
running 
on the processor 
are contained 
in a single address 
space. 
Maximum 
address 
space size is 4 Gigabytes 
(232 bytes). 


For ease of use the 60960SA 
has a small number of 
addressing 
modes, 
but includes 
all those 
necessary 
to ensure efficient 
execution 
of high-level 
languages 
such 
as C. Table 
2 lists 
the 
memory 
addressing 
modes. 


Table 2. Memory Addressing Modes 


• 
12-Bit Offset 


• 
32-Bit Offset 


• 
Register-Indirect 


• 
Register + 12-Bit Offset 


• 
Register + 32-Bit Offset 


• 
Register + (Index-Register 
x Scale-Factor) 


• 
Register 
x Scale Factor + 32-Bit Displacement 


• 
Register 
+ 
(Index-Register 
x Scale-Factor) 
+ 
32-Bit Displacement 


Scale-Factor 
is 1, 2, 4, 8 or 16 


1.1.2 
DATA 
TYPES 


The 80960SA 
recognizes 
the following 
data types: 


Numeric: 


• 
6-, 16-, 32- and 64-bit ordinals 


• 
8-, 16-, 32- and 64-bit integers 


Non-Numeric: 


• 
Bit 


• 
Bit Field 


• 
Triple Word (96 bits) 


• 
Quad-Word 
(128 bits) 


1.1.3 
LARGE 
REGISTER 
SET 


The 80960SA 
programming 
environment 
includes 
a 


large 
number 
of registers. 
In fact, 
32 registers 
are 


available 
at any time. The availability 
of this 
many 


registers 
greatly reduces 
the number of memory 
ac- 


cesses 
required 
to perform 
algorithms, 
which 
leads 


to greater 
instruction 
processing 
speed. 


There 
are 
two 
types 
of 
general-purpose 
register: 


local and global. The global registers 
consist 
of six- 


teen 32-bit 
registers 
(gO though 
g15). These 
regis- 


ters 
perform 
the 
same 
function 
as 
the 
general- 


• 
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dure calls can be made ~ithoutha;i~g-to a~~;~; 
the 
procedure stack resident in memory. 
Global registers are not exchang~d on a proc~dure 
call but retain their contents, making them available 
to ~II procedures for fast parameter passing. 


ONE OF FOUR 
LOCAL 
REGISTER SETS--.... 


REGISTER 
CACHE 


LOCAL REGISTER SET 
---=:: 
I~ 


rO 


31 
r15 
o 
272206-3 


3-8 


Figure 4. Multiple Register 
Sets Are Stored 
On-Chip 


••••.••••••••.•••••••• 
__ 
•• ...,••..., •.••••...,. , u • ...,"•••••• 
""••• 
t"'''' •••...•..", 
gl~V- 


rithm increases the probability that an instruction will 
already be in the cache when it is needed. 


Code for small loops often fits entirely within the 
cache, leading to a great increase in ~rocessing 
speed since further memory ref.erencesmlgh! n~t be 
necessary until the program exits the loop. Similarly, 
when calling short procedures, the code for the cal~- 
ing procedure is likely to remain in the cache so It 
will be there on the procedure's return. 


1.1.6 
REGISTER 
SCOREBOARDING 


The instruction decoder is optimized in several ways. 
One optimization method is the ability to overlap 
instructions by using register scoreboarding. 


Register scoreboarding occurs when a LOAD moves 
a variable from memory into a register. When the 
instruction initiates, a scoreboard bit on the target 
register is set. Once the register is loaded, the bit is 
reset. In between, any reference to the register con- 
tents is accompanied by a test of the scoreboard bit 
to ensure that the load has completed before pro- 
cessing continues. Since the processor does not 
need to wait for the LOAD to complete, it can exe- 
cute additional instructions placed between the 
LOAD and the instruction that uses the register con- 
tents, as shown in the following example: 


ld 
data_2. 
r4 
ld 
data_2. 
r5 
Unrelated 
instruction 
Unrelated 
instruction 


add 
r4. 
r5. 
r6 


memory and 1/0 subsystem interfaces. The proces- 
sor uses the bus to fetch instructions, manipulate 
memory and respond to interrupts. Bus features 
include: 
• 16-bit data path multiplexed onto the lower bits of 


the 32-bit address path 
• 
• Eight 
16-bit half-word 
burst capability 
which 
allows transfers from 1 to 16 bytes at a time 


• High bandwidth reads and writes with 32 Mbytesl s 


burst (at 20 MHz) 


Table 3 defines bus signal names and functions; Ta- 
ble 4 defines other component-support signals such 
as interrupt lines. 


1.1.8 
INTERRUPT 
HANDLING 


The B0960SAcan be interrupted in one of two ways: 
by the activation of one of four interrupt pins or by 
sending a message on the processor's data bus. 


The B0960SA is unusual in that it automatically han- 
dles interrupts on a priority basis and can keep track 
of pending interrupts through its on-chip interrupt 
controller. Two of the interrupt pins can be config- 
ured to provide B259A-style handshaking for expan- 
sion beyond four interrupt lines. 


1.1.9 
DEBUG 
FEATURES 


The B0960SA has built-in debug capabilities. There 
are two types of breakpoints and six trace modes. 
Debug features are controlled by two internal 32-bit 
registers, the Process-Controls Word and the Trace- 
Controls Word. By setting bits in these control 
words, a software debug monitor can closely control 
how the processor responds during program execu- 
tion. 
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The 80960SA provides two hardware breakpoint 
registers on-chip which, by using a special com- 
mand, can be set to any value. When the instruction 
pointer matches either breakpoint regi~ter value, the 
breakpoint handling routine is automatically called. 


The 80960SA also provides software breakpoints 
through the use of two instructions: MARK and 
FMARK. These can be placed at any point in a pro- 
gram and cause the processor .to halt e~ecutio~ at 
that point and call the breakpoint handling routine. 
The breakpoint mechanism is easy to use and pro- 
vides a powerful debugging tool. 


Tracing is available for instructions (single step exe- 
cution), calls and returns and branching. E~ch trace 
type may be enabled separately by a special debug 
instruction. In each case, the 80960SA executes the 
instruction first and then calls a trace handling rou- 
tine (usually part of a software debug monitor). Fur- 
ther program execution is halted until the routine 
completes, at which time execution resumes at the 
next instruction. The 80960SA's tracing mecha- 
nisms, implemented completely in hardware, greatly 
simplify the task of software test and debug. 


1.1.10 
FAULT 
DETECTION 


The 80960SA has an automatic mechanism to han- 
dle faults. Fault types include trace and arithmetic 
faults. When the processor detects a fault, it auto- 
matically calls the appropriate fault handling routine 
and saves the current instruction pointer and neces- 
sary state information to make efficient recovery 
possible. Like interrupt handling routines, fault han- 
dling routines are usually written to meet the needs 
of specific applications and are often included as 
part of the operating system or kernel. 
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For each of the fault types, there are numerous sub- 
types that provide specific information about a fault. 
The fault handler can use this specific information to 
respond correctly to the fault. 


1.1.11 
BUILT-IN 
TESTABILITY 


Upon reset, the 80960SA automatically conducts an 
exhaustive internal test of its major blocks of logic. 
Then, before executing its first instruction, it does a 
zero check sum on the first eight words in memory 
to ensure that the memory image was programmed 
correctly. If a problem is discovered at any point dur- 
ing the self-test, the 80960SA asserts its FAIL pin 
and will not begin program execution. Self test takes 
approximately 24,000 cycles to complete. 


System manufacturers can use the 80960SA's self- 
test feature during incoming parts inspection. No 
special diagnostic programs need to be written. T~~ 
test is both thorough and fast. The self-test capabili- 
ty helps ensure that defective parts are.discov~red 
before systems are shipped and, once In the field, 
the self-test makes it easier to distinguish between 
problems caused by processor failure and problems 
resulting from other causes. 


1.1.12 CHMOS 


The 80960SA is fabricated using Intel's CHMOS IV 
(Complementary High Speed Metal Oxide Semicon- 
ductor) 
process. The 
80960SA 
is 
available 
at 
10 MHz in both PLCCand ·QFPpackages, and at 16 
and 20 MHz in the PLCC package. 


80960SA 


Table 3. 80960SA 
Pin Description: 
Bus Signals 


Name 
Type 
Description 


CLK2 
I 
SYSTEM CLOCK provides the fundamental 
timing for 80960SA 
systems. 
It is divided 


by two inside the 80960SA 
to generate 
the internal processor 
clock. 


A31:16 
0 
ADDRESS 
BUS carries the upper 16 bits of the 32-bit physical address to memory. 
It 


T.S. 
is valid throughout 
the burst cycle; no latch is required. 


AD15:1, 
DO 
I/O 
ADDRESS/DATA 
BUS carries the low order 32-bit addresses 
and 16-bit data to and 


T.S. 
from memory. AD15:4 must be latched since the cycle following 
the address cycle 
carries data on the bus. 


A3:1 
0 
ADDRESS 
BUS carries the word addresses 
of the 32-bit address to memory. These 
T.S. 
three bits are incremented 
during a burst access indicating 
the next word address 
of 


the burst access. 
Note that A3:1 are duplicated 
with AD3:1 during the address cycle. 


ALE 
0 
ADDRESS 
LATCH ENABLE indicates 
the transfer 
of a physical address. ALE is 
T.S. 
asserted 
during a Ta cycle and deasserted 
before the beginning 
of the Td state. It is 
active high and floats to a high impedance 
state during a hold cycle (Th). 


AS 
0 
ADDRESS 
STATUS 
indicates 
an address state. AS is asserted 
every Ta state and 
T.S. 
deasserted 
during the following 
Td state. AS is driven HIGH during reset. 


W/R 
0 
WRITE/READ 
specifies, 
during aT a cycle, whether 
the operation 
is a write or read. It 


O.D. 
is latched on-chip and remains valid during Td cycles. 


DEN 
0 
DATA ENABLE is asserted 
during Td cycles and indicates 
transfer 
of data on the AD 


T.S. 
lines. The AD lines should not be driven by an external 
source unless DEN is 
asserted. 
When DEN is asserted, 
outputs from the previous cycle are guaranteed 
to 
be three-stated. 
In addition, 
DEN deasserted 
indicates 
inputs have been captured 
and 
therefore 
input hold times can be disregarded. 
DEN is driven HIGH during reset. 


DT/R 
0 
DATA TRANSMIT/RECEIVE 
indicates 
the direction 
of data transfer to and from the 
T.S. 
bus. It is low during Ta and Td cycles for a read or interrupt 
acknowledgment; 
it is high 
during Ta and Td cycles for a write. DTlA" never changes 
state when DEN is asserted. 


DTlA" is driven HIGH during reset. 


READY 
I 
READY indicates 
that data on AD lines can be sampled 
or removed. 
If READY is not 


asserted 
during a Td cycle, the Td cycle is extended 
to the next cycle by inserting 
a 
wait state (Tw). 


LOCK 
I/O 
BUS LOCK prevents 
bus masters from gaining control of the bus during Read/ 
O.D. 
Modify/Write 
(RMW) cycles. The processor 
or any bus agent may assert LOCK. 


At the start of a RMW operation, 
the processor 
examines 
the LOCK pin. If the pin is 
already asserted, 
the processor 
waits until it is not asserted. 
If the pin is not asserted. 


the processor 
asserts 
LOCK during the Ta cycle of the read transaction. 
The processor 
deasserts 
LOCK in the Ta cycle of the write transaction. 
During the 
time LOCK is asserted, 
a bus agent can perform a normal read or write but not a RMW 
operation. 
The processor 
also asserts LOCK during interrupt-acknowledge 
transactions. 
Do not leave LOCK unconnected. 
It must be pulled HIGH for the processor 
to function 
properly. 
ONCE MODE: The LOCK pin is sampled 
during reset. If it is asserted 
LOW at the end 


of RESET, all outputs will be three-stated 
until the part is reset again. ONCE mode is 


used in conjunction 
with an in-circuit 
emulator. 


1/0 - 
Input/Output, 0 = Output, I = Input. 0.0. = Open Drain, T.S. = Three-state 


• 
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Table 3. 80960SA 
Pin Description: 
Bus Signals (Continued) 


Name 
Type 
Description 


BE1:0 
0 
BYTE ENABLE 
L,lNES specify which data bytes (up to two) on the bus take part in 
T.S. 
the current bus cycle. BE1 corresponds 
to AD15:8; 
BEO corresponds 
to AD7:1, DO. 


The byte enable lines are asserted 
appropriately 
during each data cycle. 
INITIALIZATION 
FAILURE 
indicates 
that the processor 
has failed to initialize 
correctly. 
The failure state is indicated 
by a combination 
of BLAST asserted 
and 
BE1 :0 not asserted. 
This condition 
occurs after RESET is deasserted 
and before the 
first bus transaction 
begins. FAIL is asserted 
while the processor 
performs 
a self- 


test. If the self-test 
completes 
successfully, 
FAIL is deasserted. 
The processor 
then 
performs 
a zero checksum 
on the first eight words of memory. 
If it fails, FAIL is 
asserted 
for a second time and remains asserted; 
if it passes, system initialization 
continues 
and FAIL remains deasserted. 


HOLD 
I 
HOLD: A request from an external 
bus master to acquire the bus. When the 
processor 
receives 
HOLD and grants bus control to another 
master, it floats its 
three-state 
bus lines, then asserts HLDA and enters the Th state. When HOLD is 
deasserted, 
the processor 
deasserts 
HLDA and enters the Tj or Ta state. 


HLDA 
0 
HOLD ACKNOWLEDGE: 
Notifies an external 
bus master that the processor 
has 
T.S. 
relinquished 
control 
of the bus. This signal is always driven. At RESET it is driven 
LOW. 


BLAST/FAIL 
0 
BURST LAST indicates 
the last data cycle (Td) of a burst access. 
It is asserted 
low 
T.S. 
during the last Td and associated 
with Tw cycles in a burst access. 


INITIALIZATION 
FAILURE 
indicates 
that the processor 
has failed to initialize 
correctly. 
The failure state is indicated 
by a combination 
of BLAST asserted 
and 
BE1 :0 not asserted. 
This condition 
occurs after RESET is deasserted 
and before the 
first bus transaction 
begins. FAIL is asserted 
while the processor 
performs 
a self- 
test. If the self-test 
completes 
successfully, 
FAIL is deasserted. 
The processor 
then 
performs 
a zero checksum 
on the first eight words of memory. 
If it fails, FAIL is 
asserted 
for a second time and remains asserted; 
if it passes, system initialization 
continues 
and FAIL remains deasserted. 


I/O = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.5. = Three-state 
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Table 4. 80960SA 
Pin Description: 
Support 
Signals 


Name 
Type 
Description 


RESET 
I 
RESET clears the processor's 
internal logic and causes it to reinitialize. 
During RESET assertion, 
the input pins are ignored (except for INTO, INT1, INT3, 
LOCK), the three-state 
output pins are placed in a HIGH impedance 
state (except for 
DTlR, DEN, and AS) and other output pins are placed in their non-asserted 
states. 


RESET must be asserted 
for at least 41 ClK2 
cycles for a predictable 
RESET. 
Optionally, 
for a synchronous 
reset, the lOW 
and HIGH transition 
of RESET should 
occur after the rising edge of both ClK2 
and the external 
bus ClK 
and before the next 
rising edge of ClK2. 
The interrupt 
pins indicate the initialization 
sequence 
executed. 
Typical initialization 
requires driving only INTO and INT3 to a HIGH state. The reset conditions 
follow: 


INTO 
INT1 
INT3 
LOCK 
Action Taken 


1 
x 
1 
1 
Run-self-test 
(core initialization) 
0 
0 
1 
1 
Disable self-test 
0 
1 
x 
x 
Reserved 


, 
x 
x 
0 
x 
Reserved 
x 
x 
x 
0 
ONCE mode (see lOCK 
pin) 


INTO 
I 
INTERRUPT 
0 indicates 
a pending interrupt. 
The bus interrupt 
control 
register 
determines 
in which way the signal should be interpreted'. 
To signal an interrupt 
in a 
synchronous 
system, this pin-as 
well as the other interrupt 
pins-must 
be enabled 
by 
being deasserted 
for at least one bus cycle and then asserted 
for at least one 
additional 
bus cycle. In an asynchronous 
system the pin must remain deasserted 
for at 
least two system clock cycles and then asserted 
for at least two more system clock 
cycles. 
INTO is sampled 
during RESET to determine 
if the self-test 
sequence 
is to be 
executed. 


INT1 
I 
INTERRUPT 
1, like INTO, provides 
direct interrupt 
signaling. 
INT1 is sampled 
during 
RESET to determine 
if the self-test 
sequence 
is to be executed. 


INT2/INTR 
I 
INTERRUPT2/INTERRUPT 
REQUEST: The interrupt 
control 
register determines 
how 
this pin is interpreted. 
If INT2, it has the same interpretation 
as the INTO and INT1 
pins. If INTR, it is used to receive an interrupt 
request from an external 
interrupt 
controller. 


INT3/INTA 
I/O 
INTERRUPT3/INTERRUPT 
ACKNOWLEDGE: 
The bus interrupt 
control 
register 
T.S. 
determines 
how this pin is interpreted. 
If INT3, it has the same interpretation 
as the 
INTO and INT1 pins. If INTA, it is used as an output to control 
interrupt-acknowledge 
transactions. 
The INTA output is latched on-Chip and remains valid during Td cycles; 


as an output, it is open-drain. 
INT3 must be pulled HIGH during RESET. 


N.C. 
N/A 
NOT CONNECTED 
indicates 
pins should not be connected. 
Never connect 
any pin 
marked 
N.C.; these pins may be reserved 
for factory 
use. 


I/O - 
Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.S, = Three-state 


• 
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2.0 
ELECTRICAL SPECIFICATIONS 


2.1 
Power and Grounding 


The a0960SA is implemented in CHMOS IV technol- 
ogy and therefore has modest power requirements. 
Its high clock frequency and numerous output buff- 
ers (address/ data, control, error and arbitration sig- 
nals) can cause power surges as multiple output 
buffers simultaneously drive new signal levels. For 
clean on-chip power distribution, vcc and Vss pins 
separately feed the device's functional units. Power 
and ground connections 
must be made to 
all 
a0960SA power and ground pins. On the circuit 
board, all vcc pins must be strapped closely togeth- 
er, preferably on a power plane; all Vss pins should 
be strapped together, preferably on a ground plane. 


2.2 
Power Decoupling 
Recommendations 


Place a liberal amount of decoupling capacitance 
near the a0960SA. When driving the bus the proces- 
sor can cause transient power surges, particularly 
when connected to a large capacitive load. 


low 
inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. 
Inductance is reduced by shortening 
board traces between the processor and decoupling 
capacitors as much as possible. 


2.3 
Connection 
Recommendations 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. In particular, if 
one or more interrupt lines are not used, they should 
be pulled up. No inputs should ever be left floating. 


The LOCK open drain pin requires a pullup resistor 
whether or not the pin is used as an output. While in 
some cases a simple pullup resistor will be ade- 
quate, a network of pullup and pulldown resistors 
biased to a valid VIH (> 2.0 V) and terminated in the 
characteristic impedance of the circuit board is rec- . 
ommended. Figures 5 and 6 show recommended re- 
sistor values for both a low and high current drive 
network, which assumes a circuit board characterIs- 
tic impedance of 100 n. 
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Do not connect external logic to pins marked N.C. 


Vcc 
OPEN·DRAIN 
OUTPUT 


470 n 


390 n 
I 
_ 


I 


low Drive Network: 
VOH ~ 2.45V 10 3.0V 
IOL ~ 9.5 mA 10 12 mA 
272206-4 


Figure 5. Connection 
Recommendations 
for Low Current 
Drive Network 


220n 


OPEN·DRAIN 
OUTPUT 


270 n 


I 
_ 
I 


High Drive Network: 
VOH ~ 2.48V 10 3.0V 
IOL ~ 16 mA 10 20 mA 
272206-5 


Figure 6. Connection 
Recommendations 
for High Current 
Drive Network 


2.4 
Characteristic 
Curves 


Figure 7 shows typical supply current requirements 
over the operating temperature range of the proces- 
sor at supply voltage (Ved of 5V. Figure a shows 
the typical power supply current (led 
that the 
a0960SA requires at various operating frequencies 
when measured at three input voltage (Vcc) levels. 


For a given output current (100 the curve in Figure 9 
shows the worst case output low voltage (V00. Fig- 
ure 10 shows the typical capacitive derating curve 
for the a0960SA measured from 1.5V on the system 
clock (ClK) to o.av on the falling edge and 2.0V on 
the rising edge of the .bus address/data 
(AD) 
signals. 
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Figure 7. Typical Supply Current 
vs Case Temperature 
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Figure 8. Typical Current 
vs Frequency 
(Room Temp) 
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Figure 9. Typical 
Current 
vs Frequency 
(Hot Temp) 
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Curve 
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2.5 Test Load Circuit 


Figure 11 illustrates the load circuit used to test the 
80960SA's three-state pins; Figure 12 shows the 
load circuit used to test the open drain output. The 
open drain test uses an active load circuit in the form 
of a matched diode bridge. Since the open-drain out- 
put sinks current, only the IOL legs of the bridge are 
necessary and the IOH legs are not used. When the 
80960SA driver under test is turned off, the output 
pin is pulled up to VREF 
(Le., VOH)'- Diode 01 is 
turned off and the IOL current source flows through 
diode 02. 


When the 80960SA open-drain driver under test is 
on, diode 01 is also on and the voltage on the pin 
being tested drops to VOL. Diode 02 turns off and 
IOL flows through diode 01. 


80960SA 


THREE-STATE 
OUTPUT 
? 


~ 
272206-10 
CL = 50 pF for all signals 


Figure 11. Test Load Clrcultfor 
Three-State 
Output Pins 


OPEN-DRAIN 
OUTPUT 
0, 


272206-11 


IOL Tested at 12 and 20 mA 
VREF = Vex; 
D, and D2 are matched 


Figure 12. Test Load Circuit for 
Open-Drain 
Output Pins 
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2.6 
Absolute Maximum Ratings· 


Operating 
Temperature 
(PLCC) 
O·C to + 85·C Case 


Operating 
Temperature 
(QFP) .O·C to + 1OO·CCase 


Storage Temperature 
- 65·C to + 150·C 


Voltage on Any Pin (PLCC) 
-0.5V 
to vcc + 0.5V 


Voltage 
on Any Pin (QFP) .. - 0.25V to vcc +0.25V 


Power Dissipation 
1.9W (20 MHz) 


2.7 
DC Characteristics 


80960SA 
(10 MHz QFP) 
80960SA 
(10 and 16 MHz PLCC) 
80960SA 
(20 MHz PLCC) 


intel~ 


NOTICE: This data sheet contains preliminary infor- 
mation on new products in production. The specifica- 
tions are subject to change without notice. Verify with 
your local Intel Sales office that you have the latest 
data sheet before finalizing a design. 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause permanent 
damage. 
These are stress ratings 
only. Operation 
beyond 
the 
"Operating 
Conditions" 
is not recommended 
and ex- 
tended exposure 
beyond 
the 
"Operating 
Conditions" 
may affect device reliability. 


TCASE = O·C to + 100·C, vcc = 5V ±50,'0 
TCASE = O·C to + 85·C, vcc = 5V ± 10% 
TCASE = O·C to + 85·C, vcc = 5V ± 5% 


Table 5. DC Characteristics 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Vll 
Input Low Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
vcc +0.3 
V 


VCl 
CLK2 Input Low Voltage 
-0.3 
+0.8 
V 


VCH 
CLK2 Input High Voltage 
0.7 vcc 
vcc +0.3 
V 


VOl 
Output Low Voltage 
0.45 
V 
IOl = 2.5mA 
0.45 
V 
IOl = 12 mA, LOCK Pin 
0.60 
V 
IOl = 20 mA, LOCK Pin 


VOH 
Output High Voltage 
2.4 
V 
All TS, -2.5 
mA (1) 


ICC 
Power Supply Current: 
10 MHz-QFP 
240 
mA 
TCASE = O·C 
10 MHz-PLCC 
240 
mA 
TCASE = O·C 
16 MHz-PLCC 
300 
mA 
TCASE = O·C 
20 MHz·PLCC 
340 
mA 
TCASE = O·C 


ILl1 
Input Leakage Current, 
±15 
/LA 
o s VIN ~ vcc 
Except INTO, LOCK 


ILl2 
Input Leakage Current, 
-300 
/LA 
VIN = 0.45V(2) 
INTO, LOCK 


ILO 
Output Leakage Current 
±15 
/LA 


CIN 
Input Capacitance 
10 
pF 
fc = 1 MHz(3) 


Co 
Output Capacitance 
12 
pF 
fc = 1 MHz(3) 


CClK 
Clock Capacitance 
10 
pF 
fc = 1 MHz(3) 


NOTES: 
1. Not measured for open-drain output. 
2.'INTOand LOCK have internal pullup devices. 
3. Input, output and clock capacitance are not tested. 
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This section 
describes 
the AC specifications 
for the 
80960SA 
pins. All input and output timings are spec- 
ified relative 
to the 1.5V level of the rising edge of 
CLK2 and refer to the time at which the signal cross- 
es 1.5V (for output 
delay 
and input 
setup). 
All AC 


testing 
should 
be done with input voltages 
of O.4V 
and 2.4V, except 
for the clock 
(CLK2) which 
should 
be tested with input voltages 
of 0.45V and 0.7 x vcc. 
See Figure 13 and Tables 
6, 7 and 8 for timing rela- 
tionships 
for the 80690SA 
signals. 


2.8 
AC Specifications 
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Figure 13. Drive Levels and Timing Relationships 
for 80960SA 
Signals 


• 
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2.8.1 
AC SPECIFICATION 
TABLES 


Table 
6. 80960SA 
AC Characteristics 
(10 MHz) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Input Clock 


T1 
Processor 
Clock 
50 
125 
ns 
VIN = 1.5V 
Period (CLK2) 


T2 
Processor 
Clock 
8 
ns 
Vr = 10% Point 
Low Time (CLK2) 
= VCl + (VCH - 
VcUxO.1 


T3 
Processor 
Clock 
8 
ns 
Vr = 90% Point 
High Time (CLK2) 
= VCl + (VCH - 
Vcu x 0.9 


T4 
Processor 
Clock 
10 
ns 
VT = 90% to 10% Point (1) 
Fall Time (CLK2) 


T5 
Processor 
Clock 
10 
ns 
Vr = 10% to 90% Point (1) 
Rise Time (CLK2) 


Synchronous 
Outputs 


T6 
Output Valid Delay 
2 
31 
ns 


T6AS 
AS Output Valid Delay 
2 
25 
ns 


T7 
ALE Width 
T1-11 
ns 


TB 
ALE Output Valid Delay 
4 
33 
ns 


T9 
Output Float Delay 
2 
20 
ns 
(2) 


Synchronous 
Inputs 


T10 
Input Setup 1 
10 
ns 


T11 
Input Hold 
2 
ns 


T12 
Input Setup 2 
13 
ns 


T13 
Setup to ALE Inactive 
10 
ns 


T14 
Hold After ALE Inactive 
8 
ns 


T15 
RESET Hold 
3 
ns 
(3) 


T16 
RESET Setup 
5 
ns 
(3) 


T17 
RESET Width 
2050 
ns 
41 CLK2 Periods Minimum 


NOTES: 
1. Processor clock (ClK2) rise time and fall time are not tested. 
2. A float condition occurs when the maximum output current becomes less than ILO.Float delay is not tested, but should be 
no longer than the valid delay. 
' 


3. Meeting ~ 
setup and hold times is an optional method of synchronizing your clocks. If you decide to use an asyn- 
chronous reset, synchronizing the clock can be accomplished by using AS. 
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Table 7. 80960SA 
AC Characteristics 
(16 MHz) 


Symbol 
Parameter 
Min 
Max 
Units 
Notes 


Input Clock 


T1 
Processor 
Clock 
31.25 
125 
ns 
VIN = 1.5V 
Period (CLK2) 


T2 
Processor 
Clock 
8 
ns 
VT = 10% Point 
Low Time (CLK2) 
= VCl + (VCH - 
Vcu 
x 0.1 


T3 
Processor 
Clock 
8 
ns 
VT = 90% Point 
High Time (CLK2) 
= VCl + (VCH - 
Vcu 
x 0.9 


T4 
Processor 
Clock 
10 
ns 
VT = 90% to 10% Point (1) 
Fall Time (CLK2) 


T5 
Processor 
Clock 
10 
ns 
VT = 10% to 90% Point (1) 
Rise Time (CLK2) 


Synchronous 
Outputs 


T6 
Output Valid Delay 
2 
25 
ns 


T6AS 
AS Output Valid Delay 
2 
21 
ns 


T7 
ALE Width 
T1-11 
ns 


Ta 
ALE Output Valid Delay 
2 
22 
ns 


Tg 
Output Float Delay 
2 
20 
ns 
(2) 


Synchronous 
Inputs 


T10 
Input Setup 1 
10 
ns 


T11 
Input Hold 
2 
ns 


T12 
Input Setup 2 
13 
ns 


T13 
Setup to ALE Inactive 
10 
ns 


T14 
Hold After ALE Inactive 
8 
ns 


T15 
RESET Hold 
3 
ns 
(3) 


T16 
RESET Setup 
." 
5 
ns 
(3) 


TH 
RESET Width 
1281 
ns 
41 CLK2 Periods Minimum 


NOTES: 
1. Processor clock (ClK2) rise time and fall time are not tested. 
2. A float condition occurs when the maximum output current becomes less than ILQ.Float delay is not tested, but should be 
no longer than the valid delay. 


3. Meeting RESET setup and hold times is an optional method of synchronizing your clocks. If you decide to use an asyn- 
chronous reset, synchronizing the clock can be accomplished by using AS. 
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Table 
8. 80960SA 
AC Characteristics 
(20 MHz) 


Symbol 
Parameter 
Min 
Max 
Units 
Notes 


Input Clock 


T1 
Processor 
Clock 
25 
125 
ns 
VIN = 1.5V 
Period (CLK2) 


T2 
Processor 
Clock 
6 
ns 
VT = 10% Point 
Low Time (CLK2) 
= VCl + (VCH - 
VcuxO.1 


T3 
Processor 
Clock 
6 
ns 
VT = 90% Point 
High Time (CLK2) 
= VCl + (VCH - 
Vcu 
x 0.9 


T4 
Processor 
Clock 
10 
ns 
VT = 90% to 10% Point (1) 
Fall Time (CLK2) 


T5 
Processor 
Clock 
10 
ns 
VT = 10% to 90% Point (1) 
Rise Time (CLK2) 


Synchronous 
Outputs 
- 


T6 
Output Valid Delay 
2 
20 
ns 


T6AS 
AS Output Valid Delay 
2 
20 
ns 


T7 
ALE Width 
T1-11 
ns 


Ta 
ALE Output Valid Delay 
2 
18 
ns 


Tg 
Output Float Delay 
2 
17 
ns 
(2) 


Synchronous 
Inputs 


T10 
Input Setup 1 
7 
ns 


T11 
" 
Input Hold 
2 
ns 


T12 
Input Setup 2 
13 
ns 


T13 
Setup to ALE Inactive 
10 
ns 


T14 
Hold After ALE Inactive 
8 
ns 


T15 
RESET Hold 
3 
ns 
(3) 


T16 
RESET Setup 
5 
ns 
(3) 


T17 
RESET Width 
1025 
ns 
41 CLK2 Periods Minimum 


. NOTES: 
1. Processor clock (ClK2) rise time and fall time are not tested. 
2. A float condition occurs when the maximum output current becomes less than ILO.Float delay is not tested, but should be 
no longer than the valid delay. 


3. Meeting RESET setup and hold times is an optional method of synchronizing your clocks. If you decide to use an asyn- 
chronous reset, synchronizing the clock can be accomplished by using AS. 
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Figure 14. Processor 
Clock Pulse (CLK2) 


ClK2["NV', 
, 
, 
[lL.1' 
: : 
ClK 
, 
, 
, 
, 
, 


OUTPUTS[---~: 
>C 


RESET [ H... 
·--'---'"-' 
11:(--=--=~~--=--=--=--=~--=7.T1~7:::==~==~~~~T1~6='==~~=: 


I 
T15~ 


INITIALIZATIONPARAMETERS 
~ 


I 
I 
I 
I 


INTO.~[~W4 
INT3, lOCK 


, 
, 
. 
. 


NOTE: Initialization 
parameters 
must 
be set 
up at 
leost 
four 
CLK2 
periods 
before 
the 
first 
CLK2 
"A" 
edge. 


272206-14 


Figure 15. RESET Signal Timing 


ClK2 
[ 


ClK 
[ 


HOLD [ 


HlDA [ 


, 
--' 
--- 
T6 
, 
, 
--' 
f-- T6 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 
, 


" 
, 
, 
, 
, 
, 
, 
, ~ 
, 
, 
, 
I 
, 
, 
, 
, 
I 
, 
, 
, 


272206-15 


Figure 16. HOLD Timing 
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3.0 
MECHANICAL 
DATA 
3.2 Pin Assignment 


3.1 Packaging 


The 80960SA 
is available 
in two package 
types: 


• 
80-lead 
quad flat pack (EIAJ OFP). Shown in Fig- 


ure 17. 


• 
84-lead 
plastic 
leaded 
chip 
carrier 
(PLCC). 
Shown 
in Figure 18. 


Dimensions 
for both package 
types are given in the 
Intel Packaging 
handbook 
(Order 
#240800). 


The OFP and PLCC have different 
pin assignments. 


The OFP pins are numbered 
in order from 
1 to 80 


around 
the package 
perimeter. 
The PLCC pins are 


numbered 
in order from 1 to 84 around the package 
perimeter. 
Table 9 and Table 
10 list the function 
of 


each OFP pin; Table 
11 and Table 
12 list the func- 


tion of each PLCC pin. 


vcc and GND connections 
must be made to multi- 


ple vcc and GND pins. Each vcc and GND pin must 
be connected 
to the appropriate 
voltage 
or ground 
and externally 
strapped 
close 
to the package. 
It is 
recommended 
that you include 
separate 
power and 
ground 
planes 
in your circuit 
board for power 
distri- 


bution. 


Pins identified 
as N.C. (No Connect) 
should never be 
connected. 
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Figure 
17. SO-Lead EIAJ Quad Flat Pack (QFP) Package 
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Table 10. 80960SA QFP Pinout-In 
Signal Order 


Signal 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 
Pin 


A1 
38 
A18 
5 
DO 
31 
Vcc 
51 


A2 
35 
A19 
4 
DEN 
55 
Vcc 
59 


A3 
34 
A20 
3 
DT/R 
56 
Vcc 
61 


AD1 
30 
A21 
2 
HLOA 
50 
Vcc 
73 


AD2 
29 
A22 
1 
HOLD 
53 
Vcc 
77 


AD3 
28 
A23 
80 
INTO 
46 
Vcc 
8 


AD4 
27 
A24 
79 
INT1 
47 
Vss 
13 


AD5 
26 
A25 
76 
INT211NTR 
48 
Vss 
22 


AD6 
25 
A26 
75 
INT3/INTA 
49 
Vss 
24 


AD7 
20 
A27 
74 
LOCK 
58 
Vss 
32 


AD8 
19 
A28 
71 
N.C. 
39 
Vss 
37 


AD9 
18 
A29 
70 
N.C. 
63 
Vss 
43 


AD10 
17 
A30 
69 
READY 
67 
Vss 
52 


AD11 
16 
A31 
68 
RESET 
45 
Vss 
60 


AD12 
15 
ALE 
66 
Vcc 
12 
Vss 
62 


AD13 
14 
AS 
64 
Vcc 
21 
Vss 
72 


AD14 
11 
BEO 
41 
Vcc 
23 
Vss 
78 


AD15 
10 
BE1 
40 
Vcc 
33 
Vss 
9 


A16 
7 
BLAST 
57 
Vcc 
36 
Vss 
65 


A17 
6 
CLK2 
44 
Vcc 
42 
W/R 
54 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 


• 
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Table 
11. 80960SA 
PLCC Plnout-In 
Pin Order 


Pin 
Signal 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 


1 
Vcc 
22 
Vss 
43 
Vss 
64 
HOLD 


2 
N.C. 
23 
N.C. 
44 
Vcc 
65 
N.C. 


3 
A27 
24 
AD13 
45 
A3 
66 
W/R 


4 
A26 
25 
AD12 
46 
A2 
67 
DEN 


5 
A25 
26 
AD11 
47 
Vcc 
68 
DT/R 


6 
Vcc 
27 
AD10 
48 
Vss 
69 
BLAST 


7 
Vss 
28 
AD9 
49 
A1 
70 
LOCK 


8 
A24 
29 
AD8 
50 
N.C. 
71 
VCC 


9 
A23 
30 
AD7 
51 
BE1 
72 
Vss 


10 
A22 
31 
Vcc 
52 
BEO 
73 
Vcc 


11 
A21 
32 
Vss 
53 
Vcc 
74 
Vss 


12 
A20 
33 
Vcc 
54 
Vss 
75 
N.C. 


13 
A19 
34 
Vss 
55 
CLK2 
76 
AS 


14 
A18 
35 
AD6 
56 
RESET 
77 
Vss 


15 
A17 
36 
AD5 
57 
INTO 
78 
ALE 


16 
A16 
37 
AD4 
58 
INT1 
79 
READY 


17 
Vcc 
38 
AD3 
59 
INT2/INTR 
80 
A31 


18 
Vss 
39 
02 
60 
INT3/1NTA 
81 
A30 


19 
AD15 
40 
01 
61 
HLDA 
82 
A29 


20 
AD14 
41 
DO 
62 
CC 
83 
A28 


21 
Vcc 
42 
N.C. 
63 
Vss 
84 
Vss 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 
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Table 12.80960SA PLCC Plnout-InSignalOrder 


Signal 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 
Pin 


A1 
49 
A18 
14 
DT/R 
68 
Vcc 
44 


A2 
46 
A19 
13 
HLDA 
61 
Vcc 
47 


A3 
45 
A20 
12 
HOLD 
64 
Vcc 
53 


DO 
41 
A21 
11 
INTO 
57 
Vcc 
6 


AD1 
40 
A22 
10 
INT1 
58 
Vcc 
62 


AD2 
39 
A23 
9 
INT2/1NTR 
59 
Vcc 
71 


AD3 
38 
A24 
8 
INT3/1NTA 
60 
Vcc 
73 


AD4 
37 
A25 
5 
LOCK 
70 
Vss 
18 


AD5 
36 
A26 
4 
N.G. 
2 
Vss 
22 


AD6 
35 
A27 
3 
N.G. 
23 
Vss 
32 


AD7 
30 
A28 
83 
N.G. 
42 
Vss 
34 


AD8 
29 
A29 
82 
N.G. 
50 
Vss 
43 


AD9 
28 
A30 
81 
N.G. 
65 
Vss 
48 


AD10 
27 
A31 
80 
N.G. 
75 
Vss 
54 


AD11 
26 
ALE 
78 
READY 
79 
Vss 
63 


AD12 
25 
AS 
76 
RESET 
56 
Vss 
7 


AD13 
24 
BEO 
52 
Vcc 
1 
Vss 
72 


AD14 
20 
BE1 
51 
Vcc 
17 
Vss 
74 


AD15 
19 
BLAST 
69 
Vcc 
21 
Vss 
77 


AD16 
16 
GLK2 
55 
Vcc 
31 
Vss 
84 


A17 
15 
DEN 
67 
Vcc 
33 
W/R 
66 


NOTE: 
00 not connect 
any external 
logic to any pins marked 
N.C. 
• 
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I nermill netinnance- 
\", VYall 


Parameter 
Alrflow-ft.lmln 
(m/sec) 


0 
50 
100 
200 
400 
600 
800 


(J Junction-to-Ambient 
(Case 
measured 
in the middle of the 
59 
57 
54 
50 
44 
40 
38 
top of the package) 
(No heatsink) 


(J Junction-to-Case 
11 
11 
11 
11 
11 
11 
11 


NOTE: 
This table applies 
to 80960SA 
OFP soldered 
directly 
to board. 


Table 14_80960SA PLCC Package Thermal Characteristics 


Thermal Reslstance-°C/Watt 


Parameter 
Alrflow-ft.lmln 
(m/sec) 


0 
50 
100 
200 
400 
600 
800 
1000 


(J Junction-to-Ambient 
34 
32 
29.5 
28 
25 
23 
21 
20.5 
(No heatsink) 


(J Junction-to-Case 
12 
12 
12 
12 
12 
12 
12 
12 


NOTE: 
This table applies 
to 80960SA 
OFP soldered 
directly 
to board. 
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4.0 
,WAVEFORMS 


80960SA 


Figures 19, 20, 21, 22 and 23 show waveforms for various transactions on the 80960SA's bus. Figure 24 
shows a cold reset functional waveform. 
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Figure 20. Quad Word Burst Read Transaction 
with 1,0,0,0,0,0,0, °Wait States 
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Figure 22. Accesses 
Generated 
by Quad Word Read Bus Request, 
Misaligned 
One Byte from Quad Word Boundary 
(1, 0, 0, 0, 0, 0, 0, 0) Wait States 
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Figure 23. Interrupt Acknowledge Cycle 
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5.0 
REVISION HISTORY 


This data sheet 
supersedes 
data sheet 270917-004, 
which 
applied 
to both the 80960SA 
and the 80960S8. 


The 80960S8 
is now documented 
in 272207-001. 
Specification 
changes 
in the 80960SA 
data sheet 
are a 
result of design changes. 
The sections 
significantly 
changed 
since the previous 
revision 
are: 


Section 
Last 
Description 
Rev. 


2.3 Connection 
Recommendations 
-004 
Deleted corresponding 
graph of Open Drain Voltage vs. 


(p.15) 
Output Current. 


Figure 7. Typical Supply Current vs. 
-004 
Regraphed 
new data in three graphs instead of two. 
Case Temperature 
(p. 16), Figure 8. 
Typical Current vs. Frequency 
(Room Temp) (p. 6) and Figure 9. 
Typical Current vs. Frequency 
(HotTemp) 
(p. 7) 


Table 5. DC Characteristics 
(p. 19) 
-004 
Input Leakage Current (IU2) specification 
added to 
accurately 
describe 
leakage of INTO and LOCK as inputs . 


Icc max reduced: 
Power Supply Current: 
Was: 
Is: 
10MHz 
280 
240 
16 MHz 
350 
300 


Table 6. 80960SA 
AC Characteristics 
-004 
T7 minimum specification 
improved: 
(10 MHz) (p. 21) and Table 7. 
Power Supply Current: 
Was: 
Is: 
80960SA 
AC Characteristics 
10MHz 
24 ns 
T1-11 
ns 
(16 MHz) (p. 22) 
16MHz 
15 ns 
T1-11 
ns 


Table 8. 80960SA 
AC Characteristics 
-004 
New 20 MHz specification 
table added for 80960SA 
C-step. 
(20 MHz) (p. 23) 


Table 13. 80960SA 
QFP Package 
-004 
eJA increased 
to reflect smaller die size and lower Icc. 


Thermal 
Characteristics 
(p. 32) 


Table 14. 80960SA 
PLCC Package 
-004 
eJA and eJC increased 
to reflect smaller die size and lower 
Thermal 
Characteristics 
(p. 32) 
'cc- 


The sections 
significantly 
changed 
between 
revisions 
-003 and -004 of the 80960SAlS8 
data sheet were: 


Section 
Last 
Description 
Rev. 


DC Characteristics 
(p. 19) 
-003 
Operating 
temperature 
for PLCC package 
changed: 
Was: 
TCASE = O·C to + 100·C 
Is: 
TCASE = O·C to + 85·C 
The test program 
has not changed. 


Table 9. 80960SA 
and 80960S8 
-003 
Signal A 12 incorrectly 
shown as Pin 28; is now correctly 
QFP Pinout-In 
Pin Order (p. 23) 
shown as Pin 38. Note added to clarify No Connect 
Pins. 


• 
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80960SB 
EMBEDDED 32-BIT MICROPROCESSOR 
WITH 16-BIT 
BURST DATA BUS 


• 
High-Performance 
Embedded 
Architecture 
-16 
MIPS* 
Burst Execution 
at 16 MHz 
- 
5 MIPS Sustained 
Execution 
at 
16 MHz 


• 
512-Byte 
On-Chip 
Instruction 
Cache 
- 
Direct Mapped 
- 
Parallel Load/Decode 
for Uncached 
Instructions 


• 
Multiple 
Register 
Sets 
- 
Sixteen 
Global 32-Blt Registers 
- 
Sixteen 
Local 32-Bit Registers 
- 
Four Local Register 
Sets Stored 
On- 
Chip 
- 
Register Score boarding 


• 
Software 
Compatible 
with 80960KA/ 


KB/CA/CF 
Processors 


• 
Pin Compatible 
with 80960SA 


• 
Built-In Interrupt 
Controller 
- 
4 Direct Interrupt 
Pins 
-31 
Priority 
Levels, 256 Vectors 


• 
Built-In Floating 
Point Unit 
- 
Fully IEEE 754 Compatible 


• 
Easy to Use, High Bandwidth 
16-Blt Bus 
- 
25.6 Mbytes/s 
Burst 
-;- Up to 16 Bytes Transferred 
per Burst 


• 
32-Blt Address 
Space, 4 Gigabytes 


• 
80-Lead 
Quad Flat Pack (EIAJ QFP) 


• 
84-Lead 
Plastic Leaded 
Chip Carrier 
(PLCC) 


The 80960S8 is a member of Intel's i960~ 32-bit processor family, which is designed especially for low cost 
embedded applications. It includes a 512-byte instruction cache, an integrated floating-point unit and a built-in 
interrupt controller. The 80960S8 has a large register set, multiple parallel execution units and a 16-bit burst 
bus. Using advanced RiSe technology, this high performance processor is capable of execution rates in 
excess of 5 million instructions per second". The 80960S8 is well-suited for a wide range of cost sensitive 
embedded applications including non-impact printers, network adapters and 1/0 controllers. 


32-BIT 
BUS 
CONTROL 
LOGIC 
32-BIT 
ADDRESS 
16-BIT 
I:"_~J:_':;;;;;;;;;;;;;;;;;;;;;;;;;;;!-------•••~ 
J 
BURST 
BUS 


Figure 1. The 8096058 
Processor's 
Highly Parallel Architecture 
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'Relative 
to Digital Equipment 
Corporation's 
VAX-111780 
at 1 MIPS CVAX-11 is a trademark 
of Digital Equipment 
Corporation). 
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1.0 
THE i960® PROCESSOR 


The 80960S8 is a member of the 32-bit architecture 
from Intel known as the i960 processor family. 
These microprocessors were especially designed to 
serve the needs of embedded applications. The em- 
bedded market includes applications as diverse as 
industrial automation, avionics, image processing, 
graphics and networking. These types of applica- 
tions require high integration, low power consump- 
tion, quick interrupt response times and high per- 
formance. Since time to market is critical, embedded 
microprocessors need to be easy to use in both 
hardware and software designs. 


8096058 


All members of the i960 processor family share a 
common core architecture which utilizes RiSe tech- 
nology so that, except for special functions, the fam- 
ily members are object-code compatible. Each new 
processor in the family adds its own special set of 
functions to the core to satisfy the needs of a specif- 
ic application or range of applications in the embed- 
ded market. 


Software written for the 80960S8 will run without 
modification on any other member of the 80960 
Family. This processor is pin-compatible with the 
80960SA. 
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Figure 2. 8096088 
Programming 
Environment 
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1.1 Key Performance Features 


The 8096088 architecture is based on the most re- 
cent advances in microprocessor technology and is 
grounded in Intel's long experience in the design 
and manufacture of embedded microprocessors. 
Many features contribute to the 8096088's excep- 
tional performance: 
1. Large 
Register 
Set. Having a large number of 
registers reduces the number of times that a 
processor needs to access memory. Modem 
compilers can take advantage of this feature to 
optimize execution speed. For maximum flexibili- 
ty, the 8096088 provides thirty-two 32-bit regis- 
ters and four 80-bit floating point registers. (8ee 
Figure 2.) 


2. Fast 
Instruction 
Execution. 
8imple functions 
make up the bulk of instructions in most pro- 
grams so that execution speed can be improved 
by ensuring that these core instructions are exe- 
cuted as quickly as possible. The most frequently 
executed instructions-such 
as register-register 
moves, add/subtract, 
logical 
operations 
and 
shifts-execute 
in one to two cycles. (Table 1 
contains a list of instructions.) 
3. Load/Store 
Architecture. 
One way to improve 
execution speed is to reduce the number of times 
that the processor must access memory to per- 
form an operation. As with other processors 
based on RI8e technology, the 8096088 has a 
Load/8tore architecture. As such, only the LOAD 
and 8TORE instructions reference memory; all 
other instructions operate on registers. This type 
of architecture simplifies instruction decoding 
and is used in combination with other techniques 
to increase parallelism. 


4. Simple 
Instruction 
Formats. 
All instructions in 
the 8096088 
are 32 bits long and must be 
aligned on word 
boundaries. This alignment 
makes it possible to eliminate the instruction 
alignment stage in the pipeline. To simplify the 
instruction decoder, there are only five instruction 
formats; each instruction uses only one format. 
(8ee Figure 3.) 
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5. Overlapped 
Instruction 
Execution. 
Load oper- 
ations allow execution of subsequent instructions 
to continue before the data has been returned 
from memory, so that these instructions can 
overlap the load. The 8096088 
manages this 
process transparently to software through the 
use of a register scoreboard. Conditional instruc- 
tions also make use of a scoreboard so that sub- 
sequent unrelated instructions may be executed 
while the conditional instruction is pending. 


6. Integer 
Execution 
Optimization. 
When the re- 
sult of an arithmetic execution is used as an oper- 
and in a subsequent calculation, the value is sent 
immediately to its destination register. At the 
same time, the value is put on a bypass path to 
the ALU, thereby saving the time that otherwise 
would be required to retrieve the value for the 
next operation. 


7. Bandwidth 
Optlmizatlons. 
The 8096088 gets 
optimal use of its memory bus bandwidth be- 
cause the bus is tuned for use with the on-chip 
instruction cache: instruction cache 
line size 
matches the maximum burst size for instruction 
fetches. The 8096088 automatically fetches four 
words in a burst and stores them directly in the 
cache. Due to the size of the cache and the fact 
that it is continually filled in anticipation of need- 
ed instructions in the program flow, the 8096088 
is relatively insensitive to memory wait states. 
The benefit is that the 8096088 delivers out- 
standing performance even with a low cost mem- 
ory system. 


8. Cache Bypass. If a cache miss occurs, the proc- 
essor fetches the,needed instruction then sends 
it on to the instruction decoder at the same time it 
updates the cache. Thus, no extra time is spent 
to load and read the cache. 
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Control 
I Opcode 
1 
D_i_sp_l_a_ce_m_en_t 
----' 


Compare 
and 
Branch 
I Opcode I Reg/Lit 
I 
Reg 
~ 
D_is_p_la_c_e_m_e_n_t 
__ 
-' 


I Opcode 
I_R_e_ 
g 
__ 
B_a_se____'~L_ 
O_ff_s_e_t____' 


I Opcode 
I 


Register to 
Register 


Memory Acc,ess- 
Short 


Memory Acccess- 
Long 
Reg I 
Base 
I_M_o_d_e --'ETI 
Offset 


Displacement 


Figure 3. Instruction 
Formats 


1.1.1 
MEMORY 
SPACE 
AND ADDRESSING 
MODES 


The 80960SB 
offers 
a linear programming 
environ- 
ment so that all programs 
running 
on the processor 
are contained 
in a single address 
space. 
Maximum 
address 
space size is 4 Gigabytes 
(232 bytes). 


For ease of use the 80960SB 
has a small number of 
addressing 
modes, 
but includes 
all those 
necessary 
to ensure efficient 
execution 
of hlqh-level 
languages 
such 
as C. Table 
2 lists 
the 
memory 
addressing 
modes. 


Table 2. Memory 
Addressing 
Modes 


• 
12-Bit Offset 


• 
32-Bit Offset 


• 
Register·lndirect 


• 
Register + 12-Bit Offset 


• 
Register + 32-Bit Offset 


• 
Register + (Index-Register 
x Scale-Factot) 


• 
Register 
x Scale Factor + 32-Bit Displacement 


• 
Register 
+ 
(Index-Register 
x Scale-Factor) 
+ 
32-Bit 
Displacement 


Scale-Factor 
is 1, 2, 4, 8 or 16 
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1.1.2 
DATA TYPES 


The 80960SB 
recognizes 
the following 
data types: 


Numeric: 


• 
8-, 16-, 32- and 64-bit ordinals 


• 
8-, 16-, 32- and 64-bit integers 


• 
32-, 64- and 80-bit real numbers 


Non-Numeric: 


• 
Bit 


• 
Bit Field 


• 
Triple Word (96 bits) 


• 
Quad-Word 
(128 bits) 


1.1.3 
LARGE 
REGISTER 
SET 


The 80960SB 
programming 
environment 
includes 
a 
large 
number 
of registers. 
In fact, 
32 registers 
are 
available 
at any time. The availability 
of this many 
registers 
greatly 
reduces 
the number of memory 
ac- 


cesses 
required 
to perform 
algorithms, 
which 
leads 
to greater 
instruction 
processing 
speed. 


There 
are 
two 
types 
of 
general-purpose 
register: 
local and global. The global 
registers 
consist 
of six- 
teen 32-bit registers 
(gO though 
g15) and four 80-bit 
registers 
(FPO through 
FP3). 
These 
registers 
per- 


form the same function as the general-purpose reg- 
isters provided in other popular microprocessors. 
The term global refers to the fact that these regis- 
ters retain their contents across procedure calls. 


The local registers, on the other hand, are proce- 
dure specific. For each procedure call, the 80960S8 
allocates 16 local registers (rO through r15). Each 
local register is 32 bits wide. Any register can also 
be used for single or double-precision floating-point 
operations; the 80-bit floating-point registers are pro- 
vided for extended precision. 


1.1.4 
MULTIPLE 
REGISTER 
SETS 


To further increase the efficiency of the register set, 
multiple sets of local registers are stored on-chip 
(See Figure 4). This cache holds up to four local 
register frames, which means that up to three proce- 
dure calls can be made without having to access the 
procedure stack resident in memory. 


Although programs may have procedure calls nest- 
ed many calls deep, a program typically oscillates 
back and forth between only two to three levels. As 
a result, with four stack frames in the cache, the 
probability of having a free frame available on the 
cache when a call is made is very high. In fact, runs 
of representative C-Ianguage programs show that 
80% of the calls are handled without needing to 
access memory. 


If four or more procedures are active and a new pro- 
cedure is called, the 80960S8 moves the oldest 
local register set in the stack-frame cache to a pro- 
cedure stack in memory to make room for a new set 
of registers. Global register g15 is the frame pointer 
(FP) to the procedure stack. 


Global and floating point registers are not ex- 
changed on a procedure call, but retain their con- 
tents, making them available to all procedures for 
fast parameter passing. 
' 


1.1.5 
INSTRUCTION 
CACHE 


To further reduce memory accesses, the 80960S8 
includes a 512-byte on-chip instruction cache. The 
instruction cache is based on the concept of locality 
of reference; most programs are not usually execut- 
ed in a steady stream but consist of many branches, 
loops and procedure calls that lead to jumping back 


80960SB 


and forth in the same small section of code. Thus, 
by maintaining a block of instructions in cache, the 
number of memory references required to read in- 
structions into the processor is greatly reduced. 


To 
load the 
instruction cache, 
instructions 
are 
fetched in 16-byte blocks; up to four instructions can 
be fetched at one time. An efficient prefetch algo- 
rithm increases the probability that an instruction will 
already be in the cache when it is needed. 


Code for small loops often fits entirely within the 
cache, leading to a great increase in processing 
speed since further memory references might not be 
necessary until the program exits the loop. Similarly, 
when calling short procedures, the code for the call- 
ing procedure is likely to remain in the cache so it 
will be there on the procedure's return. 


1.1.6 
REGISTER 
SCOREBOARDING 


The instruction decoder is optimized in several ways. • 
One optimization method is the ability to overlap 
instructions by using register scoreboarding. 


Register scoreboarding occurs when a LOAD moves 
a variable from memory into a register. When the 
instruction initiates, a scoreboard bit on the target 
register is set. Once the register is loaded, the bit is 
reset. In between, any reference to the register con- 
tents is accompanied by a test of the scoreboard bit 
to ensure that the load has completed before pro- 
cessing continues. Since the processor does not 
need to wait for the LOAD to complete, it can exe- 
cute additional instructions placed between the 
LOAD and the instruction that uses the register con- 
tents, as shown in the following example: 


Id data_2, 
r4 
Id data_2, 
r5 
Unrelated 
instruction 
Unrelated 
instruction 
add 
r4, 
r5, 
r6 


In essence, the two unrelated instructions between 
LOAD and ADD are executed "for free" (i.e., take no 
apparent time to execute) because they are execut- 
ed while the register is being loaded. Up to three 
load instructions can be pending at one time with 
three corresponding scoreboard bits set. 8y exploit- 
ing this feature, system programmers and compiler 
writers have a useful tool for optimizing execution 
speed. 
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1.1.7 FLOATING-POINT ARITHMETIC 


Figure 4. Multiple Register Sets Are Stored On-Chip 


In the 80960S8, floating-point arithmetic has been 
made an integral part of the architecture. Having the 
floating-point unit integrated on chip provides two 
advantages. First, it improves the performance of 
the chip for floating-point applications, since no ad- 
ditional bus overhead is associated with floating- 
point calculations, thereby leaving more time for oth- 
er bus operations such as 1/0. Second, the cost of 
using floating-point operations is reduced because a 
separate coprocessor chip is not required. 


The 
80960S8 
floating-point 
(real-number) 
data 
types 
include 
single-precision 
(32-bit), 
double- 


precision (64-bit) and extended precision (80-bit) 
floating-point numbers. Any registers may be used to 
execute floating-point operations. 


The processor provides hardware support for both 
mandatory and recommended portions of 
IEEE 
Standard 754 for floating-point arithmetic, including 
all arithmetic, exponential, logarithmic and other 
transcendental functions. Table 3 shows execution 
times for some representative instructions. 


1.1.8 HIGH BANDWIDTH BUS 


The 80960S8 CPU resides on a high-bandwidth 
addressl data bus. The bus provides a direct com- 
munication path between the processor and the 
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Table 3. Sample Floating-Point Execution Times 
(p,s)at 16 MHz 


32-Blt 
64-Blt 
Function 


Add 
Subtract 
Multiply 
Divide 


0.6 
0.6 
1.1 
2.0 


0.8 
0.8 
2.0 
4.5 


Square Root 
Arctangent 
Exponent 
Sine 
Cosine 


5.8 
15.8 
17.7 
23.8 
23.8 


6.1 
20.5 
19.5 
25.9 
25.9 


memory and 1/0 subsystem interfaces. The proces- 
sor uses the bus to fetch instructions, manipulate 
memory and respond to interrupts. 8us features 
include: 


• 16-bit data path multiplexed onto the lower bits of 
the 32-bit address path 
• Eight 
16-bit half-word 
burst capability 
which 
allows transfers from 1 to 16 bytes at a time 
• High 
bandwidth 
reads 
and 
writes 
with 
25.6 Mbytesls burst (at 16 MHz) 


Table 4 defines bus signal names and functions; 
Table 5 defines other component-support signals 
such as interrupt lines. 


1.1.9 INTERRUPT HANDLING 


The 80960SB can be interrupted in one of two ways: 
by the activation of one of four interrupt pins or by 
sending a message on the processor's data bus. 


The 80960SB is unusual in that it automatically han- 
dies interrupts on a priority basis and can keep track 
of pending interrupts through its on-chip interrupt 
controller. Two of the interrupt pins can be config- 
ured to provide 8259A-style handshaking for expan- 
sion beyond four interrupt lines. 


1.1.10 DEBUG FEATURES 


The 80960SB has built-in debug capabilities. There 
are two types of breakpoints and six trace modes. 
Debug features are controlled by two internal 32-bit 
registers, the Process-Controls Word and the Trace- 
Controls Word. By setting bits in these control 
words, a software debug monitor can closely control 
how the processor responds during program execu- 
tion. 


The 80960SB provides two hardware breakpoint 
registers on-chip which, by using a special com- 
mand, can be set to any value. When the instruction 
pointer matches either breakpoint register value, the 
breakpoint handling routine is automatically called. 


The 80960SB also provides software breakpoints 
through the use of two instructions: MARK and 
FMARK. These can be placed at any point in a pro- 
gram and cause the processor to halt execution at 
that point and call the breakpoint handling routine. 
The breakpoint mechanism is easy to use and pro- 
vides a powerful debugging tool. 


Tracing is available for instructions (single step exe- 
cution), calls and returns and branching. Each trace 
type may be enabled separately by a special debug 
instruction. In each case, the 80960SB executes the 
instruction first and then calls a trace handling rou- 
tine (usually part of a software debug monitor). Fur- 
ther program execution is halted until the routine 
completes, at which time execution resumes at the 
next instruction. The 80960SB's tracing mecha- 
nisms, implemented completely in hardware, greatly 
simplify the task of software test and debug. 
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1.1.11 FAULT DETECTION 


The 80960SB has an automatic mechanism to han- 
dle faults. Fault types include floating point, trace 
and arithmetic faults. When the processor detects a 
fault, it automatically calls the appropriate fault han- 
dling routine and saves the current instruction point- 
er and necessary state information to make efficient 
recovery possible. Like interrupt handling routines, 
fault handling routines are usually written to meet 
the needs of specific applications and are often in- 
cluded as part of the operating system or kernel. 


For each of the fault types, there are numerous sub- 
types that provide specific information about a fault. 
For example, a floating point fault may have the sub- 
type set to an Overflow or Zero-Divide fault. The 
fault handler can use this specific information to re- 
spond correctly to the fault. 
. 


1.1.12 BUILT-IN TESTABILITY 


Upon reset, the 80960SB automatically conducts an 
exhaustive internal test of its major blocks of logic. 
Then, before executing its first instruction, it does a 
zero check sum on the first eight words in memory 
to ensure that the memory image was programmed 
correctly. If a problem is discovered at any point dur- 
Ing the self-test, the 80960SB asserts its FAIL pin 
and will not begin program execution. Self test takes 
approximately 47,000 cycles to complete. 


System manufacturers can use the 80960SB's self- 
test feature during incoming parts inspection. No 
special diagnostic programs need to be written. The 
test is both thorough and fast. The self-test capabili- 
ty helps ensure that defective parts are discovered 
before systems are shipped and, once in the field, 
the self-test makes it easier to distinguish between 
problems caused by processor failure and problems 
resulting from other causes. 


1.1.13 CHMOS 


The 80960SB is fabricated using Intel's CHMOS IV 
(Complementary High Speed Metal Oxide Semicon- 
ductor) 
process. 
The 
80960SB 
is 
available 
at 
10 MHz in both PLCC and QFP packages, and at 
16 MHz in the PLCC package. 


• 
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Table 4. 80960SB 
Pin Description: 
Bus Signals 


Name 
Type 
Description 


CLK2 
I 
SYSTEM 
CLOCK provides the fundamental 
timing for 80960S8 
systems. 
It is divided 
by two inside the 80960S8 
to generate 
the internal 
processor 
clock. 


A31:16 
0 
ADDRESS 
BUS carries the upper 16 bits of the 32-bit physical address to memory. 
It 


T.S. 
is valid throughout 
the burst cycle; no latch is required. 


AD15:1, 
DO 
I/O 
ADDRESS/OAT 
A BUS carries the low order 32-bit addresses 
and 16-bit data to and 
T.S. 
from memory. AD15:4 must be latched since the cycle following 
the address cycle 
carries data on the bus. 


A3:1 
0 
ADDRESS 
BUS carries the word addresses 
of the 32-bit address to memory. These 
T.S. 
three bits are incremented 
during a burst access indicating 
the next word address of 
the burst access. 
Note that A3:1 are duplicated 
with AD3:1 during the address cycle. 


ALE 
0 
ADDRESS 
LATCH ENABLE indicates 
the transfer 
of a physical address. 
ALE is 
T.S. 
asserted 
during a Ta cycle and deasserted 
before the beginning 
of the Td state. It is 
active HIGH and floats to a high impedance 
state during a hold cycle (Th)' 


AS 
0 
ADDRESS 
STATUS 
indicates 
an address state. AS is asserted 
every Ta state and 
T.S. 
deasserted 
during the following 
Td state. AS is driven HIGH during reset. 


W/R 
0 
WRITE/READ 
specifies, 
during aT a cycle, whether 
the operation 
is a write or read. It 


0.0. 
is latched on-chip and remains valid during Td cycles. 


DEN 
0 
OAT A ENABLE is asserted 
during Td cycles and indicates 
transfer 
of data on the AD 
T.S. 
lines. The AD lines should not be driven by an external 
source unless DEN is 
asserted. 
When DEN is asserted, 
outputs from the previous cycle are guaranteed 
to 
be 3-stated. 
In addition, 
DEN deasserted 
indicates 
inputs have been captured 
and 
therefore 
input hold times can be disregarded. 
DEN is driven high during reset. 


DT/R 
0 
DATA TRANSMIT/RECEIVE 
indicates 
the direction 
of data transfer 
to and from the 
T.S. 
bus. It is low during Ta and Td cycles for a read or interrupt acknowledgment; 
it is high 


during 
Ta and Td cycles for a write. DTIR never changes 
state when DEN is asserted. 


DTIR is driven high during reset. 


READY 
I 
READY indicates 
that data on AD lines can be sampled 
or removed. 
If READY is not 
asserted 
during a Td cycle, the Td cycle is extended 
to the next cycle by inserting 
a 
wait state (Tw). 


LOCK 
I/O 
BUS LOCK prevents 
bus masters from gaining control 
of the bus during Readl 
0.0. 
Modify/Write 
(RMW) cycles. The processor 
or any bus agent may assert LOCK. 


At the start of a RMW operation, 
the processor 
examines 
the LOCK pin. If the pin is 
already asserted, 
the processor 
waits until it is not asserted. 
If the pin is not asserted, 


the processor 
asserts 
LOCK during the Ta cycle of the read transaction. 


The processor 
deasserts 
LOCK in the Ta cycle of the write transaction. 
During the 
time LOCK is asserted, 
a bus agent can perform 
a normal read or write but not a RMW 
operation. 
The processor 
also asserts LOCK during interrupt-acknowledge 
transactions. 
Do not leave LOCK unconnected. 
It must be pulled high for the processor 
to function 
properly. 
ONCE MODE: The LOCK pin is sampled 
during reset. If it is asserted 
LOW at the end 
of RESET, all outputs will be three-stated 
until the part is reset again. ONCE mode is 
used in conjunction 
with an in-circuit 
emulator. 


1/0 = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.8. = Three-state 
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Table 4. 80960SB 
Pin Description: 
Bus Signals (Continued) 


Name 
Type 
Description 


BE1:0 
0 
BYTE ENABLE 
LINES specify which data bytes (up to two) on the bus take part in 
T.S. 
the current bus cycle. BE1 corresponds 
to AD15:8; 
BEO corresponds 
to AD7:1, DO. 
The byte enable lines are asserted 
appropriately 
during each data cycle. 


INITIALIZATION 
FAILURE 
indicates 
that the processor 
has failed to initialize 
correctly. 
The failure state is indicated 
by a combination 
of BLAST asserted 
and 
BE1 :0 not asserted. 
This condition 
occurs after RESET is deasserted 
and before the 
first bus transaction 
begins. FAIL is asserted 
while the processor 
performs 
a self- 
test. If the self-test 
completes 
successfully, 
FAIL is deasserted. 
The processor 
then 
performs 
a zero checksum 
on the first eight words of memory. 
If it fails, FAIL is 
asserted 
for a second time and remains asserted; 
if it passes, system initialization 
continues 
and FAIL remains deasserted. 


HOLD 
I 
HOLD: A request from an external 
bus master to acquire the bus. When the 
processor 
receives 
HOLD and grants bus control 
to another 
master, it floats its 
three-state 
bus lines, then asserts 
HLDA and enters the Th state. When HOLD is 
deasserted, 
the processor 
deasserts 
HLDA and enters the Tj or Ta state. 


HLDA 
0 
HOLD ACKNOWLEDGE: 
Notifies an external 
bus master that the processor 
has 
T.S. 
relinquished 
control 
of the bus. This signal is always driven. At RESET it is driven 
LOW. 


BLAST/FAIL 
0 
BURST LAST indicates 
the last data cycle (Td) of a burst access. 
It is asserted 
low 
T.S. 
during the last Td and associated 
with Tw cycles in a burst access. 
INITIALIZATION 
FAILURE 
indicates 
that the processor 
has failed to initialize 
correctly. 
The failure state is indicated 
by a combination 
of BLAST asserted 
and 
BE1 :0 not asserted. 
This condition 
occurs after RESET is deasserted 
and before the 
first bus transaction 
begins. FAIL is asserted 
while the processor 
performs 
a self- 
test. If the self-test 
completes 
successfully, 
FAIL is deasserted. 
The processor 
then 
performs 
a zero checksum 
on the first eight words of memory. 
If it fails, FAIL is 
asserted 
for a second time and remains asserted; 
if it passes, system initialization 
continues 
and FAIL remains deasserted. 


1/0 = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.S. = Three-state 
•• 
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Table 5. 80960S8 
Pin Description: 
Support 
Signals 


Name 
Type 
Description 


RESET 
I 
RESET: Clears the processor's 
internal 
logic and causes it to reinitialize. 
During RESET assertion, 
the input pins are ignored (except for INTO, INT1, INT3, 


lOCK), 
the three-state 
output pins are placed in a HIGH impedance 
state (except for 
DTfR, DEN, and AS) and other output pins are placed in their non-asserted 
states. 


RESET must be asserted 
for at least 41 ClK2 
cycles for a predictable 
RESET. 


Optionally, 
for a synchronous 
reset, the lOW 
and HIGH transition 
of RESET should 
occur after the rising edge of both ClK2 
and the external 
bus ClK 
and before the next 
rising edge of ClK2. 


I, 
The interrupt 
pins indicate the initialization 
sequence 
executed. 
Typical initialization 
requires driving only INTO and INT3 to a HIGH state. The reset conditions 
follow: 


INTO 
INT1 
INT3 
LOCK 
Action Taken 


1 
x 
1 
1 
Run-self-test 
(core initialization) 
0 
0 
1 
1 
Disable self-test 
0 
1 
x 
x 
Reserved 
x 
x 
0 
x 
Reserved 
x 
x 
x 
0 
ONCE mode (see lOCK 
pin) 


INTO 
I 
INTERRUPT 
0: Indicates a pending interrupt. 
The bus interrupt 
control 
register 
determines 
in which way the signal should be interpreted. 
To signal an interrupt 
in a 
synchronous 
system, this pin-as 
well as the other interrupt 
pins-must 
be enabled 
by 
being deasserted 
for at least one bus cycle and then asserted 
for at least one 
additional 
bus cycle. In an asynchronous 
system the pin must remain deasserted 
for at 
least two system clock cycles and then asserted 
for at least two more system clock 
cycles. 
INTO is sampled 
during RESET to determine 
if the self-test 
sequence 
is to be 
executed. 


INT1 
I 
INTERRUPT 
1: Like INTO, provides 
direct interrupt 
signaling. 
INT1 is sampled 
during 
RESET to determine 
if the self-test 
sequence 
is to be executed. 


INT2/INTR 
I 
INTERRUPT2/1NTERRUPT 
REQUEST: The interrupt control 
register determines 
how 
this pin is interpreted. 
If INT2, it has the same interpretation 
as the INTO and INT1 
pins. If INTR, it is used to receive an interrupt 
request from an external 
interrupt 
controller. 


INT3/INTA 
I/O 
INTERRUPT3/1NTERRUPT 
ACKNOWLEDGE: 
The bus interrupt 
control 
register 
T.S. 
determines 
how this pin is interpreted. 
If INT3, it has the same interpretation 
as the 
INTO and INT1 pins. If INTA, it is used as an output to control 
interrupt-acknowledge 
transactions. 
The INTA output is latched on-chip and remains valid during Td cycles; 


as an output, it is open drain. INT3 must be pulled HIGH during RESET. 


N.C. 
N/A 
NOT CONNECTED: 
Indicates 
pins should not be connected. 
Never connect 
any pin 
marked N.C.; these pins may be reserved 
for factory use. 
. 


I/O = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.S. = Three-state 
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2.0 
ELECTRICAL 
SPECIFICATIONS 


2.1 
Power and Grounding 


The 80960S8 
is implemented 
in CHMOS 
IV technol- 
ogy and therefore 
has modest 
power 
requirements. 
Its high clock 
frequency 
and numerous 
output 
buff- 
ers (address/data, 
control, 
error and arbitration 
sig- 
nals) 
can 
cause 
power 
surges 
as multiple 
output 
buffers 
simultaneously 
drive 
new signat levels. 
For 
clean on-chip 
power 
distribution, 
vcc and Vss 
pins 
separately 
feed the device's 
functional 
units. Power 
and 
ground 
connections 
must 
be 
made 
to 
all 
80960S8 
power 
and 
ground 
pins. 
On 
the 
circuit 
board, all vcc pins must be strapped 
closely togeth- 
er, preferably 
on a power plane; all Vss pins should 
be strapped 
together, 
preferably 
on a ground plane. 


2.2 
Power Decoupling 
Recommendations 


Place 
a liberal 
amount 
of decoupling 
capacitance 
near the 80960S8. 
When driving the bus the proces- 
sor can 
cause 
transient 
power 
surges, 
particularly 
when connected 
to a large capacitive 
load. 


low 
inductance 
capacitors 
and 
interconnects 
are 
recommended 
for best high frequency 
electrical 
per- 
formance. 
Inductance 
is 
reduced 
by 
shortening 
board traces 
between 
the processor 
and decoupling 
capacitors 
as much as possible. 


2.3 
Connection 
Recommendations 


For reliable 
operation, 
always 
connect 
unused 
in- 
puts to an appropriate 
signal 
level. 
In particular, 
if 
one or more interrupt 
lines are not used, they should 
be pulled up. No inputs should ever be left floating. 


The IT5Ci( open drain pin requires 
a pullup resistor 
whether 
or not the pin is used as an output. While in 
some 
cases 
a simple 
pullup 
resistor 
will 
be ade- 
quate, 
a network 
of pullup 
and 
pulldown 
resistors 
biased to a valid VIH (> 2.0 V) and terminated 
in the 
characteristic 
impedance 
of the circuit 
board is rec- 
ommended. 
Figures 5 and 6 show recommended 
re- 
sistor values 
for both a low and high current 
drive 
network, 
which assumes 
a circuit board characteris- 
tic impedance 
of 100 .0.. 
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Do not connect 
external 
logic to pins marked 
N.C. 


OPEN-DRAIN 
OUTPUT 


470n 


390 n 


I 


Low Drive 
Network: 


VOH = 2.45V 
10 3.0V 
IOL = 9.5 mA 1012 
mA 
272207-4 


Figure 5. Connection 
Recommendations 
for Low Current 
Drive Network 


OPEN-DRAIN 
OUTPUT 


270 
n 


220 
n 
I 
_ 
I 


High Drive 
Network: 
VOH = 2.48V 
10 3.0V 
IOL = 16mAl020mA 
272207-5 


Figure 6. Connection 
Recommendations 
for High Current 
Drive Network 


2.4 
Characteristic 
Curves 


Figure 7 shows 
typical 
supply 
current 
requirements 
over the operating 
temperature 
range of the proces- 
sor at supply 
voltage 
(Vcd 
of 5V. Figure 
8 shows 
the 
typical 
power 
supply 
current 
(led 
that 
the 
80960S8 
requires 
at various 
operating 
frequencies 
when measured 
at three input voltage 
(Vcd 
levels. 


For a given output current 
(loll 
the curve in Figure 9 
shows the worst case output 
low voltage 
(VoLl. 
Fig- 
ure 10 shows 
the typical 
capacitive 
derating 
curve 
for the 80960S8 
measured 
from 1.5V on the system 
clock (ClK) 
to 0.8V on the falling edge and 2.0V on 
the 
rising 
edge 
of 
the 
bus 
address/data 
(AD) 
signals. 


• 
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Figure 7. Typical 
Supply Current 
vs Case Temperature 
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Figure 8. Typical Current 
vs Frequency 
(Room Temp) 
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Figure 9. Typical Current vs Frequency (Hot Temp) 
• 
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Figure 10.Capacitive Derating Curve 
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2.5 
Test Load Circuit 


Figure 11 illustrates the load circuit used to test the 
8096088's 
three-state pins; Figure 12 shows the 
load circuit used to test the open drain output. The 
open drain test uses an active load circuit in the form 
of a matched diode bridge. 8ince the open drain out- 
put sinks current, only the IOl legs of the bridge are 
necessary and the IOHlegs are not used. When the 
8096088 driver under test is turned off, the output 
pin is pulled up to VREF (Le., VOH). Diode 01 is 
turned off and the IOl current source flows through 
diode 02. 


When the 8096088 open drain driver under test is 
on, diode 01 is also on and the voltage on the pin 
being tested drops to VOL. Diode 02 turns off and 
IOl flows through diode 01. 
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THREE-STATE 
OUTPUT 
? 


~ 
272207-10 
CL ~ 50 pF for all signals 


Figure 11. Test Load Circuit 
for Three-State 
Output 
Pins 


OPEN-DRAIN 
OUTPUT 
0, 


272207-11 


IOL Tested at 12 and 20 mA 
VREF ~ vcc 
01 and 02 are matched 


Figure 12. Test Load Circuit 
for Open Drain Output 
Pins 
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2.6 
ABSOLUTE 
MAXIMUM 
RATINGS* 
NOTICE: This data sheet contains preliminary 
infor- 


mation on new products in production. The specifica- 
tions are subject to change without notice. Verity with 
your local Intel Sales office that you have the latest 
data sheet before firializing a design. 


Operating 
Temperature 
(PLCC) 
O·C to + 85·C Case 


Operating 
Temperature 
(QFP) .O·C to + 1OO·C Case 


Storage Temperature 
- 65·C to + 150·C 


Voltage 
on Any Pin (PLCC) 
- 0.5V to vcc + 0.5V 


Voltage 
on Any Pin (QFP) .. -0.25V 
to Vcc 
+ 0.25V 


Power Dissipation 
1.9W (16 MHz) 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause permanent 
damage .• 


These are stress ratings 
only. Operation 
beyond 
the 


"Operating 
Conditions" 
is not recommended 
and ex- 
tended exposure 
beyond 
the 
"Operating 
Conditions" 


may affect device reliability. 


2.7 
DC Characteristics 
80960S8 
(10 MHz QFP) 


80960S8 
(10 and 16 MHz PLCC) 


TCASE = O·C to +100·C, 
vcc = 5V ±5% 


TCASE = O·C to + 85·C, vcc = 5V ± 10% 


Table 6. DC Characteristics 


Symbol 
Parameter 
Min 
Max 
Units 
Notes 


Vll 
Input Low Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
vcc +0.3 
V 


VCl 
CLK2 Input Low Voltage 
-0.3 
+0.8 
V 


VCH 
CLK2 Input High Voltage 
0.7 vcc 
vcc +0.3 
V 


VOl 
Output Low Voltage 
0.45 
V 
IOl = 2.5 mA 
0.45 
V 
IOl = 12 mA, LOCK Pin 
0.60 
V 
IOl = 20 mA, LOCK Pin 


VOH 
Output High Voltage 
2.4 
V 
All TS, -2.5 
mA (1) 


tee 
Power Supply Current: 
10 MHz-QFP 
280 
mA 
TCASE = O·C 
10 MHz-PLCC 
280 
mA 
TCASE = O·C 
16 MHz-PLCC 
350 
mA 
TCASE = O·C 


IU1 
Input Leakage Current, 
±15 
/LA 
0::;; VIN ::;;vcc 
Except INTO, LOCK 


IU2 
input Leakage Current, 
-300 
/LA 
VIN = 0.45V(2) 
iNTO, LOCK 


ILO 
Output Leakage Current 
±15 
/LA 


CIN 
Input Capacitance 
10 
pF 
fc = 1 MHz(3) 


Co 
Output Capacitance 
12 
pF 
fc = 1 MHz(3) 


CClK 
Clock Capacitance 
10 
pF 
fc = 1 MHz(3) 


•• 


NOTES: 
1. Not measured for open-drain output. 
2. INTOand LOCK have internal puliup devices. 
3. Input, output and clock capacitance are not tested. 
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This section 
describes 
the AC specifications 
for the 
8096058 pins. All input and output timings are spec- 
ified relative 
to the 1.5V level of the rising edge of 
CLK2 and refer to the time at which the signal cross- 
es 1.5V (for output 
delay 
and input 
setup). 
All AC 


testing 
should 
be done with input voltages 
of OAV 
and 2.4V, except 
for the clock 
(CLK2) which 
should 
be tested with input voltages 
of OA5V and 0.7 x Vee. 


See Figure 13 and Tables 8 and 9 for timing relation- 
ships for the 8069058 signals. 


2.8 
AC Specifications 


EDGE 


CLK2 


OUTPUTS: 


ADI5:1,A3:1,DO 


A31:16, 
BE-l:0 
W/R, DEN, BLAST 


HLDA, 
LOCK,INTA 


ALE 


DT/R 


INPUTS: 


ADI5:1,DO 


INTO, INT 1 


INT2,INT3 


HOLD 


LOCK 


READY 


A 
C 
C 
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Figure 13. Drive Levels and Timing Relationships 
for 80960S8 
Signals 


8096088 


2.8.1 
AC SPECIFICATION 
TABLES 


Table 7. 80960SB AC Characteristics 
(10 MHz) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Input Clock 


T1 
Processor Clock 
50 
125 
ns 
VIN = 1.5V 
Period (CLK2) 


T2 
Processor Clock 
8 
ns 
VT = 10% Point 
Low Time (CLK2) 
= VCl + (VCH - 
VcuxO.1 


T3 
Processor Clock 
8 
ns 
VT = 90% Point 
High Time (CLK2) 
= VCl + (VCH - 
Vcu x 0.9 


T4 
Processor Clock 
10 
ns 
VT = 90% to 10% Point (1) 
Fall Time (CLK2) 


Ts 
Processor Clock 
10 
ns 
VT = 10% to 90% Point (1) 
Rise Time (CLK2) 


Synchronous Outputs 


T6 
Output Valid Delay 
2 
31 
ns 


T6AS 
AS Output Valid Delay 
2 
25 
ns 


T7 
ALE Width 
T1-11 
ns 


Ta 
ALE Output Valid Delay 
4 
33 
ns 


Tg 
Output Float Delay 
2 
20 
ns 
(2) 


Synchronous Inputs 


T10 
Input Setup 1 
10 
ns 


T11 
Input Hold 
2 
ns 


T12 
Input Setup 2 
13 
ns 


T13 
Setup to ALE Inactive 
10 
ns 


T14 
Hold After ALE Inactive 
8 
ns 


T1S 
RESET Hold 
3 
ns 
(3) 


T16 
RESET Setup 
5 
ns 
(3) 


T17 
RESET Width 
2050 
ns 
41 CLK2 Periods Minimum 


NOTES: 
1. Processorclock (ClK2) rise time and fall time are not tested. 
2. A float conditionoccurswhen the maximumoutput currentbecomeslessthan ILO.Float delay is not tested, but shouldbe 
no longerthan the valid delay. 
3. Meeting~ 
setup and hold times is an optional method of synchronizingyour clocks. If you decide to use an asyn- 
chronous reset, synchronizingthe clock can be accomplishedby usingAS. 
. 
, 


• 
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Table 
8. 8096058 
AC Characteristics 
(16 MHz) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Input Clock 


T1 
Processor 
Clock 
31.25 
125 
ns 
VIN = 1.5V 
Period (CLK2) 


T2 
Processor 
Clock 
8 
ns 
VT = 10% Point 
Low Time (CLK2) 
= VCl + (VCH - 
VcuxO.1 


T3 
Processor 
Clock 
8 
ns 
VT = 90% Point 
High Time (CLK2) 
= VCl + (VCH - 
Vcu 
x 0.9 


T4 
Processor 
Clock 
10 
ns 
VT = 90% to 10% Point (1) 
Fall Time (CLK2) 


T5 
Processor 
Clock 
10 
ns 
VT = 10% to 90% Point(1) 
Rise Time (CLK2) 


Synchronous 
Outputs 


T6 
Output Valid Delay 
2 
25 
ns 


T6AS 
AS Output Valid Delay 
2 
21 
ns 


T7 
ALE Width 
T1-11 
ns 


Ta 
ALE Output Valid Delay 
2 
22 
ns 


T9 
Output Float Delay 
2 
20 
ns 
(2) 


Synchronous 
Inputs 


T10 
Input Setup 1 
10 
ns 


T11 
Input Hold 
2 
ns 


T12 
Input Setup 2 
13 
ns 


T13 
Setup to ALE Inactive 
10 
ns 


T14 
Hold After ALE Inactive 
8 
ns 


T15 
RESET Hold 
3 
ns 
(3) 


T16 
RESET Setup 
5 
ns 
(3) 


TH 
RESET Width 
1281 
ns 
41 CLK2 Periods Minimum 


NOTES: 
1. Processor clock (ClK2) rise time and fall time are not tested. 
2. A float condition occurs when the maximum output current becomes less than ILO.Float delay is not tested, but should be 
no longer than the valid delay. 


3. Meeting RESET setup and hold times is an optional method of synchronizing your clocks. If you decide to use an asyn- 
chronous reset, synchronizing the clock can be accomplished by using AS. 
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Figure 14. Processor 
Clock Pulse (CLK2) 
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Figure 15. RESET Signal Timing 
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Figure 16. HOLD Timing 
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3.0 
MECHANICAL DATA 


3.1 
Packaging 


The S0960S8 
is available 
in-two package 
types: 


• 
SO-lead quad flat pack (EIAJ QFP). Shown in Fig- 
ure 17. 


• 
S4-lead 
plastic 
leaded 
chip 
carrier 
(PLCC). 
Shown 
in Figure 1S. 


Dimensions 
for both package 
types are given in the 
Intel Packaging 
handbook 
(Order 
#240S00). 


intel~ 


3.2 
PinAssignment 


The QFP and PLCC have different 
pin assignments. 


The QFP pins are numbered 
in order from 
1 to SO 
around 
the package 
perimeter. 
The PLCC pins are 
numbered 
in order from 1 to S4 around the package 
perimeter. 
Table 9 and Table 
10 list the function 
of 
each QFP pin; Table 
11 and Table 
12 list the func- 
tion of each PLCC pin. 


vcc and GND connections 
must be made to multi- 
ple vcc and GND pins. Each vcc and GND pin must 
be connected 
to the appropriate 
voltage 
or ground 
and externally 
strapped 
close 
to the package. 
It is 
recommended 
that you include 
separate 
power and 
ground 
planes 
in your circuit 
board for power 
distri- 
bution. 


Pins identified 
as N.C. (No Connect) 
should never be 
connected. 
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14 
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A17 
15 
71 
VCC 


A16 
16 
70 
LOCK 


VCC 
17 
69 
BLAST 


VSS 
18 
68 
DT/R 


AD15 
19 
67 
DEN 


AD14 
20 
• N80960SB -16 
66 
W/R 


VCC 
21 I 


65 
NC 


VSS 
22 
XXXXXXXX 
64 
HOLD 


NC 
23 
XXXXXX 
63 
VSS 


AD13 
24 
XXXXXX 
62 
VCC • 


AD12 
25 
61 
HLDA 


ADll 
26 
60 
INT3/INTA 


AD10 
27 
59 
INT2/INTR 


AD9 
28 
58 
INTl 


AD8 
29 
57 
INTO 


AD7 
30 
56 
RESET 


VCC 
31 
55 
CLK2 


VSS 
VSS 


Figure 18. 84-Lead 
Plastic Leaded 
Chip Carrier (PLCC) Package 
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7 
A16 
27 
AD4 
47 
INT1 
67 
READY 


8 
VCC 
28 
AD3 
48 
INT2/INTR 
68 
A31 


9 
VSS 
29 
AD2 
49 
INT3/INTA 
69 
A30 


10 
AD15 
30 
AD1 
50 
HLDA 
70 
A29 


11 
AD14 
31 
DO 
51 
VCC 
71 
A28 


12 
VCC 
32 
Vss 
52 
VSS 
72 
VSS 


13 
VSS 
33 
VCC 
53 
HOLD 
73 
Vcc 


14 
AD13 
34 
A3 
54 
W/R 
74 
A27 


15 
AD12 
35 
A2 
55 
DEN 
75 
A26 


16 
AD11 
36 
VCC 
56 
DT/R 
76 
A25 


17 
AD10 
37 
Vss 
57 
BLAST 
77 
Vcc 


18 
AD9 
38 
A1 
58 
LOCK 
78 
VSS 


19 
AD8 
39 
N.C. 
59 
VCC 
79 
A24 


20 
AD7 
40 
BE1 
60 
Vss 
80 
A23 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 
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Table 
10. 80960S8 
QFP Plnout-In 
Signal 
Order 


Signal 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 
Pin 


A1 
38 
A18 
5 
DO 
31 
Vcc 
51 


A2 
35 
A19 
4 
DEN 
55 
Vcc 
59 


A3 
34 
A20 
3 
DT/R 
56 
Vcc 
61 


AD1 
30 
A21 
2 
HLDA 
50 
Vcc 
73 


AD2 
29 
A22 
1 
HOLD 
53 
Vcc 
77 


AD3 
28 
A23 
80 
INTO 
46 
Vcc 
8 


AD4 
27 
A24 
79 
INT1 
47 
Vss 
13 


AD5 
26 
A25 
76 
INT211NTR 
48 
Vss 
22 


AD6 
25 
A26 
75 
INT3/1NTA 
49 
Vss 
24 


AD7 
20 
A27 
74 
I15CK 
58 
Vss 
32 


AD8 
19 
A28 
71 
N.C. 
39 
VSS 
37 


AD9 
18 
A29 
70 
N.C. 
63 
Vss 
43 


AD10 
17 
A30 
69 
READY 
67 
Vss 
52 


AD11 
16 
A31 
68 
RESET 
45 
Vss 
60 


AD12 
15 
ALE 
66 
Vcc 
12 
Vss 
62 


AD13 
14 
AS 
64 
Vcc 
21 
Vss 
72 


AD14 
11 
BEO 
41 
Vcc 
23 
Vss 
78 


AD15 
10 
BE1 
40 
Vcc 
33 
Vss 
9 


A16 
7 
BLAST 
57 
Vcc 
36 
Vss 
65 


A17 
6 
CLK2 
44 
Vcc 
42 
W/R 
54 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 
• 
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Table 11.8096058 
PLCC Plnout-In 
Pin Order 


Pin 
Signal 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 


1 
Vcc 
22 
Vss 
43 
Vss 
64 
HOLD 


2 
N.C. 
23 
N.C. 
44 
Vcc 
65 
N.C. 


3 
A27 
24 
AD13 
45 
A3 
66 
W/R 


4 
A26 
25 
AD12 
46 
A2 
67 
DEN 


5 
A25 
26 
AD11 
47 
Vcc 
68 
DT/R 


6 
Vcc 
27 
AD10 
48 
Vss 
69 
BLAST 


7 
Vss 
28 
AD9 
49 
A1 
70 
LOCK 


8 
A24 
29 
AD8 
50 
N.C. 
71 
Vcc 


9 
A23 
30 
AD7 
51 
BE1 
72 
Vss 


10 
A22 
31 
Vcc 
52 
BEO 
73 
Vcc 


11 
A21 
32 
Vss 
53 
Vcc 
74 
Vss 


12 
A20 
33 
Vcc 
54 
Vss 
75 
N.C. 


13 
A19 
34 
Vss 
55 
CLK2 
76 
AS 


14 
~18 
35 
AD6 
56 
RESET 
77 
Vss 


15 
A17 
36 
AD5 
57 
INTO 
78 
ALE 


16 
A16 
37 
AD4 
58 
INT1 
79 
READY 


17 
VCC 
38 
AD3 
59 
INT2/INTR 
80 
A31 


18 
Vss 
39 
D2 
60 
INT3/INTA 
81 
A30 


19 
AD15 
40 
D1 
61 
HLDA 
82 
A29 


20 
AD14 
41 
DO 
62 
Vcc 
83 
A28 


21 
Vcc 
42 
N.G. 
63 
Vss 
84 
Vss 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 
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Table 12.80960S8 PLCC Plnout-In 
Signal Order 


Signal 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 
Pin 


A1 
49 
A18 
14 
DT/R 
68 
Vcc 
44 


A2 
46 
A19 
13 
HLDA 
61 
Vcc 
47 


A3 
45 
A20 
12 
HOLD 
64 
Vcc 
53 


DO 
41 
A21 
11 
INTO 
57 
Vcc 
6 


AD1 
40 
A22 
10 
INT1 
58 
Vcc 
62 


AD2 
39 
A23 
9 
INT211NTR 
59 
Vcc 
71 


AD3 
38 
A24 
8 
INT3/1NTA 
60 
Vcc 
73 


AD4 
37 
A25 
5 
LOCi< 
70 
Vss 
18 


AD5 
36 
A26 
4 
N.G. 
2 
Vss 
22 


AD6 
35 
A27 
3 
N.G. 
23 
Vss 
32 


AD7 
30 
A28 
83 
N.G. 
42 
Vss 
34 


AD8 
29 
A29 
82 
N.G. 
50 
Vss 
43 


AD9 
28 
A30 
81 
N.G. 
65 
Vss 
48 


AD10 
27 
A31 
80 
N.G. 
75 
Vss 
54 


AD11 
26 
ALE 
78 
READY 
79 
Vss 
63 


AD12 
25 
AS 
76 
RESET 
56 
Vss 
7 


AD13 
24 
BEO 
52 
Vcc 
1 
Vss 
72 


AD14 
20 
BE1 
51 
Vcc 
17 
Vss 
74 


AD15 
19 
BLAST 
69 
Vcc 
21 
Vss 
77 


AD16 
16 
GLK2 
55 
Vcc 
31 
Vss 
84 


A17 
15 
DEN 
67 
Vcc 
33 
W/R 
66 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 
• 
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3.3.1 PACKAGE THERMAL SPECIFICATION 
Compute P by multiplying the maximum voltage by 
the typical current at maximum temperature. Values 
for OJAand 0JCfor various airflows are given in Ta- 
ble 13 for the QFP package and in Table 14 for the 
PLCC package. tee at maximum temperature is typi- 
cally 80 percent of specified tee maximum (cold). 


The 8096058 is specified for operation when case 
temperature is within the range O·C to +85·C 
(PLCC) or O·C to + 100·C (QFP). Measure case 
temperature at the top center of the package. Ambi- 
ent temperature can be calculated from: 


TJ = Tc + P*OJC 
TA = TJ - 
P*OJA 
Tc = TA + P*[OJA-OJcl 


Thermal Reslstance-·C/Watt 


Parameter 
Alrflow-ft.lmln 
(m/sec) 


0 
50 
100 
200 
400 
600 
800 
o Junction-to-Ambient (Case 
measured in the middle of the 
54 
52 
49 
45 
39 
35 
33 
top of the package) 
(No heatsink) 
o Junction-to-Case 
11 
11 
11 
11 
11 
11 
11 


Table 13.8096058 QFP Package Thermal Characteristics 


NOTE: 
This table applies 
to 80960S8 
QFP soldered 
directly 
to board. 


Table 14.8096058 PLCC Package Thermal Characteristics 


Thermal Reslstance-·C/Watt 


Parameter 
Alrflow-ft.lmln 
(m/sec) 


0 
50 
100 
200 
400 
600 
800 
1000 
o Junction-to-Ambient 
33 
31 
28.5 
27 
24 
22 
20 
19.5 
(No heatsink) 
e Junction-to-Case 
11 
11 
11 
11 
11 
11 
11 
11 


NOTE: 
This table applies 
to 80960S8 
PLCC soldered 
directly 
to board. 
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Read and Write 
Transactions 
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Wait States 
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Figure 20. Quad Word Burst Read Transaction 
with 1,0,0,0,0,0,0, °Wait States 
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Figure 22. Accesses Generated by Quad Word Read Bus Request, 
Misaligned One Byte from Quad Word Boundary (1, 0, 0, 0, 0, 0, 0, 0) Wait States 
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5.0 
REVISION 
HISTORY 


This data sheet 
supersedes 
data sheet 270917-004, 
which 
applied 
to both the 80960SA 
and the 80960S8. 
The 80960SA 
is now documented 
in 272206-001. 


The sections 
significantly 
changed 
since the previous 
revision are: 


Section 
Last 
Description 
Rev. 


2.3 Connection 
Recommendations 
-004 
Deleted corresponding 
graph of Open Drain Voltage vs. 
(pg. 15) 
Output Current. 


Figure 7. Typical Supply Current vs. 
-004 
Regraphed 
data in three graphs instead of two. 


Case Temperature 
(pg. 16), Figure 
8. Typical Current vs. Frequency 
(Room Temp) (pg. 16) and Figure 
9. Typical Current vs. Frequency 
(Hot Temp) (pg. 17) 


Table 6. DC Characteristics 
-004 
Input Leakage Current (ll12) specification 
added to 
(pg. 19) 
accurately 
describe 
leakage of INTO and LOCK as inputs. 


Table 7. 80960S8 
AC 
-004 
T7 minimum specification 
improved: 
Characteristics 
(10 MHz) (pg. 21) 
Power Supply Current: 
Was: 
Is: 


and Table 8. 80960S8 
AC 
10MHz 
24 ns 
T1-11 
ns 
Characteristics 
(16 MHz) (pg. 22) 
16MHz 
15 ns 
T1-11 
ns 


The sections 
significantly 
changed 
between 
revisions 
-003 and -004 of the 80960SAlS8 
data sheet were: 


Section 
Last 
Description 
Rev. 


DC Characteristics 
(pg. 15). 
-003 
Operating 
temperature 
for PLCC package 
changed: 
Was: 
TCASE = O°C to + 100°C 
Is: 
TCASE = O°C to + 85°C 
The test program 
has not changed. 


Table 7. QFP Package, Thermal 
-003 
Corrected 
QFP Package Therma] 
Resistance 
values: 
Resistance-°C/Watt 
(pg. 21) 
for (JJA at 0 ft/min 
airflow: 
for (JJC at 0 ft/min 
airflow: 


Was: 
45.7°/W 
Was: 
4°/W 
Is: 
54°/W 
Is: 
11°/W 


Table 8. PLCC Package, Thermal 
-003 
Corrected 
PLCC Package Thermal 
Resistance 
values: 
Resistance-°C/Watt 
(pg. 22) 
for (JJA: 
at 50 ft/min 
airflow 
at 100 ftl min airflow 
Was: 
NA 
Was: 
NA 
Is: 
31 
Is: 
28.5 


for (JJC: 
at 0 ft/min 
airflow 
at 50 ft -1000 
ftl min airflow 
Was: 
13 
Was: 
NA 
Is: 
11 
Is: 
11 


Table 9. 8096SA and 80960S8 
-003 
Signal A 12 incorrectly 
shown as Pin 28; is now correctly 
QFP Pinout - 
In Pin Order (pg. 23) 
shown as Pin 38. Note added to clarify No Connect 
Pins. 
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i960 KA/KB 
PROCESSOR 
PRODUCT OVERVIEW 


INTRODUCTION 


This chapter provides an overview of the Intel i960 KB 
processor (which is part of the i960 K series of embed- 
ded-processor products). 


All of the processors in the i960 K series of products 
are based on the Inte! i960 architecture. Most of the 
information in this overview also applies to the i960 
KA processor. The only difference between the i960 
KB and i960 KA processors is that the i960 KA proc- 
essor does not provide on-chip support for tloating- 
point operations or operations on decimal numbers. 


OVERVIEW 
OF THE i960 KB 
ARCHITECTURE 


The i960 KB processor introduced the i960 architec- 
ture-a 
new 32-bit architecture from Inte!. This archi- 


tecture has been designed to meet the needs of embed- 
ded applications 
such as machine control, robotics, 
process control, avionics and instrumentation. 


The i960 architecture can best be characterized as a 
high-performance computing engine. It features high- 
speed instruction execution and ease of programming. 
It is also easily extensible, allowing processors and con- 
trollers based on this architecture to be conveniently 
customized to meet the needs of specific processing and 
control applications. 


The following are some of the important attributes of 
the i960 architecture: 
• full 32-bit registers 
• high-speed, pipelined instruction execution 
• a convenient program execution environment with 
32 general-purpose registers and a versatile set of 
special-function registers 
• a highly optimized procedure call mechanism that 
features on-chip caching of local variables and pa- 
rameters 
• extensive facilities for handling interrupts and faults 
• extensive tracing facilities to support efficient pro- 
gram debugging and monitoring 
• register scoreboarding and write buffering to permit 
efficient operation when used with lower perform- 
ance memory subsystems 
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OVERVIEW 
OF THE SINGLE 
PROCESSOR 
SYSTEM 
ARCHITECTURE 


The central processing module, memory module and 
1/0 module form the natural boundaries for the hard- 
ware system architecture. The modules are connected 
together by the high bandwidth 
32-bit multiplexed 
L-bus, which can transfer data at a maximum sustained 
rate of 53 Mbytes per second for an i960 processor op- 
erating at 20 MHz. 


Figure I shows a simplified block diagram of one possi- 
ble system configuration. The heart of this system is the 
i960 KB processor, which fetches instructions, executes 
code, manipulates 
stored information 
and interacts 
with 1/0 devices. The high bandwidth L-bus connects 
the i960 KB processor to memory and 1/0 modules. 
The i960 KB processor stores system data, instructions 
and programs in the memory module. By accessing var- 
ious peripheral devices in the 1/0 module, the i960 KB 
processor supports communication to terminals, mo- 
dems, printers, disks and other 1/0 devices. 


i960 KB Processor and the L-Bus 


The i960 KB processor performs bus operations using 
multiplexed address and data signals, and provides all 
the necessary control signals. For example standard 
control signals, such as Address Latch Enable (ALE), 
Address/Data 
Status (ADS), Write/Read 
Command 
(WIR), Data TransmitIReceive (DTIR) 
and Data En- 
able (DEN), are provided by the i960 KB processor. 
The i960 processor also generates byte enable signals 
that specify which bytes on the 32-bit data lines are 
valid for the transfer. 


The L-bus supports burst transactions, which access up 
to four data words at a maximum rate of one word per 
clock cycle. The i960 KB processor uses the two low- 
order address lines to indicate how many words are to 
be transferred. The i960 KB processor performs burst 
transactions to load the on-chip 512-byte instruction 
cache to minimize memory accesses for instruction 
fetches. Burst transactions can also be used for data 
access. 


To transfer control of the bus to an external bus master, 
the i960 KB provides two arbitration signals: hold re- 
quest (HOLD) and hold acknowledge (HLDA). After 
receiving HOLD, the processor grants control of the 
bus to an external master by asserting HLDA. 
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Figure 1. Basic 1960 KB System Configuration 


The i960 KB processor provides a flexible interrupt 
structure by using an on-chip interrupt controller, an 
external interrupt controller or both. The type of inter- 
rupt structure is specified by an internal interrupt vec- 
tor register. For a system with multiple processors, 
another method is available, called inter-agent commu- 
nication (lAC) where a processor can interrupt another 
processor by sending an lAC message. 


Memory Module 


A memory module can consist of a memory controller, 
Erasable 
Programmable 
Read 
Only 
Memory 


(EPROM), 
and static or dynamic Random 
Access 
Memory (RMA). The memory controller first condi- 
tions the L-bus signals for memory operation. It demul- 
tiplexes the address and data lines, generates the chip 
select signals from the address, detects the start of the 
cycle for burst mode operation and latches the byte 
enable signals. 


The memory controller generates the control signals for 
EPROM, SRAM and DRAM. Specifically, it provides 
the control signals, multiplexed row/column 
address 


and refresh control for dynamic RAMs. The controller 
I 


can be designed to accommodate the burst transaction 
of the i960 KB processor by using the static column 
mode or nibble mode features of the dynamic RAM. In 
addition to supplying the operational signals, the con- 
troller generates the READY signal to indicate that 
data can be transferred to or from the i960 KB proces- 
sor. 


The 
i960 KB 
processor 
directly 
addresses 
up 
to 
4 Gbytes of physical memory. The processor does not 
allow burst accesses to cross a 16-byte boundary, to 
ease the design of the controller. Each address specifies 
a four-byte data word within the block. Individual data 
bytes can be accessed by using the four byte-enable sig- 
nals from the i960 KB processor. Chapter 5 provides 
design guidelines for the memory controller. 


1/0 Module 


The I/O module consists of the I/O components and 
the interface circuit. I/O components can be used to 
allow the i960 KB processor to use most of its clock 
cycles for computational and system management ac- 
tivities. Time consuming tasks can be off-loaded to spe- 
cialized slave-type components, such as the 8259A Pro- 
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grammable Interrupt 
Controller or the 82530 Serial 
Communication Controller. Some tasks may require a 
master-type component, such as the 82586 Local Area 
Network Control. 


The interface circuit performs several functions. It de- 
multiplexes the address and data lines, generates the 
chip select signals from the, address, produces the I/O 
read or I/O write command from the processor's W/R 
signal, latches the byte enable signals and generates the 
READY 
signals. Since some of these functions are 
identical to those of the memory controller, the same 
logic can be used for both interfaces. For master-type 
peripherals that operate on a 16-bit data bus, the inter- 
face circuit translates the 32-bit data bus to a l6-bit 
data bus. 


The i960 KB processor uses memory-mapped addresses 
to access I/O devices. This allows the CPU to use many 
of the same instuctions to exchange information for 
both memory and peripheral devices. Thus, the power- 
ful memory-type instructions can be used to perform 8-, 
16- and 32-bit data transfers. 


HIGH PERFORMANCE 
PROGRAM 
EXECUTION 


Much of the design of 'the i960 architecture has been 
aimed at maximizing the processor's computational 
and data processing speed through the use of increased 
parallelism. The following paragraphs describe several 
of the mechanisms and techniques used to accomplish 
this goal. 


Load and Store Model 


One of the more important features of the i960 archi- 
tecture is its performance of most operations on oper- 
ands in registers, rather than in memory. For example, 
all arithmetic, logic, comparison, branching and bit op- 
erations are performed with registers and literals. 


This feature provides two benefits. First, it increases 
program execution speed by minimizing the number of 
memory accesses necessary to execute a program. Sec- 
ond, it reduces the memory latency encountered when 
using slower, lower-cost memory parts. 


To support this concept, the architecture provides a 
generous supply of general-purpose registers. For each 
procedure, 32 registers are available, 28 of which are 
available for general use. These registers are divided 
into two types: global and local. Both types of registers 
can be used for general storage of operands. The only 
difference is that global registers retain their contents 
across procedure boundaries, whereas the processor al- 
locates a new set of local registers each time a new 
procedure is called. 
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The architecture also provides a set of fast, versatile 
load and store instructions. These instructions allow 
burst transfers of I, 2, 4, 8, 12 or 16 bytes of informa- 
tion between memory and the registers. 


On-Chip Caching of Code and Data 


To further reduce memory accesses, the architecture 
offers two mechanisms for caching code and data on 
chip: an instruction cache and multiple sets of local 
registers. The instruction cache allows prefetching of 
blocks of instruction from memory. This helps ensure 
that the instruction execution pipeline is supplied with 
a steady stream of instructions. 
It also reduces the 
number of memory accesses required when performing 
iterative operations such as loops. The architecture al- 
lows the size of the instruction cache to vary. For the 
i960 KB processor, it is 512 bytes. 


To optimize the architecture's 
procedure call mehan- 
ism, the processor provides multiple sets of local regis- 
ters. This allows the processor to perform procedure 
calls without having to write the local registers out to 
the stack in memory. The number of register sets de- 
pends on the processor implementation. The i960 KB 
processor provides four sets of local registers. 


Overlapped 
Instruction 
Execution 


The i960 architecture also enchances program execu- 
tion speed by overlapping the execution of some in- 
structions. In the i960 K series of processors, this is 
accomplished through register scoreboarding. 


Register scoreboarding permits instruction execution to 
continue while data is being fetched from memory. 
When a load instruction is executed, the processor sets 
one.or more scoreboard bits to indicate the target regis- 
ters to be loaded. After the target registers are loaded, 
the scoreboard bits are cleared. While the target regis- 
ters are being loaded, the processor is allowed to exe- 
cute other instructions that do not use these registers. 


The processor uses the scoreboard bits to ensure that 
the target registers are not used until the load is com- 
plete. (Scoreboard bits are checked transparently from 
software.) This technique allows code to be executed 
such that some instructions can be executed in zero 
clock cycles (that is, executed for free). 


Single-Clock 
Instructions 


The i960 architecture is designed to let a processor exe- 
cute commonly used instructions, such as moves, adds, 
subtracts, logical operations and branches, in a mini- 
mum number of clock cycles (preferably one cycle). 
The 
architecture 
supports 
this 
concept 
in 
several 
I 


ways. For example, the load and store model described 
earlier eliminates the clock cycles required to perform 
memory-to-memory 
operations, 
by concentrating 
on 
register-to-register operations. 


In addition, all of the instructions in the i960 architec- 
ture are 32 bits long and aligned on 32-bit boundaries. 
This lets instructions be decoded in one clock cycle, 
and eliminates the need for an instruction-alignment 
stage in the pipeline. 


The i960 KB processor takes full advantage 'of these 
features of the architecture, resulting in more than 50 
instructions that can be executed in a single clock cycle. 


Efficient 
Interrupt 
Model 


The i960 architecture provides an efficient mechanism 
for servicing interrupts from external sources. To han- 
dle interrupts, the processor maintains an interrupt ta- 
ble of 248 interrupt vectors, 240 of which are available 
for general use. When an interrupt is signaled, the proc- 
essor uses a pointer to the interrupt table to perform an 
implicit call to an interrupt handler procedure. In per- 
forming this call, the processor automatically saves the 
state of the processor prior to receiving the interrupt, 
performs the interrupt routine, then restores the state of 
the processor, A separate interrupt stack is also provid- 
ed to segregate interrupt 
handling from application 
programs. 


The interrupt handling facilities also allow interrupts to 
be evaluated by priority, The processor is then able to 
store interrupt vectors that are lower in priority than 
the current processor task in a pending interrupt sec- 
tion of the interrupt table. The processor checks and 
services the pending interrupts at defined times. 


SIMPLIFIED 
PROGRAMMING 
ENVIRONMENT 


Because of its streamlined 
execution 
environment, 
processors based on the i960 architecture are particu- 
larly easy to program. The following paragraphs de- 
scribe some of the architecture features that simplify 
programming. 


Highly Efficient 
Procedure 
CaU 
Mechanism 


The procedure call mechanism makes procedure calls 
and parameter passing between procedures simple and 
compact. Each time a call instruction is issued, the 
processor automatically saves the current set of local 
registers and allocates a new set for the called proce- 
dure. Likewise, on a return from a procedure, the cur- 
rent set of local registers is deallocated and the local 
I 
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registers for the procedure being returned to are re- 
stored. This means a program never has to explicitly 
save and restore those local variables that are stored in 
local registers. 


Versatile 
Instruction 
Set and 
Addressing 


The selection of instructions and addressing modes also 
simplifies programming. A full set of load, store, move, 
arithmetic, 
comparison 
and branch 
instructions 
are 
provided, with operations on both integer and ordinal 
data types. Operations on bits and bit strings are simpli- 
fied by a complete set of Boolean and bit-field instruc- 
tions. 


The addressing modes are efficient and straightforward, 
while at the same time providing the necessary indexing 
and scaling modes required to address complex arrays 
and record structures. 
The large 4-gigabyte address 
space provides ample room to store programs and data. 3 
The availability of 32 addressing lines allows some ad- 
dress lines to be memory-mapped to control hardware 
functions. 


Extensive 
Fault Handling Capability 


To aid in program development, the i960 architecture 
defines a wide range of faults that the processor detects, 
including, arithmetic, faults, invalid operations, invalid 
operands and machine faults. When a fault is detected, 
the processor makes an implicit call to a fault handler 
routine, in a way similar to the interrupt mechanism 
described previously. The information 
collected for 
each fault allows program developers to quickly correct 
faulting code, and allows automatic 
recovery from 
some faults, 


Debugging and Monitoring 


To support debugging systems, the i960 architecture 
provides a mechanism for monitoring processor activity 
by means of trace events. When the processor detects a 
trace event, it signals a trace fault and calls a fault han- 
dler. Intel provides several tools that use this feature, 
including an in-circuit emulator (ICE) device. 


SUPPORT 
FOR ARCHITECTURAL 
EXENSIONS 
. 


The i960 architecture provides several features that en- 
able processors based on this architecture to be easily 
customized to meet the needs of specific embedded ap- 
plications, such as signal processing, array processing 
or graphics processing. 


3-77 


1960 KA/KB 
PROCESSOR 
PRODUCT 
OVERVIEW 


The most important of these features is the set of 32 
special function registers. These regisers provide a con- 
venient interface to circuitry in the processor or pins 
that can be connected to external hardware. They can 
be used to control timers, to perform operations on spe- 
cial data types or to perform I/O functions. The special 
function registers are similar to the global registers. 
They can be addressed by all of the register access in- 
structions. 


EXTENSIONS 
INCLUDED 
IN THE 
1960 K SERIES 
PROCESSORS 


The i960 K series of processors provides a complete 
implementation of the i960 architecture, plus several 
extensions to that architecture. These extensions fall 
into two categories: floating-point processing and inter- 
agent communication. 


On-Chip Floating Point 


The i960 KB processor provides a complete implemen- 
tation of the IEEE standard for binary floating-point 
arithmetic 
(IEEE 754-185). This implementation 
in- 
cludes a full set of floating-point operations, includ- 
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ing add, subtract, multiply, divide, trigonometric func- 
tions and logarithmic functions. These operations are 
performed on single precision (32-bit), double precision 
(64-bit) and extended precision (80-bit) real numbers. 


One of the benefits of this implementation is that the 
floating-point handling facilities are integrated into the 
normal instruction execution environment. Single and 
double precision floating-point values are stored in the 
same registers as non-floating point values. Four 80-bit 
floating-point registers are provided to hold extended- 
precision values. 


Interagent 
Communication 


All of the processors in the i960 K series provide an 
inter-agent communication (lAC) mechanism, allowing 
agents connected to the processor's bus to communi- 
cate with one another. This mechanism operates simi- 
larly to the interrupt mechanism, except that lAC mes- 
sages are passed through dedicated sections of memory. 
The sort of tasks handled with lAC messages are proc- 
essor reinitialization, stopping the processor, purging 
the instruction cache and forcing the processor to check 
pending interrupts. 
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80960KA 
EMBEDDED 32-BIT MICROPROCESSOR 


• 
High-Performance 
Embedded 
Architecture 
- 
25 MIPS Burst Execution at 25 MHz 
- 
9.4 MIPS· Sustained Execution at 
25 MHz 


• 
512-Byte On-Chip Instruction 
Cache 
- 
Direct Mapped 
- 
Parallel Load/Decode 
for Uncached 
Instructions 


• 
Multiple Register Sets 
- 
Sixteen Global 32-Blt Registers 
- 
Sixteen Local 32-Blt Registers 
- 
Four Local Register Sets Stored 
On-Chip 
- 
Register Scoreboardlng 


• 
4 Glgabyte, Linear Address Space 


• 
Pin Compatible with 80960KB 


• 
Built-In Interrupt Controller 
- 
31 Priority Levels, 256 Vectors 
- 
3.4 J.LsLatency 
@ 25 MHz 


• 
Easy to Use, High Bandwidth 32-Bit Bus 
- 
66.7 Mbytes/s 
Burst 
- 
Up to 16 Bytes Transferred 
per Burst 


• 
132-Lead Packages 
- 
Pin Grid Array (PGA) 
- 
Plastic Quad Flat-Pack (PQFP) 


The 80960KA 
is a member of Intel's i960~ 32-bit processor 
family, which is designed 
especially 
for embedded 
applications. 
It includes 
a 512-byte 
instruction 
cache 
and a built-in 
interrupt 
controller. 
The 80960KA 
has a 
large 
register 
set, multiple 
parallel 
execution 
units and a high-bandwidth 
burst 
bus. Using advanced 
RiSe 
technology, 
this high performance 
processor 
is capable 
of execution 
rates in excess of 9.4 million instructions 
per second. 
The 80960KA 
is well-suited 
for a wide range 
of applications 
including 
non-impact 
printers, 
1/0 
control 
and specialty 
instrumentation. 
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Figure 1. The 80960KA Processor's Highly Parallel Architecture 


'Relative 
to Digital Equipment 
Corporation's 
VAX-111780 
at 1 MIPS CVAX-11 is a trademark 
of Digital Equipment 
Corporation.) 


May 
1993 


Order 
Number: 
270775-005 
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supports 
multitasking, 
memory 
management, 
multi- 


processing 
and fault tolerance. 
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Figure 2. 80960KA 
Programming 
Environment 
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1.1 Key Performance Features 


The 80960 architecture is based on the most recent 
advances 
in microprocessor 
technology 
and 
is 
grounded in Intel's long experience in the design 
and manufacture of embedded microprocessors. 
Many features contribute to the 80960KA's excep- 
tional performance: 
1. Large Register Set. Having a large number of 
registers reduces the number of times that a 
processor needs to access memory. Modern 
compilers can take advantage of this feature to 
optimize execution speed. For maximum flexibili- 
ty, the' 80960KA provides thirty-two 32-bit regis- 
ters. (See Figure 2.) 


2. Fast Instruction 
Execution. 
Simple functions 
make up the bulk of instructions in most pro- 
grams so that execution speed can be improved 
by ensuring that these core instructions are exe- 
cuted as quickly as possible. The most frequently 
executed instructions such as register-register 
moves, 
add/subtract, 
logical 
operations 
and 
shifts execute in one to two cycles. (Table 1 con- 
tains a list of instructions.) 
3. Load/Store 
Architecture. 
One way to improve 
execution speed is to reduce the number of times 
that the processor must access memory to per- 
form an operation. As with other processors 
based on RISC technology, the 80960KA has a 
Load/Store architecture. As such, only the LOAD 
and STORE instructions reference memory; all 
other instructions operate on registers. This type 
of architecture simplifies instruction decoding 
and is used in combination with other techniques 
to increase parallelism. 


4. Simple Instruction 
Formats. All instructions in 
the 80960KA are 32 bits long and must be 
aligned on word 
boundaries. This alignment 
makes it possible to eliminate the instruction 
alignment stage in the pipeline. To simplify the 
instruction decoder, there are only five instruction 
formats; each instruction uses only one format. 
(See Figure 3.) 
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5. Overlapped Instruction 
Execution. Load oper- 
ations allow execution of subsequent instructions 
to continue before the data has been retumed 
from memory, so that these instructions can 
overlap the load. The 80960KA manages this 
process transparently to software through the 
use of a register scoreboard. Conditional instruc- 
tions also make use of a scoreboard so that sub- 
sequent unrelated instructions may be executed 
while the conditional instruction is pending. 


6. Integer Execution Optimization. 
When the re- 
sult of an arithmetic execution is used as an oper- 
and in a subsequent calculation, the value is sent 
immediately to its destination register. Yet at the 
same time, the value is put on a bypass path to 
the ALU, thereby saving the time that otherwise 
would be required to retrieve the value for the 
next operation. 


7. Bandwidth 
Optimizations. 
The 80960KA gets 
optimal use of its memory bus bandwidth be- 
cause the bus is tuned for use with the on-chip 3 
instruction cache: instruction cache line size 
matches the maximum burst size for instruction 
fetches. The 80960KB automatically fetches four 
words in a burst and stores them directly in the 
cache. Due to the size of the cache and the fact 
that it is continually filled in anticipation of need- 
ed instructions in the program flow, the 80960KA 
is relatively insensitive to memory wait states. 
The benefit is that the 80960KA delivers out- 
standing performance even with a low cost mem- 
ory system. 


8. Cache Bypass. If a cache miss occurs, the proc- 
essor fetches the needed instruction then sends 
it on to the instruction decoder at the same time it 
updates the cache. Thus, no extra time is spent 
to load and read the cache. 
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Table 
1. 80960KA 
Instruction 
Set 


Data Movement 
Arithmetic 
Logical 
Bit and Bit Field 


Load 
Add 
And 
Set Bit 
Store 
Subtract 
Not And 
Clear Bit 
Move 
Multiply 
And Not 
Not Bit 
Load Address 
Divide 
Or 
Check Bit 
Remainder 
Exclusive 
Or 
Alter Bit 
Modulo 
Not Or 
Scan For Bit 
Shift 
Or Not 
Scan Over Bit 
Exclusive 
Nor 
Extract 
Not 
Modify 
Nand 
Rotate 


Comparison 
Branch 
Call/Return 
Fault 


Compare 
Unconditional 
Branch 
Call 
Conditional 
Fauit 
Conditional 
Compare 
Conditional 
Branch 
Call Extended 
Synchronize 
Faults 
Compare 
and Increment 
Compare 
and Branch 
Call System 
Compare 
and Decrement 
Return 
Branch and Link 


Debug 
Miscellaneous 
Decimal 


Modify Trace Controls 
Atomic Add 
Decimal Move 
Mark 
Atomic 
Modify 
Decimal Add with Carry 
Force Mark 
Flush Local Registers 
Decimal Subtract 
with 
Modify Arithmetic 
Controls 
Carry 
Scan Byte for Equal 
Test Condition 
Code 
Modify Process Controls 


Synchronous 


Synchronous 
Load 
Synchronous 
Move 
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Figure 3. Instruction 
Formats 


1.1.1 
MEMORY 
SPACE 
AND ADDRESSING 
MODES 


The 80960KA 
offers 
a linear programming 
environ- 
ment so that all programs 
running on the processor 
are contained 
in a single address 
space. 
Maximum 
address 
space size is 4 Gigabytes 
(232 bytes). 


For ease of use the 80960KA 
has a small number of 
addressing 
modes, 
but includes 
all those 
necessary 
to ensure efficient 
execution 
of high·level 
languages 
such as C. Table 2 lists the modes. 


Table 2. Memory 
Addressing 
Modes 


• 
12·Bit Offset 


• 
32·Bit Offset 


• 
Register·lndirect 


• 
Register + 12-Bit Offset 


.• 
Register + 32-Bit Offset 


• 
Register + (Index-Register 
x Scale-Factor) 


• 
Register x Scale Factor + 32-Bit 
Displacement 


• 
Register 
+ (Index-Register 
x Scale-Factor) 
+ 32-Bit 
Displacement 


• 
Scale-Factor 
is 1, 2, 4, 8 or 16 


I 


1.1.2 
DATA TYPES 


The 80960KA 
recognizes 
the following 
data types: 


Numeric: 


• 
8-, 16-, 32- and 64-bit ordinals 


• 
8-, 16-, 32- and 64-bit integers 


Non-Numeric: 


• 
Bit 


• 
Bit Field 


• •Triple Word (96 bits) 


• 
Quad-Word 
(128 bits) 


1.1.3 
LARGE 
REGISTER 
SET 


The 80960KA 
programming 
environment 
includes 
a 
large number 
of registers. 
In fact, 
32 registers 
are 
available 
at any time. 
The availability 
of this 
many 
registers 
greatly reduces 
the number of memory 
ac- 
cesses 
required 
to perform 
algorithms, 
which 
leads 
to greater 
instruction 
processing 
speed. 


There 
are two 
types 
of general-purpose 
registers: 
local and global. The global registers 
consist 
of six- 
teen 32-bit registers 
(GO though 
G15). These 
regis- 
ters 
perform 
the 
same 
function 
as 
the 
general- 
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purpose 
registers 
provided 
in other 
popular 
micro- 
processors. 
The term 
global 
refers 
to the fact that 
these 
registers 
retain 
their 
contents 
across 
proce- 
dure calls. 


The 
local 
registers, 
on the other 
hand, 
are proce- 
dure specific. 
For each procedure 
call, the 80960KA 
allocates 
16 local registers 
(RO through 
R15). Each 
local register 
is 32 bits wide. 


1.1-4 
MULTIPLE 
REGISTER 
SETS 


10 further 
increase 
the efficiency 
of the register 
set, 
multiple 
sets 
of local 
registers 
are stored 
on-chip 
(see Figure 4). This cache holds up to four local reg- 
ister frames, 
which 
means 
that 
up to three 
proce- 
dure calls can be made without 
having to access the 
procedure 
stack resident 
in memory. 


Although 
programs 
may have procedure 
calls nest- 
ed many calls 
deep, 
a program 
typically 
oscillates 
back and forth between 
only two to three levels. As 
a result, 
with 
four 
stack 
frames 
in the 
cache, 
the 
probability 
of having 
a free frame 
available 
on the 
cache when a call is made is very high. In fact, runs 
of representative 
C-Ianguage 
programs 
show 
that 
80% of the calls are handled without 
needing to ac- 
cess memory. 


If four or more procedures 
are active and a new pro- 
cedure 
is called, 
the 80960KA 
moves the oldest 
lo- 
cal register 
set in the stack-frame 
cache to a proce- 
dure stack in memory to make room for a new set of 
registers. 
Global 
register 
G15 is the frame 
pointer 
(FP) to the procedure 
stack. 


Global 
registers 
are not exchanged 
on a procedure 
call, but retain their contents, 
making them available 
to all procedures 
for fast parameter 
passing. 


1.1.5 
INSTRUCTION 
CACHE 


To further 
reduce 
memory 
accesses, 
the 80960KA 


includes 
a 512-byte 
on-chip 
instruction 
cache. 
The 
instruction 
cache is based on the concept 
of locality 
of reference; 
most programs 
are not usually execut- 
ed in a steady stream but consist 
of many branches, 
loops and procedure 
calls that lead to jumping 
back 
and forth 
in the same small section 
of code. 
Thus, 
by maintaining 
a block 
of instructions 
in cache, 
the 
number 
of memory 
references 
required 
to read in- 
structions 
into the processor 
is greatly 
reduced. 
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To 
load 
the 
instruction 
cache, 
instructions 
are 
fetched 
in ts-byte 
blocks; up to four instructions 
can 
be fetched 
at one time. 
An efficient 
prefetch 
algo- 
rithm increases 
the probability 
that an instruction 
will 
already 
be in the cache when it is needed. 


Code 
for small 
loops 
often 
fits 
entirely 
within 
the 
cache, 
leading 
to 
a great 
increase 
in processing 
speed since further memory references 
might not be 
necessary 
until the program 
exits the loop. Similarly, 
when calling short procedures, 
the code for the call- 
ing procedure 
is likely to remain 
in the cache 
so it 
will be there on the procedure's 
retum. 


1.1.6 
REGISTER 
SCOREBOARDING 


The instruction 
decoder 
is optimized 
in several ways. 


One optimization 
method 
is the ability to overlap 
in- 
structions 
by using register 
scoreboarding. 


Register scoreboarding 
occurs when a LOAD moves 
a variable 
from 
memory 
into a register. 
When 
the 
instruction 
initiates, 
a scoreboard 
bit on the target 
register 
is set. Once the register 
is loaded, 
the bit is 
reset. In between, 
any reference 
to the register 
con- 
tents is accompanied 
by a test of the scoreboard 
bit 
to ensure 
that the load has completed 
before 
pro- 
cessing 
continues. 
Since 
the 
processor 
does 
not 
need to wait for the LOAD to complete, 
it can exe- 
cute 
additional 
instructions 
placed 
between 
the 
LOAD and the instruction 
that uses the register con- 


tents, as shown 
in the following 
example: 


Id data_2. r4 
Id data_2. r5 
Unrelated 
instruction 
Unrelated 
instruction 
add R4. R5. R6 


In essence, 
the two unrelated 
instructions 
between 
LOAD and ADD are executed 
"for free" 
(i.e., take no 
apparent 
time to execute) 
because 
they are execut- 


ed while 
the 
register 
is being 
loaded. 
Up to three 
load instructions 
can be pending 
at one time 
with 
three corresponding 
scoreboard 
bits set. By exploit- 
ing this feature, 
system 
programmers 
and compiler 
writers 
have a useful 
tool 
for optimizing 
execution 
speed. 
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Figure 4. Multiple Register 
Sets Are Stored On-Chip 


1.1.7 
HIGH BANDWIDTH 
LOCAL 
BUS 


The 80960KA CPU resides on a high-bandwidth ad- 
dress/data bus known as the local bus (L-Bus). The 
L-Bus provides a direct communication path be- 
tween the processor and the memory and 110 sub- 
system interfaces. The processor uses the L-Bus to 
fetch instructions, manipulate memory and respond 
to interrupts. L-Bus features include: 
• 32-bit multiplexed address/data path 
• Four-word burst capability which allows transfers 
from 1 byte to 16 bytes at a time 
• High 
bandwidth 
reads 
and 
writes 
with 
66.7 Mbytes/s burst (at 25 MHz) 


Table 3 defines L-bus signal names and functions; 
Table 4 defines other component-support signals 
such as interrupt lines. 


1.1.8 
INTERRUPT 
HANDLING 


The 80960KA can be interrupted in two ways: by the 
activation of one of four interrupt pins or by sending 
a message on the processor's data bus. 


The 80960KA is unusual in that it automatically han- 
dles interrupts on a priority basis and can keep track 
of 
pending 
interrupts 
through 
its 
on-chip 


I 


interrupt controller. Two of the interrupt pins can be 
conflgured to provide 8259A-style handshaking for 
expansion beyond four interrupt lines. 


1.1.9 
DEBUG 
FEATURES 


The 80960KA has built-in debug capabilities. There 
are two types of breakpoints and six trace modes. 
Debug features are controlled by two intemal 32-bit 
registers: the Process-Controls Word and the Trace- 
Controls Word. By. setting bits in these control 
words, a software debug monitor can closely control 
how the processor responds during program execu- 
tion. 


The 80960KA provides two hardware breakpoint 
registers on-chip which, by using a special com- 
mand, can be set to any value. When the instruction 
pointer matches either breakpoint register value, the 
breakpoint handling routine is automatically called. 


The 80960KA also provides software breakpoints 
through the use of two instructions: MARK and 
FMARK. These can be placed at any point in a pro- 
gram and cause the processor to halt execution at 
that point and call the breakpoint handling routine. 
The breakpoint mechanism is easy to use and pro- 
vides a powerful debugging tool. 


Tracing is available for instructions (single step exe- 
cution), calls and returns and branching. Each trace 
type 
may be enabled separately by a special 


• 
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debug 
instruction. 
In each case, the 80960KA 
exe- . 


cutes the instruction 
first and then calls a trace han- 
dling routine 
(usually part of a software 
debug moni- 
tor). 
Further 
program 
execution 
is halted 
until 
the 


routine completes, 
at which time execution 
resumes 
at the next instruction. 
The 80960KA's 
tracing mech- 
anisms, 
implemented 
completely 
in hardware, 
great- 
ly simplify 
the task of software 
test and debug. 


1.1.10 
FAULT 
DETECTION 


The 80960KA 
has an automatic 
mechanism 
to han- 
dle faults. 
Fault types 
include 
trace 
and arithmetic 
faults. 
When 
the processor 
detects 
a fault, 
it auto- 
matically 
calls the appropriate 
fault handling 
routine 
and saves the current instruction 
pointer and neces- 
sary 
state 
information 
to 
make 
efficient 
recovery 
possible. 
Like interrupt 
handling 
routines, 
fault han- 


dling routines 
are usually written 
to meet the needs 
of specific 
applications 
and 
are often 
included 
as 
part of the operating 
system 
or kernel. 


For each of the fault types, there are numerous 
sub- 
types that provide 
specific 
information 
about a fault. 


The fault handler can use this specific 
information 
to 
respond 
correctly 
to the fault. 
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1.1.11 
BUILT-IN 
TESTABILITY 


Upon reset, the 80960KA 
automatically 
conducts 
an 
exhaustive 
internal 
test of its major blocks 
of logic. 


Then, before 
executing 
its first instruction, 
it does a 
zero check 
sum on the first eight words 
in memory 
to ensure that the memory 
image was programmed 
correctly. 
If a problem 
is discovered 
at any point dur- 


ing the self-test, 
the 80960KA 
asserts 
its FAILURE 
pin and will not begin 
program 
execution. 
Self test 


takes approximately 
47,000 
cycles 
to complete. 


System 
manufacturers 
can use the 80960KA's 
self- 
test 
feature 
during 
incoming 
parts 
inspection. 
No 
special diagnostic 
programs 
need to be written. 
The 


test is both thorough 
and fast. The self-test 
capabili- 


ty helps ensure 
that defective 
parts are discovered 
before 
systems 
are shipped 
and, once 
in the field, 
the self-test 
makes 
it easier to distinguish 
between 
problems 
caused 
by processor 
failure and problems 
resulting 
from other causes. 


1.1.12 
CHMOS 


The 80960KA 
is fabricated 
using Intel's 
CHMOS 
IV 
(Complementary 
High Speed Metal Oxide Semicon- 
ductor) 
process. 
The 80960KA 
is currently 
available 
in 16, 20 and 25 MHz versions. 
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Table 3. 80960KA 
Pin Description: 
L-Bus Signals 


Name 
Type 
Description 


CLK2 
I 
SYSTEM 
CLOCK provides 
the fundamental 
timing for 80960KA 
systems. 
It is divided 
by two inside the 80960KA 
to generate 
the internal processor 
clock. 


LAD31:0 
I/O 
LOCAL ADDRESS/DATA 
BUS carries 32-bit physical 
addresses 
and data to and 
T.S. 
from memory. 
During an address (Ta) cycle, bits 2-31 
contain 
a physical word 
address 
(bits 0-1 
indicate SIZE; see below). During a data (Td) cycle, bits 0-31 
contain 
read or write data. These pins float to a high impedance 
state when not 
active. 


Bits 0-1 
comprise 
SIZE during aT a cycle. SIZE specifies 
burst transfer 
size in words. 


LAD1 
LADO 
0 
0 
1 Word 
0 
1 
2 Words 
1 
0 
3 Words 
1 
1 
4 Words 


ALE 
0 
ADDRESS 
LATCH ENABLE indicates 
the transfer 
of a physical address. ALE is 
T.S. 
asserted 
during a Ta cycle and deasserted 
before the beginning 
of the Td state. It is 
active LOW and floats to a high impedance 
state during a hold cycle (Th)' 


ADS 
0 
ADDRESS/DATA 
STATUS 
indicates 
an address state. ADS is asserted 
every Ta 
0.0. 
state and deasserted 
during the following 
Td state. For a burst transaction, 
ADS is 
asserted 
again every Td state where READY was asserted 
in the previous 
cycle. 


W/R 
0 
WRITE/READ 
specifies, 
during aT a cycle, whether 
the operation 
is a write or read. It 
0.0. 
is latched on-chip and remains valid during Td cycles. 


DT/R 
0 
DATA TRANSMIT/RECEIVE 
indicates 
the direction 
of data transfer 
to and from the 
0.0. 
L-Bus. It is low during Ta and Td cycles for a ~ad 
or interrupt 
acknowledgment; 
it is 
high during Ta and Td cycles for a write. DT /R never changes 
state when DEN is 
asserted. 


READY 
I 
READY indicates 
that data on LAD lines can be sampled 
or removed. 
If READY 
is not asserted 
during a Td cycle, the Td cycle is extended 
to the next cycle by 
inserting a wait state (Tw) and ADS is not asserted 
in the next cycle. 


LOCK 
I/O 
BUS LOCK prevents 
bus masters from gaining control 
of the L-Bus during Read/ 
0.0. 
Modify/Write 
(RMW) cycles. The processor 
or any bus agent may assert LOCK. 


At the start of a RMW operation, 
the processor 
examines 
the LOCK pin. If the pin is 
already asserted, 
the processor 
waits until it is not asserted. 
If the pin is not asserted, 


the processor 
asserts 
LOCK during the Ta cycle of the read transaction. 
The 
processor 
deasserts 
LOCK in the Ta cycle of the write transaction. 
During the time 
LOCK is asserted, 
a bus agent can perform a normal read or write but not a RMW 
operation. 


The processor 
also asserts LOCK during interrupt-acknowledge 
transactions. 
Do not 
leave LOCK unconnected. 
It must be pulled high for the processor 
to function 
properly. 


1/0 = Input/Output, 0 = Output. I = Input. 0.0. = Open Drain, T.S. = Three-State 


I 
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Table 3. 80960KA 
Pin Description: 
L·Bus Signals (Continued) 


Name 
Type 
Description 


BE3:0 
0 
BYTE ENABLE LINE~ecify 
the data bytes (up to four) on the bus which are used in 
0.0. 
the current bus cycle. BE3 corresponds 
to LAD31 :24; BEO corresponds 
to LAD7:0. 


The byte enables 
are provided 
in advance 
of data: 


• 
Byte enables 
asserted 
during Ta specify the bytes of the first data word. 


• 
Byte enables 
asserted 
during Td specify the bytes of the next data word, if any (the 


word to be transmitted 
following 
the next assertion 
of READY). 


Byte enables that occur during Td cycles that precede 
the last assertion 
of READY are 
undefined. 
Byte enables are latched on-chip and remain constant 
from one Td cycle to 
the next when READY is not asserted. 


For reads, byte enables 
specify the byte(s) that the processor 
will actually 
use. L-Bus 
~nts 
are required to assert only adjacent 
byte enables 
(e.g., asserting 
just BEO and 


BE2 is not permitted) 
and are required to assert at least one byte enable. Address 
bits 
Aa and A1 can be decoded 
externally 
from the byte enables. 


HOLD 
I 
HOLD: A request from an external 
bus master to acquire the bus. When the processor 
receives 
HOLD and grants bus control to another 
master, it floats its three- state bus 
lines and open-drain 
control 
lines, asserts HLDA and enters the T h state. When HOLD 


deasserts, 
the processor 
deasserts 
HLDA and enters the Ti or Ta state. 


HLDA 
0 
HOLD ACKNOWLEDGE: 
Notifies an external bus master that the processor 
has 
T.S. 
relinquished 
control 
of the bus. 


CACHE 
0 
CACHE indicates 
when an access is cacheable 
during a Ta cycle. It is not asserted 
T.S. 
during any synchronous 
access, such as a synchronous 
load or move instruction 
used 


for sending an lAC message: 
The CACHE signal floats to a high impedance 
state 
when the processor 
is idle. 


1/0 = InputlOutput, 0 = Output, I = Input, 0.0. = Open Drain, T.S. = Three-State 


Table 4. 80960KA 
Pin Description: 
Support 
Signals 


Name 
Type 
Description 


BADAC 
I 
BAD ACCESS, 
if asserted 
in the cycle following 
the one in which the last. READY of a 


transaction 
is asserted, 
indicates 
an un recoverable 
error occurred 
on the current 
bus 
transaction 
or a synchronous 
load/store 
instruction 
has not been acknowledged. 


During system reset the BADAC signal is interpreted 
differently. 
If the signal is high, it 


indicates 
that this processor 
will perform 
system initialization. 
If it is low, another 
processor 
in the system will perform 
system initialization 
instead. 


RESET 
I 
RESET clears the processor's 
intemallogic 
and causes it to reinitialize. 


During RESET assertion, 
the input pins are ignored (except for BADAC and TAG/INTo), 


the three-state 
output pins are placed in a high impedance 
state and other output pins 


are placed in their non-asserted 
states. 


RESET must be asserted 
for at least 41 CLK2 cycles for a predictable 
RESET. The 
HIGH to LOW transition 
of RESET should occur after the rising edge of both CLK2 and 
the extemal 
bus clock and before the next rising edge of CLK2. 


1/0 = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain. T.S. = Three-State 
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Table 
4. 80960KA 
Pin Description: 
Support 
Signals 
(Continued) 


Name 
Type 
Description 


FAILURE 
0 
INITIALIZATION 
FAILURE 
indicates 
that the processor 
did not initialize correctly. 


0.0. 
After RESET deasserts 
and before the first bus transaction 
begins, FAILURE 
asserts while the processor 
performs 
a self-test. 
If the self-test 
completes 
successfully, 
then FAILURE 
deasserts. 
The processor 
then performs 
a zero 
checksum 
on the first eight words of memory. 
If it fails, FAILURE 
asserts for a 
second time and remains asserted. 
If it passes, system initialization 
continues 
and 


_ FAILURE 
remains deasserted. 


iAC/INTo 
I 
INTERAGENT 
COMMUNICATION 
REQUEST/INTERRUPT 
0 indicates 
an lAC 
message 
or an interrupt 
is pending. The bus interrupt 
control 
register determines 
how the signal is interpreted. 
To signal an interrupt 
or lAC request in a synchronous 
system, this pin-as 
well as the other interrupt 
pins-must 
be enabled 
by being 
deasserted 
for at least one bus cycle and then asserted 
for at least one additional 
bus cycle. In an asynchronous 
system the pin must remain deasserted 
for at least 
two bus cycles and then asserted 
for at least two more bus cycles. 


During system reset, this signal must be in the logic high condition 
to enable 
normal processor 
operation. 
The logic low condition 
is reserved. 


INT1 
I 
INTERRUPT 
1, like INTo, provides 
direct interrupt 
signaling. 


INT2/INTR 
I 
INTERRUPT 
2/INTERRUPT 
REQUEST: 
The interrupt 
control 
register determines 
how this pin is interpreted. 
If INT 2, it has the same interpretation 
as the INTo and 
INT1 pins. If INTR, it is used to receive an interrupt 
request from an external 
interrupt controller. 


INT3/INTA 
I/O 
INTERRUPT 
3/INTERRUPT 
ACKNOWLEDGE: 
The bus interrupt 
control 
register 
0.0. 
determines 
how this pin is interpreted. 
If INT 3, it has the same interpretation 
as the 
INTo, INT 1 and INT 2 pins. If INTA, it is used as an output to control 
interrupt- 
acknowledge 
transactions. 
The INTA output is latched on-chip and remains valid 
during Td cycles; as an output, it is open-drain. 


N.C. 
N/A 
NOT CONNECTED 
indicates 
pins should not be connected. 
Never connect 
any pin 
marked N.C. as these pins may be reserved 
for factory use. 


1/0 = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.S. = Three-State 


2.0 
ELECTRICAL SPECIFICATIONS 
2.2 
Power Decoupling 
Recommendations 


2.1 
Power and Grounding 


The 80960KA 
is implemented 
in CHMOS 
IV technol- 
ogy and therefore 
has modest 
power 
requirements. 


Its high clock 
frequency 
and numerous 
output 
buff- 
ers (address/data, 
control, 
error and arbitration 
sig- 
nals) 
can 
cause 
power 
surges 
as multiple 
output 
buffers 
simultaneously 
drive 
new signal 
levels. 
For 
clean 
on-chip 
power 
distribution, 
vcc and VSS pins 
separately 
feed the device's 
functional 
units. Power 
and 
ground 
connections 
must 
be 
made 
to 
all 
80960KA 
power 
and 
ground 
pins. 
On 
the 
circuit 
board, all vcc pins must be strapped 
closely togeth- 
er, preferably 
on a power plane; all Vss pins should 
be strapped 
together, 
preferably 
on a ground plane. 


Place 
a liberal 
amount 
of decoupling 
capacitance 
near the 80960KA. 
When driving the L-bus the proc- 
essor can cause transient 
power surges, particularly 
when connected 
to a large capacitive 
load. 


Low 
inductance 
capacitors 
and 
interconnects 
are 
recommended 
for best high frequency 
electrical 
per- 
formance. 
Inductance 
is 
reduced 
by 
shortening 
board traces between 
the processor 
and decoupling 
capacitors 
as much as possible. 


L 


• 
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2.3 
Connection Recommendations 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. In particular, if 
one or more interrupt lines are not used, they should 
be pulled up. No inputs should ever be left floating. 


All open-drain outputs require a pullup device. While 
in most cases a simple pullup resistor is adequate, a 
network of pullup and pulldown resistors biased to a 
valid VIH (>3.0V) and terminated in the characteris- 
tic impedance of the circuit board is recommended 
to limit noise and AC power consumption. Figure 5 
and Figure 6 show recommended values for the re- 
sistor network for low and high current drive, assum- 
ing a characteristic impedance of 1000. Terminating 
output signals in this fashion limits signal swing and 
reduces AC power consumption. 


NOTE: 
Do not connect external logic to pins marked N.C. 


Open-Drain 
Output 
vcc 


220n 


330n 


270775-5 


Low Drive 
Network: 
VOH = 3.0V 
IOL = 20.7 mA 


Figure 5. Connection 
Recommendations 
for 
Low Current 
Drive Network 
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Open-Drain 
Output 
vcc 


180n 


390n 


270775-6 


High Drive 
Network: 
VOH = 3.4V 
IOL = 25.3 mA 


Figure 6. Connection 
Recommendations 
for 
High Current 
Drive Network 


2.4 
Characteristic Curves 


Figure 7 shows typical supply current requirements 
over the operating temperature range of the proces- 
sor at supply voltage (VcC>of 5V. Figure B and Fig- 
ure 9 show the typical power supply current (lcC> 
that the B0960KA requires at various operating fre- 
quencies when measured at three input voltage 
(VcC>levels and two temperatures. 


For a given output current (Iou the curve in Figure 
10 shows the worst case output low voltage (Vou. 
Figure 11 shows the typical capacitive derating 
curve for the B0960KA measured from 1.5V on the 
system clock (ClK) to 1.5V on the falling edge and 
1.5V on the rising edge of the LoBusaddress/data 
(LAD) signals. 
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Figure 7. Typical 
Supply Current 
vs. Case Temperature 
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Figure 8. Typical Current 
vs. Frequency 
(Room Temp) 
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Figure 9. Typical Current vs. Frequency (Hot Temp) 
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Figure 10. Worst-Case Voltage vs. Output 
Current on Open-Drain Pins 
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Figure 11. Capacitive Derating Curve 
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2.5 
Test Load ClrculJ 


Figure 12 illustrates the load circuit used to test the 
80960KA's three-state pins; Figure 13 shows the 
load circuit used to test the open drain outputs. The 
open drain test uses an active load circuit in the form 
of a matched diode bridge. Since the open-drain out- 
puts sink current, only the IOL legs of the bridge are 
necessary and the IOH legs are not used. When the 
80960KA driver under test is turned off, the output 
pin is pulled up to VREF (Le., VOH). Diode 01 is 
turned off and the IOL current source flows through 
diode 02. 


When the 80960KA open-drain driver under test is 
on, diode 01 is also on and the voltage on the pin 
being tested drops to VOL. Diode 02 turns off and 
IOL flows through diode 01. 


I 


Three-State 
Output 
O~--l 


c;. 


~ 
270775-12 


NOTE: 
CL = 50 pF for all signals 


Figure 12. Test Load Circuit for 
Three-State 
Output Pins 


Open-Drain 
Output 


0, f 
°2 
0 
c;.I 
I~ 
.1 
0 


~ 


NOTES: 
IOL Tested 
at 25 mA 


VREF = vcc 
D1 and D2 are matched 
CL = 50 pF for all signals 


Figure 13. Test Load Clrcultfor 
Open-Drain 
Output Pins 


80960KA 


270775-13 
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2.6 
Absolute Maximum Ratings 
NOTICE: This is a production data sheet. The specifi- 
cations are subject to change without notice. 


Operating 
Temperature 


. 
PGA 
O·C to + S5·C Case 
PQFP 
O·C to + 1OO·CCase 


Storage Temperature 
-65·C 
to + 150·C 


Voltage on Any Pin 
-0.5V 
to VCC + 0.5V 


Power Dissipation 
2.5W (25 MHz) 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause 
permanent 
damage. 
These are stress ratings 
only. Operation 
beyond 
the 
"Operating 
Conditions" 
is not recommended 
and ex- 
tended 
exposure beyond 
the 
"Operating 
Conditions" 


may affect device reliability. 


2.7 
DC Characteristics 


PGA: 
S0960KA 
(16 MHz) TCASE = O·C to + S5·C, vcc = 5V ± 10% 


S0960KA 
(20 and 25 MHz) TCASE = O·C to +S5·C, 
vcc = 5V ±5% 


PQFP: 
S0960KA 
(16 MHz) TCASE = O·C to + 100·C, VCC = 5V ± 10% 


S0960KA 
(20 and 25 MHz) TCASE = O·C to + 100·C, vcc = 5V ± 5% 


Table 5. DC Characteristics 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Vll 
Input Low Voltage 
-0.3 
+O.S 
V 


VIH 
Input High Voltage 
2.0 
vcc + 0.3 
V 


VCl 
CLK2 Input Low Voltage 
-0.3 
+O.S 
V 


VCH 
CLK2 Input High Voltage 
0.55Vcc 
vcc + 0.3 
V 


VOl 
Output Low Voltage 
0.45 
V 
(1,2) 


VOH 
Output High Voltage 
2.4 
V 
(3,4) 


Icc 
Power Supply Current: 


16 MHz 
315 
mA 
(5) 
20MHz 
360 
mA 
(5) 
25MHz 
420 
mA 
(5) 


III 
Input Leakage Current 
±15 
/-LA 
o ~ VIN S vcc 


ILO 
Output Leakage Current 
±15 
/-LA 
0.45 s vo s VCC 


CIN 
Input Capacitance 
10 
pF 
fc = 1 MHz(6) 


Co 
Output Capacitance 
12 
pF 
fc = 1 MHz(6) 


CClK 
Clock Capacitance 
10 
pF 
fc = 1 MHz(6) 


NOTES: 
1. For three-state outputs, this parameter is measured at: 
Addresss/Data 
04.0 mA 
Controls 
5.0 mA 
2. For open-drain outputs .............................................................•.................... 
25 mA 
3. This parameter is measured at: 


Address/Data .....................................•..........................•....................... 
-1.0 
mA 
Controls 
: ...........................•....................... 
-0.9 mA 
ALE 
-5.0 
mA 
4. Not measured on open-drain outputs. 
5. Measured at worst case frequency, vcc and temperature, with device operating and outputs loaded to the test conditions 
in Figures 12 and 13. Figure 7, Figure 8 and Figure 9 indicate typical values. 


6. Input. output and clock capacitance are not tested. 
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This section describes the AC specifications for the 
80960KA pins. All input and output timings are spec- 
ified relative to the 1.5V level of the rising edge of 
CLK2. For output timings the specifications refer to 
the time it takes the signal to reach 1.5V. 


For input timings the specifications refer to the time 
at which the signal reaches (for input setup) or 
leaves (for hold time) the TTL levels of LOW (0.8V) 
or HIGH (2.0V). All AC testing should be done with 
input voltages of 0.4V and 2.4V. except for the clock 
(CLK2). which should be tested with input voltages 
of 0.45V and 0.55 vcc- 


2.8 
AC Specifications 


EDGE 


CLK2 


OUTPUTS: 


LAD31 
:0 


ADS 
will. 
DEN 


BE3:0 


HLDA 


CACHE 


LOCK. 
INTA 


DT/R 


INPUTS: 


LAD31:0 


BAOAC 


IAC/iNTO. 
INT 1 


INT2/INTR. 
INB 


HOLD 


LOCK 


READY 


A 
c 


I 


c 
A 
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Figure 14. Drive Levels and Timing Relationships 
for 80960KA 
Signals 
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2.8.1 
AC SPECIFICATION 
TABLES 


Table 6. 80960KA 
AC Characteristics 
(16 MHz) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK 


T1 
Processor 
Clock Period (CLK2) 
31.25 
125 
ns 
VIN = 1.5V 


T2 
Processor 
Clock Low Time (CLK2) 
8 
ns 
VIL = 10% Point = 1.2V 


T3 
Processor 
Clock High Time (CLK2) 
8 
ns 
VIH = 90% Point = 0.1V + 0.5 vcc 


T4 
Processor 
Clock Fall Time (CLK2) 
10 
ns 
VIN = 90% Point to 10% Point(1) 


T5 
Processor 
Clock Rise Time (CLK2) 
10 
ns 
VIN = 10% Point to 90% Point(1) 


SYNCHRONOUS 
OUTPUTS 


T6 
Output Valid Delay 
2 
25 
ns 


T6H 
HLDA Output Valid Delay 
4 
28 
ns 


T7 
ALE Width 
15 
ns 


Ta 
ALE Output Valid Delay 
2 
18 
ns 


T9 
Output Float Delay 
2 
20 
ns 
(2) 


T9H 
HLDA Output Float Delay 
4 
20 
ns 
(2) 


SYNCHRONOUS 
INPUTS 


T10 
Input Setup 1 
3 
ns 
(3) 


T11 
Input Hold 
5 
ns 
(3) 


T11H 
HOLD Input Hold 
4 
ns 
(3) 


T12 
Input Setup 2 
8 
ns 
(3) 


T13 
Setup to ALE Inactive 
10 
ns 


T14 
Hold after ALE Inactive 
8 
ns 


T15 
Reset Hold 
3 
ns 
(3) 


T16 
Reset Setup 
5 
ns 
(3) 


T17 
Reset Width 
1281 
ns 
41 CLK2 Periods Minimum 


NOTES: 
1. Clock rise and tall times are not tested. 
2. A float condition occurs when the maximum output current becomes less than ILO.Float delay is not tested; however, it 


should not be longer than the valid delay. 


3. LAD31:0, BADAC, HOLD,LOCK and READY are synchronous inputs. iAC/lNTo, INT1,INT2/1NTRand INT3may be 
synchronous or asynchronous. 


3-96 
I 


intel~ 
80960KA 


Table 7. 80960KA 
AC Characteristics 
(20 MHz) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK 


T1 
Processor 
Clock Period (CLK2) 
25 
125 
ns 
VIN = 1.5V 


T2 
Processor 
Clock Low Time (CLK2) 
6 
ns 
VIL = 10% Point = 1.2V 


T3 
Processor 
Clock High Time (CLK2) 
6 
ns 
VIH = 90% Point = 0.1V + 0.5 vcc 


T4 
Processor 
Clock Fall Time (CLK2) 
10 
ns 
VIN = 90% Point to 10% Point(1) 


T5 
Processor 
Clock Rise Time (CLK2) 
10 
ns 
VIN = 10% Point to 90% Point(1) 


SYNCHRONOUS 
OUTPUTS 


T6 
Output Valid Delay 
2 
20 
ns 


T6H 
HLDA Output Valid Delay 
4 
23 
ns 


T7 
ALE Width 
12 
ns 


Ta 
ALE Output Valid Delay 
2 
18 
ns 


Tg 
Output Float Delay 
2 
20 
ns 
(2) 


TgH 
HLDA Output Float Delay 
4 
20 
ns 
(2) 


SYNCHRONOUS 
INPUTS 


T10 
Input Setup 1 
3 
ns 
(3) 


T11 
Input Hold 
5 
ns 
(3) 


T11H 
HOLD Input Hold 
4 
ns 
(3) 


T12 
Input Setup 2 
7 
ns 
(3) 


T13 
Setup to ALE Inactive 
10 
ns 


T14 
Hold after ALE Inactive 
8 
ns 


T15 
Reset Hold 
3 
ns 


T15 
Reset Setup 
5 
ns 


Tl7 
Reset Width 


I 
1025 
ns 
41 CLK2 Periods Minimum 


NOTES: 
1. Clock 
rise and fall times 
are not tested. 
2. A float condition 
occurs 
when 
the maximum 
output 
current 
becomes 
less than ILO.Float delay is not tested; 
however, 
it 
should 
not be longer 
than the valid delay. 
3. LAD31:0, BADAC, HOLD, I:OCK and READY are synchronous 
inputs. lAC/INTo, INT1, INT2/INTR and INT3 may be 
synchronous 
or asynchroncous. 
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T6H 
HLDA Output Valid Delay 
4 
23 
ns 


T7 
ALE Width 
12 
ns 


Ta 
ALE Output Valid Delay 
2 
18 
ns 


Tg 
Output Float Delay 
2 
18 
ns 
(2) 


TgH 
HLDA Output Float Delay 
4 
20 
ns 
(2) 


SYNCHRONOUS 
INPUTS 


T10 
Input Setup 1 
3 
ns 
(3) 


Tll 
Input Hold 
5 
ns 
(3) 


TllH 
HOLD Input Hold 
4 
ns 


T12 
Input Setup 2 
7 
ns 


T13 
Setup to ALE Inactive 
8 
ns 


T14 
Hold after ALE Inactive 
8 
ns 


T15 
Reset Hold 
3 
ns 


T16 
Reset Setup 
5 
ns 


T17 
Reset Width 
820 
ns 
41 CLK2 Periods Minimum 


NOTES: 
1. Clock 
rise and tall times 
are not tested. 
2. A float condition 
occurs 
when the maximum 
output 
current 
becomes 
less than 
ILO' Float delay is not tested; 
however, 
it 
should 
not be longer 
than the valid delay. 


3. LAD31:0, 
BADAC, 
HOLD, met< and READY 
are synchronous 
inputs. iAC/INTo, 
INT1, INT2/INTR 
and INT3 may be 
synchronous 
or asynchronous. 
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Figure 15. Processor 
Clock Pulse (CLK2) 
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Figure 16. RESET Signal Timing 
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3.0 
MECHANICAL DATA 


3.1 Packaging 


The 80960KA 
is available 
in two package 
types: 


'. 
132-lead 
ceramic 
pin-grid 
array 
(PGA). 
Pins are 
arranged 
0.100 inch (2.54 mm) center-to-center, 
in a 14 by 14 matrix, three rows around 
(see Fig- 
ure 17). 


• 
132-lead 
plastic 
quad 
flat 
pack 
(PQFP). 
This 
package 
uses fine-pitch 
gull wing leads arranged 
in a single row along the package 
perimeter 
with 
0.025 inch (0.64 mm) spacing 
(see Figure 20). 


Dimensions 
for both package 
types are given in the 
Intel Packaging 
handbook 
(Order 
#240800). 


3.1.1 
PIN ASSIGNMENT 


The PGA and PQFP have different 
pin assignments. 
Figure 18 shows the view from the PGA bottom 
(pins 
facing up) and Figure 19 shows a view from the PGA 
top (pins facing 
down). 
Figure 20 shows 
the PQFP 
package; 
Figure 21 shows the PQFP pinout with sig- 
nal names. 
Notice that the pins are numbered 
in or- 
der from 
1 to 132 around 
the 
package 
perimeter. 


Table 9 and Table 
10 list the function 
of each PGA 
pin; Table 
11 and Table 
12 list the function 
of each 
PQFP pin. 


A 
BeD 
E 
F 
G 
H 
J 
K 
L 
~ 
N 
P 
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Figure 
17. 132·Lead 
Pln-Grld 
Array 
(PGA) Package 
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2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 


P 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
p 


VCC 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
VSS 
VCC 


N 
0 
0 
O· 0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
N 


VSS 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 


t.l 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
t.l 


N.C. 
N.C. 
VSS 
VSS 
VCC 
N.C. 
N.C. 
N.C. 
N.C. 
VSS 
VCC 
N.C. 
VCC 
N.C. 


L 
0 
0 
0 
0 
0 
0 
L 


DEN 
N.C. 
VCC 
VSS 
N.C. 
N.C. 


K 
0 
0 
0 
0 
0 
0 
K 
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N.C. 
N.C. 
0 
0 
0 
0 
0 
0 
DT/R 
BE2 
VSS 
N.C. 
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N.C. 
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0 
0 
0 
0 
0 
0 
H 
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BEe 
LOCK 
N.C. 
N.C. 
N.C. 


G 
0 
0 
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0 
0 
0 
G • 
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READY 
BE, 
N.C. 
N.C. 
N.C. 
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0 
0 
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N.C. 
N.C. 
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0 
0 
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LAD26 
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VSS 
N.C. 
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0 
0 
° 
ALE 
ADS 
HLDA 
Vcc 
N.C. 
N.C. 


C 
0 
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0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
C 


HOLD 
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BADAC 
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LAD20 
LA143 
LAOs 
LAD3 
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VSS 
INT3 
INTl 
INTo 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
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0 
0 
B 
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LAD21 
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LA140 
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CLK2 
LADo 
RESET 
VSS 


A 
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0 
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0 
0 
0 
0 
0 
0 
0 
A 
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Figure 18. 80960KA 
PGA Plnout-Vlew 
from Bottom 
(Pins Facing Up) 


l 


3-101 


80960KA 
infel~ 


14 
13 
12 
11 
10 


0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0'0 
0 
0 


VCC 
VSS 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
VCC 


N 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
N 


N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
H.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
VSS 


". 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
". 
N.C. 
N.C. 
N.C. 
VCC 
VSS 
N.C. 
N.C. 
N.C. 
N.C. 
VCC 
VSS 
VSS 
VCC 
N.C. 


0 
0 
0 
0 
0 
0 


N.C. 
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Figure 19. 80960KA 
PGA Plnout-Vlew 
from Top (Pins Facing Down) 
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Figure 20. 80960KA 
132-Lead 
Plastic Quad Flat-Pack 
(PQFP) Package 
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Figure 21. PQFP Plnout-Vlew 
from Top 


I 


3-103 


80960KA 


3.2 Pinout 


Pin 
Signal 


A1 
Vcc 


A2 
Vss 


A3 
LAD19 


A4 
LAD17 


A5 
LAD16 


A6 
LAD14 


A7 
LAD 11 


A8 
LAD9 


A9 
LAD7 


A10 
LAD5 


A11 
LAD4 


A12 
LAD1 


A13 
INT2/INTR 


A14 
Vcc 


81 
LAD23 


82 
LAD24 


83 
LAD22 


B4 
LAD21 


B5 
LAD18 


86 
LAD15 


B7 
LAD12 


B8 
LAD10 


B9 
LAD6 


B10 
LAD2 


B11 
CLK2 


B12 
LADo 


B13 
RESET 


B14 
Vss 


C1 
HOLD 


C2 
LAD25 


C3 
BADAC 


C4 
Vcc 


C5 
Vss 


Table 9.80960KA 
PGA 
Pinout-In Pin Order 


Pin 
Signal 


C6 
LAD20 


C7 
LAD13 


C8 
LAD8 


C9 
LAD3 


C10 
Vcc 


C11 
Vss 


C12 
INT3/INTA 


C13 
INT1 


C14 
IACIINTo 


01 
ALE 


02 
ADS 


03 
HLDA 


012 
Vcc 


013 
N.C. 


014 
N.C. 


E1 
LAD28 


E2 
LAD26 


E3 
LAD27 


E12 
N.C. 


E13 
Vss 


E14 
N.C. 


F1 
LAD29 


F2 
LAD31 


F3 
CACHE 


F12 
N.C. 


F13 
N.C. 


F14 
N.C. 


G1 
LAD30 


G2 
READY 


G3 
BE1 


G12 
N.C. 


G13 
N.C. 


G14 
.N.C. 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 


3-104 


Pin 
Signal 


H1 
W/R 


H2 
BEo 


H3 
LOCK 


H12 
N.C. 


H13 
N.C. 


H14 
N.C. 


J1 
DT/R 


J2 
8~2 


J3 
Vss 


J12 
N.C. 


J13 
N.C. 


J14 
N.C. 


K1 
BE3 


K2 
FAILURE 


K3 
Vss 


K12 
Vcc 


K13 
N.C. 


K14 
N.C. 


L1 
DEN 


L2 
N.C. 


L3 
Vcc 


L12 
Vss 


L13 
N.C. 


L14 
N.C. 


M1 
N.C. 


M2 
Vcc 


M3 
Vss 


M4 
Vss 


M5 
Vcc 


M6 
N.C. 


M7 
N.C. 


M8 
N.C. 


M9 
N.C. 


Pin 
Signal 


M10 
Vss 


M11 
Vcc 


M12 
N.C. 


M13 
N.C. 


M14 
N.C. 


N1 
Vss 


N2 
N.C. 


N3 
N.C. 


N4 
N.C. 


N5 
N.C. 


N6 
N.C. 


N7 
N.C. 


N8 
N.C. 


N9 
N.C. 


N10 
N.C. 


N11 
N.C. 


N12 
N.C. 


N13 
N.C. 


N14 
N.C. 


P1 
Vcc 


P2 
N.C. 


P3 
N.C. 


P4 
N.C. 


P5 
N.C. 


P6 
N.C. 


P7 
N.C. 


P8 
N.C. 


P9 
N.C. 


P10 
N.C. 


P11 
N.C. 


P12 
N.C. 


P13 
Vss 


P14 
Vcc 


I 


intel~ 


Signal 
Pin 


ADS 
02 


ALE 
01 


BADAC 
C3 


BEo 
H2 


BE1 
G3 


BE2 
J2 


BE3 
K1 


CACHE 
F3 


CLK2 
B11 


DEN 
L1 


DT/R 
J1 


FAILURE 
K2 


HLDA 
03 


HOLD 
C1 


lAC/INTo 
C14 


INT1 
C13 


INT211NTR 
A13 


INT3/INTA 
C12 


LADo 
B12 


LAD1 
A12 


LAD2 
B10 


LAD3 
C9 


LAD4 
A11 


LAD5 
A10 


LAD6 
B9 


LAD7 
A9 


LAD8 
ca 


LAD9 
A8 


LAD10 
B8 


LAD11 
A7 


LAD12 
B7 


LAD13 
C7 


LAD14 
A6 


Table 10.80960KA 
PGA 
Plnout-In SignalOrder 


Signal 
Pin 


LAD15 
B6 


LAD16 
A5 


LAD17 
A4 


LAD18 
B5 


LAD19 
A3 


LAD20 
C6 


LAD21 
B4 


LAD22 
B3 


LAD23 
B1 


LAD24 
B2 


LAD25 
C2 


LAD26 
E2 


LAD27 
E3 


LAD28 
E1 


LAD29 
F1 


LAD30 
G1 


LAD31 
F2 


LOCK 
H3 


N.C. 
013 


N.C. 
014 


N.C. 
E12 


N.C. 
E14 


N.C. 
F12 


N.C. 
F13 


N.C. 
F14 


N.C. 
G12 


N.C. 
G13 


N.C. 
G14 


N.C. 
H12 


N.C. 
H13 


N.C. 
H14 


N.C. 
J12 


N.C. 
J13 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 


I 


Signal 
Pin 


N.C. 
J14 


N.C. 
K13 


N.C. 
K14 


N.C. 
L13 


N.C. 
L14 


N.C. 
M1 


N.C. 
M6 


N.C. 
M7 


N.C. 
M8 


N.C. 
M9 


N.C. 
M12 


N.C. 
M13 


N.C. 
M14 


N.C. 
N2 


N.C. 
N3 


N.C. 
N4 


N.C. 
N5 


N.C. 
N6 


N.C. 
N7 


N.C. 
N8 


N.C. 
N9 


N.C. 
N10 


N.C. 
N11 


N.C. 
N12 


N.C. 
N13 


N.C. 
N14 


N.C. 
P2 


N.C. 
P3 


N.C. 
P4 


N.C. 
P5 


N.C. 
P6 


N.C. 
P7 


N.C. 
P8 


80960KA 


Signal 
Pin 


N.C. 
P9 


N.C. 
P10 


N.C. 
P11 


.N.C. 
P12 


N.C. 
L2 


READY 
G2 


RESET 
B13 


Vcc 
A1 


Vcc 
A14 


VCC 
C4 


Vcc 
C10 


Vcc 
012 


Vcc 
K12 


Vcc 
L3 


Vcc 
M2 


Vcc 
M5 


Vcc 
M11 


Vcc 
P1 


Vcc 
P14 


Vss 
A2 


VSS 
B14 


Vss 
C5 


Vss 
C11 


Vss 
E11 


Vss 
J3 


Vss 
K3 


Vss 
L12 


Vss 
M3 


Vss 
M4 


Vss 
M10 


Vss 
N1 


Vss 
P13 


W/R 
H1 


• 


3-105 


80960KA 


Pin 
Signal 


1 
HLDA 


2 
ALE 


3 
~D26 


4 
LAD2? • 
5 
LAD2B 


6 
LAD29 


7 
LAD30 


8 
LAD3l 


9 
Vss 


10 
GAGHE 


11 
W/R 


12 
READY 


13 
DT/R 


14 
BEo 


15 
BEl 


16 
BE2 


17 
BE3 


18 
FAILURE 


19 
VSS 


20 
LOGK 


21 
DEN 


22 
VSS 


23 
VSS 


24 
N.G. 


25 
N.G. 


26 
Vss 


27 
VSS 


28 
N.G. 


29 
VCC 


30 
VCC 


31 
N.G. 


32 
VSS 


33 
VSS 


Table 11. 80960KA PQFP Plnout-In 
Pin Order 


Pin 
Signal 


34 
N.G. 


35 
Vcc 


36 
Vcc 


37 
N.G. 


38 
N.G. 


39 . 
N.G. 


40 
N.G. 


41 
Vcc 


42 
Vss 


43 
N.G. 


44 
N.G. 


45 
N.G. 


46 
N.G. 


47 
N.G. 


48 
N.G. 


49 
N.G. 


50 
N.G. 


51 
N.G. 


52 
Vss 


53 
Vss 


54 
N.G. 


55 
Vcc 


56 
Vcc 


57 
Vss 


58 
N.G. 


59 
N.G. 


60 
N.G. 


61 
N.G. 


62 
N.G. 


63 
N.G. 


64 
N.G. 


65 
N.G. 


66 
N.G. 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 


3-106 


Pin 
Signal 


67 
VSS 


68 
Vss 


69 
N.G. 


70 
VCC 


71 
VCC 


72 
N.G. 


73 
VSS 


74 
VCC 


75 
N.G. 


76 
N.G. 


77 
N.G. 


78 
N.G. 


79 
VSS 


80 
VSS 


81 
N.G. 


82 
VCC 


83 
VCC 


84 
VSS 


85 
IAG/INTo 


86 
INTl 


87 
INT211NTR 


88 
INT3/INTA 


89 
N.G. 


90 
VSS 


91 
GLK2 


92 
Vcc 


93 
RESET 


94 
N.G. 


95 
N.G. 


96 
N.G. 


97 
N.G. 


98 
N.G. 


99 
VSS 


intel~ 


Pin 
Signal 


100 
LADO 


101 
LADl 


102 
LAD2 


103 
VSS 


104 
LAD3 
, 


LAD4 
105 


106 
LAD5 


107 
LAD6 


108 
LAD? 


109 
LADB 


110 
LAD9 


111 
LAD10 


112 
LADll 


113 
LAD12 


114 
VSS 


115 
LAD13 


116 
LAD14 


117 
LAD15 


118 
LAD16 


119 
LAD17 


120 
LAD1B 


121 
LAD19 


122 
LAD20 


123 
LAD2l 


124 
LAD22 


125 
VSS 


126 
LAD23 


127 
LAD24 


128 
LAD25 


129 
BADAG 


130 
HOLD 


131 
N.G. 


132 
ADS 


I 


Signal 
Pin 


ADS 
132 


ALE 
2 


BADAC 
129 


BEo 
14 


BE1 
15 


BE2 
16 


BE3 
17 


CACHE 
10 


CLK2 
91 


DEN 
21 


DT/R 
13 


FAILURE 
18 


HLDA 
1 


HOLD 
130 


IAC/lNTo 
85 


INT1 
86 


INT2/INTR 
87 


INT3/INTA 
88 


LADo 
100 


LAD1 
101 


LAD2 
102 


LAD3 
104 


LAD4 
105 


LAD5 
106 


LAD6 
107 


LAD7 
108 


LAD8 
109 


LAD9 
110 


LAD10 
111 


LAD11 
112 


LAD12 
113 


LAD13 
115 


LAD14 
116 


I 


Table 12.80960KA 
PQFP Plnout-In 
Signal Order 


Signal 
Pin 


LAD15 
117 


LAD16 
118 


LAD17 
119 


LAD18 
120 


LAD19 
121 


LAD20 
122 


LAD21 
123 


LAD22 
124 


LAD23 
126 


LAD24 
127 


LAD25 
128 


LAD26 
3 


LAD27 
4 


LAD28 
5 


LAD29 
6 


LAD30 
7 


LAD31 
8 


LOCK 
20 


N.C. 
24 


N.C. 
25 


N.C. 
28 


N.C. 
31 


N.C. 
34 


N.C. 
37 


N.C. 
38 


N.C. 
39 


N.C. 
40 


N.C. 
43 


N.C. 
44 


N.C. 
45 


N.C. 
46 


N.C. 
47 


N.C. 
48 


Signal 
Pin 


N.C. 
49 


N.C. 
50 


N.C. 
51 


N.C. 
54 


N.C. 
58 


N.C. 
59 


N.C. 
60 


N.C. 
61 


N.C. 
62 


N.C. 
63 


N.C. 
64 


N.C, 
65 


N.C. 
66 


N.C. 
69 


N.C. 
72 


N.C. 
75 


N.C. 
76 


N.C. 
77 


N.C. 
78 


N.C. 
81 


N.C. 
89 


N.C. 
94 


N.C. 
95 


N.C. 
96 


N.C. 
97 


N.C. 
98 


N.C. 
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Pin 
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41 
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Vss 
79 


Vss 
80 
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3.3 Package Thermal Specification 
Maximum 
allowable 
ambient 
temperature 
(TA) per- 
mitted without 
exceeding 
TC is shown by the graphs 
in Figures 23, 24, 25 and 26. The curves assume the 
maximum 
permitted 
supply 
current 
(Icel 
at 
each 
speed, vcc of + 5.0V and a TCASE of + 85°e (PGA) 
or + 1000e 
(PQFP). 


The 80960KA 
is specified 
for operation 
when 
case 
temperatures 
within the range ooe to 85°e 
(PGA) or 
ooe to 1000e 
(PQFP). Measure 
case temperature 
at 
the top center of the package. 
Ambient 
temperature 
can be calculated 
from: 


TJ = Tc + p. 8JC 


TA = TJ + p. 8JA 


Tc = TA + p. [8JA - 
8Jcl 


If the 80960KA 
is to be used in a harsh environment 
where the ambient temperature 
may exceed 
the lim- 
its for the normal commercial 
part, consider 
using an 
extended 
temperature 
device. 
These 
components 
are designated 
by the prefix "TA" 
and are available 
at 16, 20 and 25 MHz in the ceramic 
PGA package. 
Extended 
operating 
temperature 
range is - 40 
0e 
to 
+ 125°e 
(case). 


Values for (JJAand (JJC for various airflows 
are given 
in Table 13 for the PGA package 
and in Table 14 for 
the PQFP package. 
The PGA's (JJAcan be reduced 
by adding 
a heatsink. 
For the 
PQFP, 
however,·a 
heatsink 
is not generally 
used 
since 
the device 
is 
intended 
to be surface 
mounted. 


Figure 
26 shows 
the 
maximum 
allowable 
ambient 
temperature 
for the 20 MHz extended 
temperature 
TA80960KA 
at various 
airflows. 
The curve assumes 
an tee of 420 
mA, vcc of 5.0V 
and 
a TCASE of 
+ 125°e. 


Table 
13. 80960KA 
PGA Package 
Thermal 
Characteristics 


Thermal 
Reslstance-°C/Watt 


Alrflow-H./mln 
(m/sec) 


Parameter 
0 
50 
100 
200 
400 
600 
800 
(0) 
(0.25) 
(0.50) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 


6 Junction-ta-Case 
2 
2 
2 
2 
2 
2 
2 
8JA t 


6 Case-ta-Ambient 
19 
18 
17 
15 
12 
10 
9 
I 


8J-PIN r\ 
I 
t ·JC 
I 
(No Heatsink) 
I 
118J-cAP 


6 Case-ta-Ambient 
16 
15 
14 
12 
9 
7 
6 
UUU 
27~ij!W 
(Omnidirectional 
Heatsink) 


6 Case-ta-Ambient 
15 
14 
13 
11 
8 
6 
5 
(Unidirectional 
Heatsink) 


NOTES: 
1. This table applies to 80960KA PGA plugged into socket or soldered 
directly to board. 


2.8JA = 8JC + 8CA 
3.8J-CAP = 4°C/Wapprox.) 
8J-PIN= 4°C/W 
(inner pins) (approx.) 
8J-PIN= 8°C/W 
(outer pins) (approx.) 
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Table 14.80960KA PQFP Package Thermal Characteristics 


Thermal Reslstance-°C/Watt 


Alrflow-ft.lmln 
(m/sec) 
Parameter 
0 
50 
100 
200 
400 
600 
800 
(0) 
(0.25) 
(0.50) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 


8 Junction-to-Case 
9 
9 
9 
9 
9 
9 
9 


8 Case-to-Ambient 
22 
19 
18 
16 
11 
9 
8 
(No Heatsink) 


NOTES: 
1. This table applies 
to 80960KA 
PQFP soldered 
directly 
to board. 
2. 6JA = 6JC + 6CA 
3. 6JL = 18'C/W 
(approx.) 
6JB = 18'C/W 
(approx.) 
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Figure 22. HOLD Timing 
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Figure 23. 16 MHz Maximum 
Allowable 
Ambient 
Temperature 
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Figure 24. 20 MHz Maximum 
Allowable 
Ambient 
Temperature 
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Figure 25. 25 MHz Maximum 
Allowable 
Ambient 
Temperature 
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Figure 26. Maximum 
Allowable 
Ambient 
Temperature 
for the 
Extended 
Temperature 
TA-80960KA 
20 MHz in PGA Package 


3-111 


80960KA 
intel~ 


4.0 
WAVEFORMS 


Figures 27, 28, 29 and 30 show the waveforms 
for various transactions 
on the 80960KA's 
local bus. 
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Figure 27. Non-Burst 
Read and Write Transactions 
without 
Wait States 
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Figure 28. Burst Read and Write Transaction 
without 
Wait States 
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Figure 29. Burst Write Transaction with 2,1,1, 1 Wait States 
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Figure 30. Accesses Generated by Quad Word Read Bus Request, 
Misaligned Two Bytes from Quad Word Boundary (1, 0, 0, 0 Wait States) 


3-113 


• 


• 
:V:, 
: 
:L..iJ 
, 
, 
, 
:/'-------~-------~:\: 


,, 
, 
,r: 


:V: 
, 
, 


ADS 
[~:-----.:~~ 
, 
, 


INTA 
[~:-----.:~\ 
: 
, 
\'-__ ~r: 
, 
, 
,\~ 
J:/: 


, 


\ 
I 
DT/R [ 
: 
,U 


"/ 
:\~ 
__ -- 
••--_- 
__ -~ __ ••--_J. 
, 


270775-33 


NOTE: 
INTR can go low no sooner 
than the input hold time following 
the beginning 
of interrupt 
acknowledgment 
cycle 
1. For a 
second 
interrupt 
to be acknowledged, 
INTR must be low for at least three 
cycles 
before 
it can be reasserted. 


Figure 31. Interrupt 
Acknowledge 
Transaction 
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5.0 
REVISION 
HISTORY 
identified 
to date 
is incorporated 
into this 
revision. 
The sections 
significantly 
changed 
since 
the previ- 
ous revision 
are: 
No revision 
history 
was 
maintained 
in earlier 
revi- 
sions 
of this 
data 
sheet. 
All errata 
that 
has been 


Section 
Last 
Description 
Rev. 


Table 3. 80960KA 
Pin Description: 
LoBus 
-004 
lOCK 
pin description 
rewritten 
for clarity. 
Signals (pg. 9) 


2.3. Connection 
Recommendations 
-004 
Changed 
suggested 
open-drain 
termination 
networks 


(pg. 12) 
to reflect more realistic operating 
conditions 
with 
reduction 
in DC power consumption. 


Figure 9. Typical Current vs. Frequency 
-004 
Added figure for typical power supply current at hot 
(Hot Temp) (pg. 14) 
temperature 
to aid thermal 
analysis. 


Figure 12. Test load 
Circuit for Three- 
-004 
All outputs now specified 
with standard 
50 pF test 
State Output Pins (pg. 15) 
loads to agree with actual test methodology. 


Figure 13. Test load 
Circuit for Open- 
Drain Output Pins (pg. 15) 


2.7. DC Characteristics 
(pg. 16) 
-004 
'cc max specification 
reduced: 


WAS: 
IS: 
AT: 
375mA 
315mA 
16MHz 
420mA 
360mA 
20MHz 
480mA 
420mA 
25MHz 


Figures 7, 8, 9, 23, 24, 25 and 26 have also been 
changed 
accordingly. 


2.8. AC Specifications 
(pg. 17) 
-004 
25 MHz operation 
extended 
to product 
in PQFP 
package. 
TB min. improved 
at all frequencies 
from 
o ns to 2 ns and TB max. improved 
from 20 ns to 
18 ns. 


TBH max improvement: 


WAS: 
IS: 
AT: 
31 ns 
28 ns 
16MHz 
26 ns 
23 ns 
20MHz 
24 ns 
23 ns 
25MHz 


Functional 
Waveforms 
-004 
Redrawn for clarity. ClK 
signal drawn with more likely 
phase relationship 
to ClK2. 
Open-drain 
output 
signals drawn to show correct 
inactive states. 


Various 
-004 
Deleted all references 
to 10 MHz. Intel no longer 
offers a 10 MHz 80960KA 
device. 


I 
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80960KB 
EMBEDDED 32-BIT MICROPROCESSOR 
WITH INTEGRATED FLOATING POINT UNIT 


• 
High-Performance 
Embedded 
Architecture 
- 
25 MIPS Burst Execution at 25 MHz 
- 
9.4 MIPS- Sustained Execution at 
25 MHz 


• 
512-Byte On-Chip Instruction 
Cache 
- 
Direct Mapped 
- 
Parallel Load/Decode 
for Uncached 
Instructions 


• 
Multiple Register Sets 
- 
Sixteen Global 32-Blt Registers 
- 
Sixteen Local 32-Blt Registers 
- 
Four Local Register Sets Stored 
On-Chip 
- 
Register Scoreboarding 


• 
4 Gigabyte, Linear Address Space 


• 
Pin Compatible with 80960KA 


• 
Built-In Interrupt Controller 
- 
31 Priority Levels, 256 Vectors 
- 
3.4 fis Latency 
@ 25 MHz 


• 
Easy to Use, High Bandwidth 32-Blt Bus 
- 
66.7 Mbytes/s 
Burst 
- 
Up to 16 Bytes Transferred 
per Burst 


• 
132-Lead Packages: 
- 
Pin Grid Array (PGA) 
- 
Plastic Quad Flat-Pack (PQFP) 


• 
On-Chip Floating Point Unit 
- 
Supports IEEE 754 Floating Point 
Standard 
- 
Four 80-Bit Registers 
-13.6 
Million Whetstones/s 
(Single 
Precision) at 25 MHz 


The B0960KB 
is a member of Intel's i960~ 32·bit processor 
family, which is designed 
especially 
for embedded 
applications. 
It includes 
a 512-byte 
instruction 
cache, 
an integrated 
floating 
point unit, and a built-in interrupt 
controller. 
The B0960KB 
has a large register 
set, multiple 
parallel execution 
units and a high-bandwidth 
burst 
bus. Using 
advanced 
RiSe technology, 
this 
high performance 
processor 
is capable 
of execution 
rates 
in 
excess 
of 9.4 million 
instructions 
per second", 
The B0960KB 
is well-suited 
for a wide range of applications 
including 
non-impact 
printers, 
1/0 control 
and specialty 
instrumentation. 


32-BIT 


BUS 


CONTROL 
LOGIC 
32-BIT 


BURST 


BUS 


Figure 
1. The 80960KB 
Processor's 
Highly 
Parallel 
Architecture 
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"Relative 
to Digital Equipment 
Corporation's 
VAX-11I7BO 
at 1 MIPS CVAX·11 is a trademark 
of Digital Equipment 
Corpora~on.) 
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Figure 2. 80960KB 
Programming 
Environment 
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1.1 Key Performance Features 


The 80960 architecture is based on the most recent 
advances 
in 
microprocessor 
technology 
and 
is 
grounded in Intel's long experience in the design 
and manufacture of embedded microprocessors. 
Many features contribute to the 80960KB's excep- 
tional performance: 
1. Large Register Set. Having a large number of 
registers reduces the number of times that a 
processor needs to access memory. Modern 
compilers can take advantage of this feature to 
optimize execution speed. For maximum flexibili- 
ty, the 80960KB provides thirty-two 32-bit regis- 
ters and four 80-bit floating point registers. (See 
Figure 2.) 


2. Fast Instruction 
Execution. 
Simple functions 
make up the bulk of instructions in most pro- 
grams so that execution speed can be improved 
by ensuring that these core instructions are exe- 
cuted as quickly as possible. The most frequently 
executed instructions such as register-register 
moves, add/subtract, 
logical 
operations 
and 
shifts execute in one to two cycles. (Table 1 con- 
tains a list of instructions.) 
3. Load/Store 
Architecture. 
One way to improve 
execution speed is to reduce the number of times 
that the processor must access memory to per- 
form an operation. As with other processors 
based on RiSe technology, the 80960KB has a 
Load/Store architecture. As such, only the LOAD 
and STORE instructions reference memory; all 
other instructions operate on registers. This type 
of architecture simplifies instruction decoding 
and is used in combination with other techniques 
to increase parallelism. 
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Figure 3. Instruction 
Formats 
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Table 
1. 80960KB 
Instruction 
Set 


Data Movement 
Arithmetic 
Logical 
Bit and Bit Field 


Load 
Add 
And 
Set Bit 


Store 
Subtract 
Not And 
Clear Bit 


Move 
Multiply 
And Not 
Not Bit 


Load Address 
Divide 
Or 
Check Bit 
Remainder 
Exclusive 
Or 
Alter Bit 
Modulo 
Not Or 
Scan For Bit 


Shift 
Or Not 
Scan Over Bit 
Exclusive 
Nor 
Extract 
Not 
Modify 
, 


Nand 
Rotate 


Comparison 
Branch 
Call/Return 
Fault 


Compare 
Unconditional 
Branch 
Call 
Conditional 
Fault 
Conditional 
Compare 
Conditional 
Branch 
Call Extended 
Synchronize 
Faults 
Compare 
and Increment 
Compare 
and Branch 
Call System 
Compare 
and Decrement 
Return 
Branch and Link 


Debug 
Miscellaneous 
Decimal 
Floating 
Point 


Modify Trace Controls 
Atomic Add 
Decimal Move 
Move Real 
Mark 
Atomic 
Modify 
Decimal Add with Carry 
Add 
Force Mark 
Flush Local Registers 
Decimal Subtract 
with 
Subtract 
Modify Arithmetic 
Controls 
Carry 
Multiply 
Scan Byte for Equal 
Divide 
Test Condition 
Code 
Remainder 
Modify Process Controls 
Scale 
Round 
Square Root 
Sine 
Cosine 
Tangent 
Arctangent 
Log 
Log Binary 
Log Natural 
Exponent 
Classify 
Copy Real Extended 
Compare 


Synchronous 
Conversion 


Synchronous 
Load 
Convert 
Real to Integer 


Synchronous 
Move 
Convert 
Integer to Real 


3·120 
I 


intel~ 


4. Simple Instruction 
Formats. All instructions in 
the a0960KB are 32 bits long and must be 
aligned on word 
boundaries. This alignment 
makes it possible to eliminate the instruction 
alignment stage in the pipeline. To simplify the 
instruction decoder, there are only five instruction 
formats; each instruction uses only one format. 
(See Figure 3.) 


5. Overlapped 
Instruction 
Execution. Load oper- 
ations allow execution of subsequent instructions 
to continue before the data has been retumed 
from memory, so that these instructions can 
overlap the load. The a0960KB manages this 
process transparently to software through the 
use of a register scoreboard. Conditional instruc- 
tions also make use of a scoreboard so that sub- 
sequent unrelated instructions may be executed 
while the conditional instruction is pending. 
6. Integer Execution Optimization. 
When the re- 
sult of an arithmetic execution is used as an oper- 
and in a subsequent calculation, the value is sent 
immediately to its destination register. Yet at the 
same time, the value is put on a bypass path to 
the ALU, thereby saving the time that otherwise 
would be required to retrieve the value for the 
next operation. 


7. Bandwidth 
Optlmlzatlons. 
The a0960KB gets 
optimal use of its memory bus bandwidth be- 
cause the bus is tuned for use with the on-chip 
instruction cache: instruction cache 
line size 
matches the maximum burst size for instruction 
fetches. The a0960KB automatically fetches four 
words in a burst and stores them directly in the 
cache. Due to the size of the cache and the fact 
that it is continually filled in anticipation of need- 
ed instructions in the program flow, the a0960KB 
is relatively insensitive to memory wait states. 
The benefit is that the a0960KB delivers out- 
standing performance even with a low cost mem- 
ory system. 


8. Cache Bypass. If a cache miss occurs, the proc- 
essor fetches the needed instruction then sends 
it on to the instruction decoder at the same time it 
updates the cache. Thus, no extra time is spent 
to load and read the cache. 


1.1.1 MEMORY SPACE AND ADDRESSING 
MODES 


The a0960KB offers a linear programming environ- 
ment so that all programs running on the processor 
are contained in a single address space. Maximum 
address space size is 4 Gigabytes (232 bytes). 


For ease of use the a0960KB has a small number of 
addressing modes, but includes all those necessary 
to ensure efficient execution of high-level languages 
such as C. Table 2 lists the modes. 
I 
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Table 2. Memory Addressing Modes 


• 12-Bit Offset 
• 32-Bit Offset 
• Register-Indirect 
• Register + 12-Bit Offset 
• Register + 32-Bit Offset 
• Register + (Index-Register x Scale-Factor) 
• Register x Scale Factor + 32-Bit 
Displacement 
• Register + (Index-Register x Scale-Factor) 


+ 32-Bit Displacement 


• Scale-Factor is 1, 2, 4, a or 16 


1.1.2 DATA TYPES 


The a0960KB recognizes the following data types: 


Numeric: 


• a-, 16-, 32- and 64-bit ordinals 
• a-, 16-, 32- and 64-bit integers 
• 32-, 64-, and so-bit real numbers 


Non-Numeric: 


• Bit 
• Bit Field 
• Triple Word (96'bits) 
• Quad-Word (12a bits) 


1.1.3 LARGE REGISTER SET 


The a0960KB programming environment includes a 
large number of registers. In fact, 32 registers are 
available at any time. The availability of this many 
registers greatly reduces the number of memory ac- 
cesses required to perform algorithms, which leads 
to greater instruction processing speed. 


There are two types of general-purpose registers: 
local and global. The global registers consist of six- 
teen 32-bit registers (GOthough G15) and four eo-en 
registers (FPO through FP3). These registers per- 
form the same function as the general-purpose reg- 
isters provided in other popular microprocessors. 
The term global refers to the fact that these regis- 
ters retain their contents across procedure calls. 
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ister frames, which means that up to three proce- 
dure calls can be made without having to access the 
procedure stack resident in memory. 


Although programs may have procedure calls nest- 
ed many calls deep, a program typically oscillates 
back and forth between only two to three levels. As 
a result, with four stack frames in the cache, the 
probability of having a free frame available on the 
cache when a call is made is very high. In fact, runs 
of representative C-Ianguage programs show that 
80% of the calls are handled without needing to ac- 
cess memory. 


If four or more procedures are active and a new pro- 
cedure is called, the 80960KB moves the oldest lo- 
cal register set in the stack-frame cache to a proce- 
dure stack in memory to make room for a new set of 
registers. Global register G15 is the frame pointer 
(FP) to the procedure stack. 


and forth in the same small section of code."Thus, 
by maintaining a block of instructions in cache, the 
number of memory references required to read in- 
structions into the processor is greatly reduced. 


To 
load the instruction cache, 
instructions are 
fetched in 16-byte blocks; up to four instructions can 
be fetched at one. time. An efficient prefetch algo- 
rithm increases the probability that an instruction will 
already be in the cache when it is needed. 


Code for small loops often fits entirely within the 
cache, leading to a great increase in processing 
speed since further memory references might not be 
necessary until the program exits the loop. Similarly, 
when calling short procedures, the code for the call- 
ing procedure is likely to remain in the cache so it 
will be there on the procedure's retum. 
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Figure 4. Multiple Register 
Sets Are Stored 
On-Chip 
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LUI\U ana me Instruction 
mat uses the register con- 
tents, as shown 
in the following 
example: 


Id data_2, r4 
Id data_2, r5 
Unrelated instruction 
Unrelated instruction 
add R4, R5, R6 


In essence, 
the two unrelated 
instructions 
between 
LOAD and ADD are executed 
"for free" 
(i.e., take no 
apparent 
time to execute) 
because 
they are execut- 
ed while 
the 
register 
is being 
loaded. 
Up to three 
load instructions 
can be pending 
at one 
time with 
three corresponding 
scoreboard 
bits set. By exploit- 
ing this feature, 
system 
programmers 
and compiler 
writers 
have 
a useful 
tool 
for optimizing 
execution 
speed. 


1.1-7 FLOATING POINT ARITHMETIC 


In the 80960KB, 
floating 
point arithmetic 
has been 
made an integral 
part of the architecture. 
Having the 
floating 
point 
unit integrated 
on chip 
provides 
two 
advantages. 
First, 
it improves 
the 
performance 
of 
the chip for floating 
point applications, 
since no addi- 
tional bus overhead 
is associated 
with floating 
point 
calculations, 
thereby 
leaving more time for other bus 
operations 
such 
as I/O. 
Second, 
the cost of using 
floating 
point operations 
is reduced 
because 
a sepa- 
rate coprocessor 
chip is not required. 


The 80960KB 
floating 
point (real number) data types 


include 
single 
precision 
(32-bit), 
double 
precision 
(64-bit) and extended 
precision 
(80-bit) floating 
point 
numbers. 
Any 
registers 
may 
be 
used 
to 
execute 
floating 
point operations. 
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Square Root 
3.T 
Arctangent 
10.1 
Exponent 
11.3 
Sine 
15.2 
Cosine 
15.2 


3.9 
13.1 
12.5 
16.6 
16.6 


1.1.8 HIGH BANDWIDTH LOCAL BUS 


The 80960KB 
CPU resides on a high-bandwidth 
ad- 
dress/data 
bus known as the local bus (t.-Bus). The 
L-Bus 
provides 
a direct 
communication 
path 
be- 
tween the processor 
and the memory 
and I/O 
sub- 
system 
interfaces. 
The processor 
uses the L-Bus to 
fetch 
instructions, 
manipulate 
memory 
and respond 
to interrupts. 
L-Bus features 
include: 


• 
32-bit multiplexed 
address/data 
path 


• 
Four-word 
burst capability 
which allows 
transfers 
from 1 byte to 16 bytes at a time 


• 
High bandwidth 
reads and writes with 
66.7 MBytes/s 
burst (at 25 MHz) 


Table 
4 defines 
L-bus signal 
names 
and functions; 
Table 
5 defines 
other 
component-support 
signals 
such as interrupt 
lines. 
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1.1.9 
INTERRUPT 
HANDLING 


The 80960KB can be interrupted in two ways: by the 
activation of one of four interrupt pins or by sending 
a message on the processor's data bus. 


The 80960KB is unusual in that it automatically han- 
dles interrupts on a priority basis and can keep track 
of pending interrupts through its on-chip interrupt 
controller. Two of the interrupt pins can be confl- 
gured to provide 8259A-style handshaking for ex- 
pansion beyond four interrupt lines. 


1.1.10 
DEBUG 
FEATURES 


The 80960KB has built-in debug capabilities. There 
are two types of breakpoints and six trace modes. 
Debug features are controlled by two intemal 32-bit 
registers: the Process-Controls Word and the Trace- 
Controls Word. By setting bits in these control 
words, a software debug monitor can closely control 
how the processor responds during program execu- 
tion. 


The 80960KB provides two hardware breakpoint 
registers on-chip which, by using a special com- 
mand, can be set to any value. When the instruction 
pointer matches either breakpoint register value, the 
breakpoint handling routine is automatically called. 


The 80960KB also provides software breakpoints 
through the use of two instructions: MARK and 
FMARK. These can be placed at any point in a pro- 
gram and cause the processor to halt execution at 
that point and call the breakpoint handling routine. 
The breakpoint mechanism is easy to use and pro- 
vides a powerful debugging tool. 


Tracing is available for instructions (single step exe- 
cution), calls and returns and branching. Each trace 
type may be enabled separately by a special debug 
instruction. In each case, the 80960KB executes the 
instruction first and then calls a trace handling rou- 
tine (usually part of a software debug monitor). Fur- 
ther program execution is halted until the routine 
completes, at which time execution resumes at the 
next instruction. The 80960KB's tracing mecha- 
nisms, implemented completely in hardware, greatly 
simplify the task of software test and debug. 
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1.1.11 
FAULT 
DETECTION 


The 80960KB has an automatic mechanism to han- 
dle faults. Fault types include floating point, trace 
and arithmetic faults. When the processor detects a 
fault, it automatically calls the appropriate fault han- 
dling routine and saves the current instruction point- 
er and necessary state information to make efficient 
recovery possible. Like interrupt handling routines, 
fault handling routines are usually written to meet 
the needs of specific applications and are often in- 
cluded as part of the .operating system or kernel. 


For each of the fault types, there are numerous sub- 
types that provide specific information about a fault. 
For example, a floating point fault may have the sub- 
type set to an Overflow or Zero Divide fault. The 
fault handler can use this specific information to re- 
spond correctly to the fault. 


1.1.12 
BUILT-IN 
TESTABILITY 


Upon reset, the 80960KB automatically conducts an 
exhaustive internal test of its major blocks of logic. 
Then, before executing its first instruction, it does a 
zero check sum on the first eight words in memory 
to ensure that the memory image was programmed 
correctly. If a problem is discovered at any point dur- 
ing the self-test, the 80960KB asserts its FAILURE 
pin and will not begin program execution. Self test 
takes approximately 47,000 cycles to complete. 


System manufacturers can use the 80960KB's self- 
test feature during incoming parts inspection. No 
special diagnostic programs need to be written. The 
test is both thorough and fast. The self-test capabili- 
ty helps ensure that defective parts are discovered 
before systems are shipped and, once in the field, 
the self-test makes it easier to distinguish between 
problems caused by processor failure and problems 
resulting from other causes. 


1.1.13 
CHMOS 


The 80960KB is fabricated using Intel's CHMOS IV 
(Complementary High Speed Metal Oxide Semicon- 
ductor) process. The 80960KB is currently available 
in 16, 20 and 25 MHz versions. 
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Table 
4. 80960KB 
Pin Description: 
L·Bus 
Signals 


Name 
Type 
Description 


CLK2 
I 
SYSTEM 
CLOCK 
provides the fundamental 
timing for a0960KB 
systems. 
It is divided 


by two inside the a0960KB 
and four so-blt registers 
(FPO through 
FP3) to generate 
the internal processor 
clock. 


LAD31:0 
I/O 
LOCAL 
ADDRESS/DATA 
BUS carries 32-bit physical addresses 
and data to and 


T.S. 
from memory. 
During an address 
(Ta) cycle, bits 2-31 
contain 
a physical word 
address 
(bits 0-1 
indicate SIZE; see below). During a data (Td) cycle, bits 0-31 
contain 
read or write data. These pins float to a high impedance 
state when not 
active. 


Bits 0-1 
comprise 
SIZE during a Ta cycie. SIZE specifies 
burst transfer 
size in words. 


LAD1 
LADO 
0 
0 
1 Word 
0 
1 
2 Words 
1 
0 
3 Words 
1 
1 
4 Words 


ALE 
0 
ADDRESS 
LATCH 
ENABLE 
indicates 
the transfer 
of a physical address. ALE is 
T.S. 
asserted 
during aT a cycle and deasserted 
before the beginning 
of the Td state. It is 
active LOW and floats to a high impedance 
state during a hold cycle (Th). 


ADS 
0 
ADDRESS/DATA 
STATUS 
indicates 
an address state. ADS is asserted 
ev~ 
a 
0.0. 
state and deasserted 
during the following 
Td state. For a burst transaction, 
AD 
is 
asserted 
again every Td state where READY was asserted 
in the previous cycle. 


W/R 
0 
WRITE/READ 
specifies, 
during a Ta cycle, whether 
the operation 
is a write or read. It 


0.0. 
is latched on-chip and remains valid during Td cycles. 


DT/R 
0 
DATA TRANSMIT/RECEIVE 
indicates 
the direction 
of data transfer 
to and from the 
0.0. 
t-Bus. It is low during Ta and Td cycles for a ~ad 
or interrupt acknowledgment; 
it is 
high during Ta and Td cycles for a write. DT /R never changes 
state when DEN is 
asserted. 


READY 
I 
READY indicates 
that data on LAD lines can be sampled 
or removed. 
If READY 
is not asserted 
during a Td cycle, the Td cycle is extended 
to the next cycle by 
inserting 
a wait state (Tw) and ADS is not asserted 
in the next cycle. 


LOCK 
I/O 
BUS LOCK prevents 
bus masters from gaining control 
of the L-Bus during Bead/ 
0.0. 
Modify/Write 
(RMW) cycles. The processor 
or any bus agent may assert L 
K. 


At the start of a RMW operation, 
the processor 
examines 
the LOCK pin. If the pin is 
already asserted, 
the processor 
waits until it is not asserted. 
If the pin is not asserted, 


the processor 
asserts LOCK during the Ta cycle of the read transaction. 
The 
POCessor deasserts 
LOCK in the Ta cycle of the write transaction. 
During the time 
L 
K is asserted, 
a bus agent can perform 
a normal read or write but not a RMW 
operation. 


The processor 
also asserts 
LOCK during interrupt-acknowledge 
transactions. 
Do not 


. leave LOCK unconnected. 
It must be pulled high for the processor 
to function 
properly. 


1/0 = InputlOutput, 0 = Output. I = Input. 0.0. = Open Drain, T.S. = Three-State 


I 
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~nts 
are required to assert only adjacent 
byte enables 
(e.g., asserting 
just BED and 
BE2 is not permitted) 
and are required to assert at least one byte enable. Address 
bits 
Ao and A1 can be decoded 
externally 
from the byte enables. 


HOLD 
I 
HOLD: A request from an external 
bus master to acquire the bus. When the processor 
receives 
HOLD and grants bus control 
to another 
master, it floats its three- state bus 
lines and open-drain 
control 
lines, asserts HLDA and enters the Th state. When HOLD 
deasserts, 
the processor 
deasserts 
HLDA and enters the Ti or Ta state. 


HLDA 
0 
HOLD ACKNOWLEDGE: 
Notifies an external 
bus master that the processor 
has 
T.S. 
relinquished 
control 
of the bus. 


CACHE 
0 
CACHE 
indicates 
when an access is cacneable 
during a Ta cycle. It is not asserted 
T.S. 
during any synchronous 
access, such as a synchronous 
load or move instruction 
used 
for sending an lAC message. 
The CACHE signal floats to a high impedance 
state 
when the processor 
is idle. 


I/O = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.S. = Three-State 


Name 
Type 
Description 
. 


BADAC 
I 
BAD ACCESS, 
if asserted 
in the cycle following 
the one in which the last READY of a 
transaction 
is asserted, 
indicates 
an unrecoverable 
error occurred 
on the current bus 
transaction 
or a synchronous 
load/store 
instruction 
has not been acknowledged. 


During system reset the BADAC signal is interpreted 
differently. 
If the signal is high, it 
indicates 
that this processor 
will perform 
system initialization. 
If it is low, another 
processor 
in the system will perform 
system initialization 
instead. 


RESET 
I 
RESET clears the processor's 
internal logic and causes it to reinitialize. 


During RESET assertion, 
the input pins are ignored (except for BADAC and lAC/INTo), 
the three-state 
output pins are placed in a high impedance 
state and other output pins 
are placed in their non-asserted 
states. 


RESET must be asserted 
for at least 41 CLK2 cycles for a predictable 
RESET. The 
HIGH to LOW transition 
of RESET should occur after the rising edge of both CLK2 and 
the external bus clock and before the next rising edge of CLK2. 


Table 5. 80960KB 
Pin Description: 
Support 
Signals 


1/0 = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.S. = Three-State 
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Table 5. 80960KB 
Pin Description: 
Support 
Signals (Continued) 


Name 
Type 
Description 


FAILURE 
0 
INITIALIZATION 
FAILURE 
indicates 
that the processor 
did not initialize correctly. 


0.0. 
After RESET deasserts 
and before the first bus transaction 
begins, FAILURE 
asserts while the processor 
performs 
a self-test. 
If the self-test 
completes 
successfully, 
then FAILURE deasserts. 
The processor 
then performs 
a zero 
checksum 
on the first eight words of memory. 
If it fails, FAILURE 
asserts for a 
second time and remains asserted. 
If it passes, system initialization 
continues 
and 
FAILURE 
remains deasserted. 


iAC/INTo 
I 
INTERAGENT 
COMMUNICATION 
REQUEST/INTERRUPT 
0 indicates 
an lAC 
message 
or an interrupt 
is pending. The bus interrupt 
control 
register determines 
how the signal is interpreted. 
To signal an interrupt 
or lAC request in a synchronous 


system, this pin-as 
well as the other interrupt 
pins-must 
be enabled 
by being 
deasserted 
for at least one bus cycle and then asserted 
for at least one additional 


bus cycle. In an asynchronous 
system the pin must remain deasserted 
for at least 


two bus cycles and then asserted 
for at least two more bus cycles. 


During system reset, this signal must be in the logic high condition 
to enable 
normal processor 
operation. 
The logic low condition 
is reserved. 


INT1 
I 
INTERRUPT 
1, like INTo, provides 
direct interrupt signaling. 


INT2/INTR 
I 
INTERRUPT 
2/INTERRUPT 
REQUEST: 
The interrupt control 
register determines 
how this pin is interpreted. 
If INT 2, it has the same interpretation 
as the INTo and 
INT1 pins. If INTR, it is used to receive an interrupt 
request from an external 
interrupt 
controller. 


INT3/INTA 
I/O 
INTERRUPT 
3/INTERRUPT 
ACKNOWLEDGE: 
The bus interrupt 
control 
register 
0.0. 
determines 
how this pin is interpreted. 
If INT3, it has the same interpretation 
as the 
INTo INT 1 and INT 2 pins. If INTA, it is used as an output to control 
interrupt- 
acknowledge 
transactions. 
The INTA output is latched on-chip 
and remains valid 
during Td cycles; as an output, it is open-drain. 


N.C. 
N/A 
NOT CONNECTED 
indicates 
pins should not be connected. 
Never connect 
any pin 


marked N.C. as these pins may be reserved 
for factory 
use. 


I/O = Input/Output, 0 = Output, I = Input, 0.0. = Open Drain, T.S. = Three-State 


2.0 
ELECTRICAL SPECIFICATIONS 
board, all vcc pins must be strapped 
closely togeth- 
er, preferably 
on a power plane; all Vss pins should 


be strapped 
together, 
preferably 
on a ground 
plane. 


2.1 
Power and Grounding 


The 80960KB 
is implemented 
in CHMOS 
IV technol- 
ogy and therefore 
has modest 
power 
requirements. 
Its high clock 
frequency 
and numerous 
output 
buff- 
ers (address/data, 
control, 
error and arbitration 
sig- 
nals) 
can 
cause 
power 
surges 
as multiple 
output 
buffers 
simultaneously 
drive 
new signal 
levels. 
For 


clean on-chip 
power 
distribution, 
Vcc 
and Vss 
pins 
separately 
feed the device's 
functional 
units. Power 
and 
ground 
connections 
must 
be 
made 
to 
all 


80960KB 
power 
and 
ground 
pins. 
On 
the 
circuit 


2.2 
Power Decoupling 
Recommendations 


Place 
a liberal 
amount 
of decoupling 
capacitance 


near the 80960KB. 
When driving the L-bus the proc- 


essor can cause transient 
power surges, 
particularly 


when connected 
to a large capacitive 
load. 


I 
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low 
inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. 
Inductance is reduced by shortening 
board traces between thq processor and decoupling 
capacitors as much as possible. 


2.3 Connection Recommendations 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. In particular, if 
one or more interrupt lines are not used, they should 
be pulled up. No inputs should ever be left floating. 


All open-drain outputs require a pullup device. While 
in most cases a simple pullup resistor is adequate, a 
network of pullup and pulldown resistors biased to a 
valid VtH (>3.0V) and terminated in the characteris- 
tic impedance of the circuit board is recommended 
to limit noise and AC power consumption. Figure 5 
and Figure 6 show recommended values for the re- 
sistor network for low and high current drive, assum- 
ing a characteristic impedance of 1000. Terminating 
output signals in this fashion limits signal swing and 
reduces AC power consumption. 


NOTE: 
Do not connect external logic to pins marked N.C. 


Open Drain Output 
vcc 


I 
- 


270565-5 


220n 


330n 


low Drive Network: 
VOH = 3.0V 
tOL = 20.7 mA 


Figure 5. Connection Recommendations 
for 
Low Current Drive Network 
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Open-Drain Output 
vcc 


ISOn 


390n 


270565-6 


High Drive Network: 
VOH = 3.4V 
tOL = 25.3 mA 


Figure 6. Connection Recommendations 
for 
High Current Drive Network 


2.4 Characteristic Curves 


Figure 7 shows typical supply current requirements 
over the operating temperature range of the proces- 
sor at supply voltage (Vee>of 5V. Figure 8 and Fig- 
ure 9 show the typical power supply current (Ice> 
that the 80960KB requires at various operating fre- 
quencies when measured at three input voltage 
(Vee>levels and two temperatures. 


For a given output current (IoU the curve in Figure 
10 shows the worst case output low voltage (Vou. 
Figure 11 shows the typical capacitive derating 
curve for the 80960KB measured from 1.5V on the 
system clock (ClK) to 1.5V on the falling edge and 
1.5V on the rising edge of the LoBusaddress/data 
(LAD) signals. 
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Figure 7. Typical Supply Current 
vs Case Temperature 
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• 
@5.5V 
o @5.0V 


• 
@4.5V 


Figure 8. Typical 
Current 
vs Frequency 
(Room Temp) 


OPERATING FREQUENCY (IotHz) 
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Figure 9. Typical Current 
vs Frequency 
(Hot Temp) 
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Figure 10. Worst-Case 
Voltage 
vs Output Current 
on Open-Drain 
Pins 
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Figure 11. Capacitive 
Derating 
Curve 
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2.5 
Test Load Circuit 


Figure 12 illustrates the load circuit used to test the 
80960KB's three-state pins; Figure 13 shows the 
load circuit used to test the open drain outputs. The 
open drain test uses an active load circuit in the form 
of a matched diode bridge. Since the open-drain out- 
puts sink current, only the IOl legs of the bridge are 
necessary and the IOHlegs are not used. When the 
80960KB driver under test is turned off, the output 
pin is pulled-up to 
VREF 
(Le., VOH). 
Diode 01 is 
turned off and the IOl current source flows through 
diode 02. 


When the 80960KB open-drain driver under test is 
on, diode 01 is also on and the voltage on the pin 
being tested drops to VOL. Diode 02 turns off and 
IOl flows through diode 01. 
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Three-State 
Output 
o 


270565-12 


NOTE: 
CL = 50 pF for all signals 


Figure 12. Test Load Circuit for 
Three-State 
Output Pins 


Open-Drain 
Output 
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NOTES: 
IOL Tested 
at 25 mA 
VREF = vcc 
D1 and D2 are matched 
CL = 50 pF for all signals 


Figure 13. Test Load Circuit for 
Open-Drain 
Output Pins 
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3.0 
ABSOLUTE 
MAXIMUM 
RATINGS 
NOTICE: This is a production data sheet. The specifi- 
cations are subject to change without notice. 
Operating 
Temperature 
PGA 
O°C to + 85°C Case 
PQFP 
, 
O°C to + 100°C Case 


Storage Temperature 
- 65°C to + 150°C 


Voltage 
on Any Pin 
-0.5V 
to VCC + 0.5V 


Power Dissipation 
2.5W (25 MHz) 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause 
permanent 
damage. 
These are stress ratings 
only. Operation 
beyond 
the 


"Operating 
Conditions" 
is not recommended 
and ex- 
tended 
exposure 
beyond 
the 
"Operating 
Conditions" 
may affect device reliability. 


3.1 
DC Characteristics 


PGA: 
80960KB 
(16 MHz) TCASE = O°C to + 85°C, VCC = 5V ±10% 


80960KB 
(20 and 25 MHz) TCASE = O°C to + 85°C, VCC = 5V ±5% 


PQFP: 
80960KB 
(16 MHz) TCASE = O°C to + 100°C, VCC = 5V ±10% 


80960KB 
(20 and 25 MHz) TCASE = O°C to + 100°C, VCC = 5V ± 5% 


Table 6_DC Characteristics 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Vll 
Input Low Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
VCC + 0.3 
V 


VCl 
CLK2 Input Low Voltage 
-0.3 
+0.8 
V 


VCH 
CLK2 Input High Voltage 
0.55 VCC 
VCC + 0.3 
V 


VOl 
Output Low Voltage 
0.45 
V 
(1,2) 


VOH 
Output High Voltage 
2.4 
V 
(3,4) 


tee 
Power Supply Current: 
16MHz 
315 
mA 
(5) 
20MHz 
360 
mA 
(5) 
25MHz 
420 
mA 
(5) 


lu 
Input Leakage Current 
±15 
p.A 
o s: VIN s VCC 


ILO 
Output Leakage Current 
±15 
p.A 
0.45 s: vo s: VCC 


CIN 
Input Capacitance 
10 
pF 
fC = 1 MHz(6) 


Co 
Output Capacitance 
12 
pF 
fC = 1 MHz(6) 


CClK 
Clock Capacitance 
10 
pF 
tc = 1 MHz(6) 


NOTES: 
1. For three-state outputs, this parameter is measured at: 
Addresss/Data 
: 
4.0 mA 
Controls 
- 
5.0 mA 
2. For open-drain outputs 
25 mA 
3. This parameter is measured at: 
Address/Data 
-1.0 
mA 
Controls .......................................•..................................................... 
-0.9 
mA 
ALE 
-5.0 
mA 
4. Not measured on open-drain outputs. 
5. Measured at worst case frequency, vcc and temperature, with device operating and outputs loaded to the test conditions 


in Figures 12 and 13. Figure 7, Figure 8 and Figure 9 indicate typical values. 


6. Input, output and clock capacitance are not tested. 
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This section describes the AC specifications for the 
80960KB pins. All input and output timings are spec- 
ified relative to the 1.5V level of the rising edge of 
CLK2. For output timings the specifications refer to 
the time it takes the signal to reach 1.5V. 


For input timings the specifications refer to the time 
at which the signal reaches (for input setup) or 
leaves (for hold time) the TTL levels of LOW (0.8V) 
or HIGH (2.0V). All AC testing should be done with 
input voltages of 0.4V and 2.4V. except for the clock 
(CLK2). which should be tested with input voltages 
of 0.45V and 0.55 vcc- 


3.2 
AC Specifications 


EDGE 


CLK2 


OUTPUTS: 
LAD31 :0 


ADS 
W/R. 
DEN 


BE3:0 


HLDA 


CACHE 


LOCK, 
INTA 


DT/R 


INPUTS: 
LAD31 :0 


BADAC 


lAC/INTO, 
INTI 


INT2/INTR, 
INB 


HOLD 


LOCK 


READY 


A 
C 


I 


270565-14 


Figure 14. Drive Levels and Timing Relationships for 80960KB Signals 
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T6 
Output Valid Delay 
2 
25 
ns 


T6H 
HLDA Output Valid Delay 
4 
28 
ns 


T7 
ALE Width 
15 
ns 


Ta 
ALE Output Valid Delay 
2 
18 
ns 


T9 
Output Float Delay 
2 
20 
ns 
(2) 


T9H 
HLDA Output Float Delay 
4 
20 
ns 
(2) 


SYNCHRONOUS 
INPUTS 


T10 
Input Setup 1 
3 
ns 
(3) 


T11 
Input Hold 
5 
ns 
(3) 


T11H 
HOLD Input Hold 
4 
ns 
(3) 


T12 
Input Setup 2 
8 
ns 
(3) 


T13 
Setup to ALE Inactive 
10 
ns 


T14· 
Hold after ALE Inactive 
8 
ns 


T15 
Reset Hold 
3 
ns 
(3) 


T16 
Reset Setup 
5 
ns 
(3) 


T17 
Reset Width 
1281 
ns 
41 CLK2 Periods Minimum 


NOTES: 
1. Clock 
rise and tall times 
are not tested. 


2. A float condition 
occurs 
when 
the maximum 
output 
current 
becomes 
less than 
ILO. Float delay is not tested; 
however, 
it 
should 
not be longer 
than the valid delay. 


3. LAD31:0, 
BADAC, 
HOLD, LOCK and READY 
are synchronous 
inputs. 
lAC/INTo, 
INT1, INT2/INTR 
and INT3 may be 
synchronous 
or asynchronous. 
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Table 8. 80960KB 
AC Characteristics 
(20 MHz) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK 


Tl 
Processor 
Clock Period (CLK2) 
25 
125 
ns 
VIN = 1.5V 


T2 
Processor 
Clock Low Time (CLK2) 
6 
ns 
VIL = 10% Point = 1.2V 


T3 
Processor 
Clock High Time (CLK2) 
6 
ns 
VIH = 90% Point = 0.1V + 0.5 vcc 


T4 
Processor 
Clock Fall Time (CLK2) 
10 
ns 
VIN = 90% Point to 10% Point(l) 


T5 
Processor 
Clock Rise Time (CLK2) 
10 
ns 
VIN = 10% Point to 90% Point(l) 


SYNCHRONOUS 
OUTPUTS 


T6 
Output Valid Delay 
2 
20 
ns 


T6H 
HLDA Output Valid Delay 
4 
23 
ns 


T7 
ALE Width 
12 
ns 


Ta 
ALE Output Valid Delay 
2 
18 
ns 


Tg 
Output Float Delay 
2 
20 
ns 
(2) 


T9H 
HLDA Output Float Delay 
4 
20 
ns 
(2) 


SYNCHRONOUS 
INPUTS 


A 


Tl0 
Input Setup 1 
3 
ns 
(3) 


Tll 
Input Hold 
5 
ns 
(3) 


TllH 
HOLD Input Hold 
4 
ns 
(3) 


T12 
Input Setup 2 
7 
ns 
(3) 


T13 
Setup to ALE Inactive 
10 
ns 


T14 
Hold after ALE Inactive 
8 
ns 


T15 
Reset Hold 
3 
ns 


T15 
Reset Setup 
5 
ns 


T17 
Reset Width 
1025 
ns 
41 CLK2 Periods Minimum 
. 


NOTES: 
1. Clock 
rise and fall times 
are not tested. 


2. A float condition 
occurs 
when 
the maximum 
output 
current 
becomes 
less than ILO.Float delay 
is not tested; 
however, 
it 
should 
not be longer 
than the valid delay. 


3. LAD31:0, BADAC, HOLD, LOCK and READY are synchronous 
inputs. lAC/INTo, INT1, INT2/INTR and INT3 may be 
synchronous 
or asynchroncous. 
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Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK 


T1 
Processor 
Clock Period (CLK2) 
20 
125 
ns 
VIN = 1.5V 


T2 
Processor 
Clock Low Time (CLK2) 
5 
ns 
VIL = 10% Point = 1.2V 


T3 
Processor 
Clock High Time (CLK2) 
5 
ns 
VIH = 90% Point = 0.1V + 0.5 vcc 


T4 
Processor 
Clock Fall Time (CLK2) 
10 
ns 
VIN = 90% Point to 10% Point(1) 


Ts 
Processor 
Clock Rise Time (CLK2) 
10 
ns 
VIN = 10% Point to 90% Point(1) 


SYNCHRONOUS 
OUTPUTS 


T6 
Output Valid Delay 
2 
18 
ns 


T6H 
HLDA Output Valid Delay 
4 
23 
ns 


T7 
ALE Width 
12 
ns 


Ta 
ALE Output Valid Delay 
2 
18 
ns 


T9 
Output Float Delay 
2 
18 
ns 
(2) 


T9H 
HLDA Output Float Delay 
4 
20 
ns 
(2) 


SYNCHRONOUS 
INPUTS 


T10 
Input Setup 1 
3 
ns 
(3) 


T11 
Input Hold 
5 
ns 
(3) 


T11H 
HOLD Input Hold 
4 
ns 


T12 
Input Setup 2 
7 
ns 


T13 
Setup to ALE Inactive 
8 
ns 


T14 
Hold after ALE Inactive 
8 
ns 


T1S 
Reset Hold 
3 
ns 


T16 
Reset Setup 
5 
ns 


TH 
Reset Width 
820 
ns 
41 CLK2 Periods Minimum 


Table 9. 80960KB 
AC Characteristics 
(25 MHz) 


NOTES: 
1. Clock 
rise and tall times 
are not tested. 
2. A float condition 
occurs 
when the maximum 
output 
current 
becomes 
less than 
ILO. Float delay is not tested; 
however, 
it 
should 
not be longer 
than the valid delay. 
3. LAD31:0, BADAC, HOLD, WCK 
and READY are synchronous 
inputs. lAC/INTo, INT1, INT2/INTR and INT3 may be 
synchronous 
or asynchronous. 


3-136 
I 


intel~ 
80960KB 


T, 


lOW 
lEVEL 
(MAX) 
0.8V 


90% 
HIGH LEVEL 
(M IN) 
0.55VCC 


t-------": 
T2 


I 


270565-15 


Figure 15. Processor 
Clock Pulse (CLK2) 


ClK2 


fiRST 


A 
B 
C 
D 
A 
••• 


ClK 
••• 


,;.---t- ••• ---I~.I 


T'7 --+--J 


~~-+-•••--+---+--+--+--~ 


RESET 


OUTPUTS 


INIT PARAMETERS 
(BADAC, 
INTo/lAC) 
MUST BE SET UP 8 CLOCKS 
.....J 


PRIOR TO THIS ClK2 
EDGE 


INIT PARAMETERS 
MUST BE HELD ---------- 
BEYOND THIS ClK2 
EDGE 


NOTES: 
T15 = RESET 
HOLD 


T16 = RESET 
SETUP 


Tu 
= RESET WIDTH 


270565-16 


Figure 16. RESET Signal Timing 
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1 
@@@@@@@@@@@@@@ 


2 
@@@@@@@@@@@@@@ 


3 
@@@@@@@@@@@@@@ 


4 
@@@ 
@@@ 


5 
@@@ 
@@@ 


6 
@@@ 
@@@ 


7 
@@@ 
@@@ 


8 
@@@ 
@@@ 


9 
@@@ 
@@@ 


10 
@@@ 
@@@ 


11 
@@@ 
@@@ 


12 
@@@@@@@@@@@@@@ 


13 
@@@@@@@@@@@@@@ 


14 
@@@@@@@@@@@@@@ 


A 
BeD 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
270565-17 


Figure 17. 132-Lead 
Pin-Grid Array (PGA) Package 
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1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 


P 
0 
0 
0 
0 
o 
0 
0 
0 
0 
0 
0 
0 
0 
0 
p 


VCC 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
VSS 
VCC 


N 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
N 


VSS 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 


~ 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
,0 0 
0 
0 
~ 


N.C. 
N.C. 
VSS 
VSS 
VCC 
N.C. 
N.C. 
N.C. 
N.C. 
VSS 
VCC 
N.C. 
VCC 
N.C. 


L 
0 
0 
0 
0 
0 
0 
L 


DEN 
N.C. 
VCC 
VSS 
N.C. 
N.C. 


K 
0 
0 
0 
0 
0 
0 
K 


BE3 
fAIL 
VSS 
VCC 
N.C. 
N.C. 


J 
0 
0 
0 
0 
0 
0 
J 


DT/R 
BE2 
VSS 
N.C. 
N.C. 
N.C. 


H 
0 
0 
0 
0 
0 
0 
H 


W/R 
BEa 
LOCK 
N.C. 
N.C. 
N.C. 


G 
0 
0 
0 
0 
0 
0 
G 


LAD30 
READY 
BE, 
N.C. 
N.C. 
N.C. 


f 
0 
0 
0 
0 
0 
0 
f 


LAD29 
LAD31 
CACHE 
N.C. 
N.C. 
N.C. 


E 
0 
0 
0 
0 
0 
0 
E 


LAD2S 
LAD26 
LAD27 
N.C. 
VSS 
N.C. 


° 
0 
0 
0 
0 
0 
0 
° 
ALE 
ADS 
HLDA 
VCC 
N.C. 
N.C. 


C 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
C 


HOLD 
LAD2S 
BADAC 
VCC 
VSS 
LAD20 
LAD'3 
LADS 
LAD3 
VCC 
VSS 
INT3 
INT, 
INTO 


B 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
B 


LAD23 
LAD24 
LAD22 
LAD21 
LAD,S 
LAD,S 
LAC, 2 
LAC, 0 
LAD6 
LAD2 
CLK2 
LADO 
RESET 
VSS 


A 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
A 


VCC 
VSS 
LAD'9 
LAD'7 
LAC, 6 
LAC, 4 
LAC, 1 
LAD9 
LA~ 
LADS 
LAD. 
LAD, 
INT2 
VCC 


1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
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Figure 18. 80960KB 
PGA Plnout-View 
from Bottom 
(Pins Facing Up) 
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14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 


P 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
p 


Vee 
Vss 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
Vee 


N 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
N 


N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
N.C. 
Vss 


\j 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
\j 


N.C. 
N.C. 
N.C. 
"cc 
Vss 
N.C. 
N.C. 
N.C. 
N.C. 
VCC 
Vss 
Vss 
Vee 
N.C. 


L 
0 
0 
0 
0 
0 
0 
L 


N.C. 
N.C. 
Vss 
"cc 
N.C. 
DEN 


K 
0 
0 
0 
0 
0 
0 
K 


N.C. 
N.C. 
Vee 
Vss 
rAIL 
BE3 


0 
0 
0 
Lt) 
0 
0 
0 
J 
C\I 
J 


N.C. 
N.C. 
N.C. 
I 
Yss 
BE2 
or/R 


H 
0 
0 
0 
III 
0 
0 
0 
H 


N.C. 
N.C. 
N.C. 
~ 
LOCK 
BEa 
W/R 
0 
X 


G 
0 
0 
0 
<0 x 
0 
0 
0 
G 
X 
X 
X 
N.C. 
N.C. 
N.C. 
0) 
X 
X x 
BE1 
READY LA030 
0 
X 
X 
X 
0 
0 
0 
CO 
x x x 
0 
0 
0 
r 
-c 
x x x 
r 


N.C. 
N.C. 
N.C. 
X 
X 
X 
CACHE 
LA031 
LA029 


E 
0 
0 
0 
• 
0 
0 
0 
E 


N.C. 
Vss 
N.C. 
LA027 
LA026 
LA028 
° 
0 
0 
0 
0 
0 
0 
° 
N.C. 
N.C. 
Vee 
HLOA 
ADS 
ALE 


C 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
C 
\ 
INTo 
INTl 
INT3 
Vss 
Vee 
LA03 
LA08 
LAI13 
LA020 
Yss 
Vee 
BAOAC LA025 
HOLD 


B 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
B 


Vss 
RESET 
LAOo 
CLK2 
LA02 
LA06 
LAl10 
LAI12 
LAI15 
LAI18 
LA021 
LA022 
LA024 
LA023 


A 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
A 


Vee 
INT2 
LAI1 
LA04 
LA05 
LA~ 
LADg 
LAI11 
LAI14 
LAI16 
LAI17 
LAI19 
Vss 
Vee 


- 


14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 


270565-19 


Figure 19. 80960KB 
PGA Plnout-Vlew 
from Top (Pins Facing Down) 


r:> 99 


Pin 1 


,- 
Pin 
67 
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Figure 20. 80960KB 
132-Lead 
Plastic Quad Flat-Pack 
(PQFP) Package 
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LADO 


LADl 


LAD2 


VSS 


LAD3 


LAD4 


LADS 


LAD6 


LAD7 


LAD8 


LAD9 


LAD10 


LADll 


LAD12 


VSS 


LAD13 


LAD14 


LAD15 


LAD16 


LAD17 


LAD18 


LAD19 


LAD20 


LAD21 


LAD22 


VSS 


LAD23 


LAD24 


LAD25 


8ADAC 


HOLD 


NC 


ADS 


64 


63 


62 


61 


60 


59 


58 


57 


56 


55 


54 


53 


52 


51 


50 


49 


48 


47 


46 


45 
" 
43 


42 


41 


40 
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104 
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110 


111 


112 
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114 


115 


116 


117 


118 


119 


120 


121 


122 


123 
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125 


126 
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lO 
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CC 
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Figure 21. PQFP Plnout-Vlew 
from Top 
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NC 


NC 


NC 


NC 


NC 


NC 


NC 


NC 


NC 


VSS 


"cc 


VCC 


NC 


Vss 


VSS 


NC 


NC 


NC 


NC 


NC 


NC 


NC 


NC 


NC 


Vss 


Vcc 


NC 


NC 


NC 


NC 


Vcc 


Vcc 


NC 
• 


AB 
LAD9 


A9 
LAD7 


A10 
LAD5 


A11 
LAD4 


A12 
LADl 


A13 
INT2/INTR 


A14 
VCC 


B1 
LAD23 


B2 
LAD24 


B3 
LAD22 


B4 
LAD2l 


B5 
LAD18 


B6 
LAD15 


B7 
LAD12 


B8 
LADlO 


B9 
LAD6 


B10 
LAD2 


B11 
GLK2 


B12 
LADO 


B13 
RESET 


B14 
VSS 


G1 
HOLD 


G2 
LAD25 


G3 
BADAG 


G4 
VCC 


G5 
VSS 


G13 
INTl 


G14 
iAC/lNTO 


D1 
ALE 


D2 
ADS 


D3 
HLDA 


D12 
VCC 


D13 
N.G. 


D14 
N.G. 


E1 
LAD28 


E2 
LAD26 


E3 
LAD27 


E12 
N.G. 


E13 
Vss 


E14 
N.G. 


F1 
LAD29 


F2 
LAD3l 


F3 
GAGHE 


F12 
N.G. 


F13 
N.G. 


F14 
N.G. 


G1 
LAD30 


G2 
READY 


G3 
BEl 


G12 
N.G. 


G13 
N.G. 


G14 
N.G. 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 
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J2 
BE2 


J3 
VSS 


J12 
N.G. 


J13 
N.G. 


J14 
N.G. 


K1 
BE3 


K2 
FAILURE 


K3 
VSS 


K12 
VCC 


K13 
N.G. 


K14 
N.G. 


L1 
DEN 


L2 
N.G. 


L3 
Vcc 


L12 
VSS 


L13 
N.G. 


L14 
N.G. 


M1 
N.G. 


M2 
VCC 


M3 
VSS 


M4 
VSS 


M5 
VCC 


M6 
N.G. 


M7 
N.G. 


M8 
N.G. 


M9 
N.G. 


- - 
N3 
N.G. 


N4 
N.G. 


N5 
N.G. 


N6 
N.G. 


N7 
N.G. 


N8 
N.G. 


N9 
N.G. 


N10 
N.G. 


N11 
N.G. 


N12 
N.G. 


N13 
N.G. 


N14 
N.G. 


P1 
VCC 


P2 
N.G. 


P3 
N.G. 


P4 
N.G. 


P5 
N.G. 


P6 
N.G. 


P7 
N.G. 


P8 
N.G. 


P9 
N.G. 


P10 
N.G. 


P11 
N.G. 


P12 
N.G. 


P13 
Vss 


P14 
VCC 


I 


Signal 
Pin 


ADS 
02 


ALE 
01 


BADAC 
C3 


BEo 
H2 


BEl 
G3 


BE2 
J2 


BE3 
K1 


CACHE 
F3 


CLK2 
B11 


DEN 
L1 


DT/R 
J1 


FAILURE 
K2 


HLDA 
03 


HOLD 
C1 


iAC/TN'fo 
C14 


INTl 
C13 


INT2/1NTR 
A13 


INT3/1NTA 
C12 


LADo 
B12 


LADl 
A12 


LAD2 
B10 


LAD3 
C9 


LAD4 
A11 


LAD5 
A10 


LAD6 
B9 


LAD7 
A9 


LAD8 
C8 


LAD9 
A8 


LADlO 
B8 


LADll 
A7 


LAD12 
B7 


LAD13 
C7 


LAD14 
A6 


Table 11.80960KB 
PGA Plnout-In 
Signal Order 


Signal 
Pin 


LAD15 
B6 


LAD16 
A5 


LAD17 
A4 


LAD18 
B5 


LAD19 
A3 


LAD20 
C6 


LAD2l 
B4 


LAD22 
B3 


LAD23 
B1 


LAD24 
B2 


LAD25 
C2 


LAD26 
E2 


LAD27 
E3 


LAD28 
E1 


LAD29 
F1 


LAD30 
G1 


LAD3l 
F2 


LOCK 
H3 


N.C. 
013 


N.C. 
014 


N.C. 
E12 


N.C. 
E14 


N.C. 
F12 


N.C. 
F13 


N.C. 
F14 


N.C. 
G12 


N.C. 
G13 


N.C. 
G14 


N.C. 
H12 


N.C. 
H13 


N.C. 
H14 


N.C. 
J12 


N.C. 
J13 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 
l 


Signal 
Pin 


N.C. 
J14 


N.C. 
K13 


N.C. 
K14 


N.C. 
L13 


N.C. 
L14 


N.C. 
M1 


N.C. 
M6 


N.C. 
M7 


N.C. 
M8 


N.C. 
M9 


N.C. 
M12 


N.C. 
M13 


N.C. 
M14 


N.C. 
N2 - 


N.C. 
N3 


N.C. 
N4 


N.C. 
N5 


N.C. 
N6 


N.C. 
N7 


N.C. 
N8 


N.C. 
N9 


N.C. 
N10 


N.C. 
N11 


N.C. 
N12 


N.C. 
N13 


N.C. 
N14 


N.C. 
P2 


N.C. 
P3 


N.C. 
P4 


N.C. 
P5 


N.C. 
P6 


N.C. 
P7 


N.C. 
P8 


80960KB 


Signal 
Pin 


N.C. 
P9 


N.C. 
P10 


N.C. 
P11 


N.C. 
P12 


N.C. 
L2 


READY 
G2 


RESET 
B13 


Vcc 
A1 


Vcc 
A14 


Vcc 
C4 


Vcc 
C10 


Vcc 
012 


Vcc 
K12 


Vcc 
L3 


Vcc 
M2 


Vcc 
M5 


Vcc 
M11 


Vcc 
P1 


Vcc 
P14 


Vss 
A2 


Vss 
B14 


Vss 
C5 


Vss 
C11 


Vss 
E11 


Vss 
J3 


Vss 
K3 


Vss 
L12 


Vss 
M3 


Vss 
M4 


Vss 
M10 


Vss 
N1 


Vss 
P13 


W/R 
H1 
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Pin 
Signal 


1 
HLDA 


2 
ALE 


3 
LAD26 


4 
LAD27 


5 
LAD28 


6 
LAD29 


7 
LAD30 


8 
LAD3l 


9 
VSS 


10 
GAGHE 


11 
W/R 


12 
READY 


13 
DT/R 


14 
BEo 


15 
BEl 


16 
BE2 


17 
BE3 


18 
FAILURE 


19 
Vss 


20 
LOGK 


21 
DEN 


22 
Vss 


23 
VSS 


24 
N.G. 


25 
N.G. 


26 
Vss 


27 
VSS 


28 
N.G. 


29 
VCC 


30 
Vcc 


31 
N.G. 


32 
VSS 


33 
VSS 


Table 12.80960KB PQFP Plnout-In PinOrder 


Pin 
Signal 


34 
N.G. 


35 
Vcc 


36 
Vcc 


37 
N.G. 


38 
N.G. 


39 
N.G. 


40 
N.G. 


41 
Vcc 


42 
Vss 


43 
N.G. 


44 
N.C. 


45 
N.G. 


46 
N.G. 


47 
N.G. 


48 
N.G. 


49 
N.G. 


50 
N.G. 


51 
N.G. 


52 
Vss 


53 
Vss 


54 
N.G. 


55 
Vcc 


56 
Vcc 


57 
Vss 


58 
N.G. 


59 
N.G. 


60 
N.G. 


61 
N.G. 


62 
N.G. 


63 
N.G. 


64 
N.G. 


65 
N.G. 


66 
N.G. 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 
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Pin 
Signal 


67 
VSS 


68 
VSS 


69 
N.G. 


70 
VCC 


71 
VCC 


72 
N.G. 


73 
VSS 


74 
VCC 


75 
N.G. 


76 
N.G. 


77 
N.G. 


78 
N.G. 


79 
VSS 


80 
VSS 


81 
N.G. 


82 
VCC 


83 
VCC 


84 
VSS 


85 
IAGIINTo 


86 
INTl 


87 
INT2/INTR 


88 
INT311NTA 


89 
N.G. 


90 
VSS 


91 
GLK2 


92 
Vcc 


93 
RESET 


94 
N.G. 


95 
N.G. 


96 
N.G. 


97 
N.G. 


98 
N.G. 


99 
VSS 


-intel~ 


Pin 
Signal 


100 
LADO 


101 
LADl 


102 
LAD2 


103 
VSS 


104 
LAD3 


105 
LAD4 


106 
LAD5 


107 
LAD6 


108 
LAD7 


109 
LAD8 


110 
LAD9 


111 
LAD10 


112 
LADll 


113 
LAD12 


114 
VSS 


115 
LAD13 


116 
LAD14 


117 
LAD15 


118 
LAD16 


119 
LAD17 


120 
LAD18 


121 
LAD19 


122 
LAD20 


123 
LAD2l 


124 
LAD22 


125 
VSS 


126 
LAD23 


127 
LAD24 


128 
LAD25 


129 
BADAG 


130 
HOLD 


131 
N.G. 


132 
ADS 


I 


intet~ 


Signal 
Pin 


ADS 
132 


ALE 
2 


BADAG 
129 


BEo 
14 


BE1 
15 


BE2 
16 


BE3 
17 


GAGHE 
10 


GLK2 
91 


DEN 
21 


DT/R 
13 


FAILURE 
18 


HLDA 
1 


HOLD 
130 


IAGIINTo 
85 


INT1 
86 


INT2"NTR 
87 


INT311NTA 
88 


LADO 
100 


LAD1 
101 


LAD2 
102 


LAD3 
104 


LAD4 
105 


LAD5 
106 


LAD6 
107 


LAD7 
108 


LAD8 
109 


LAD9 
110 


LAD10 
111 


LAD11 
112 


LAD12 
113 


LAD13 
115 


LAD14 . 
116 


Table 13. 80960KB 
PQFP Pinout-In 
Signal Order 


Signal 
Pin 


LAD15 
117 


LAD16 
118 


LAD17 
119 


LAD18 
120 


LAD19 
121 


LAD20 
122 


LAD21 
123 


LAD22 
124 


LAD23 
126 


LAD24 
127 


LAD25 
128 


LAD26 
3 


LAD27 
4 


LAD28 
5 


LAD29 
6 


LAD30 
7 


LAD31 
8 


LOGK 
20 


N.G. 
24 


N.G. 
25 


N.G. 
28 


N.G. 
31 


N.G. 
34 


N.G. 
37 


N.G. 
38 


N.G. 
39 


N.G. 
40 


N.G. 
43 


N.G. 
44 


N.G. 
45 


N.G. 
46 


N.G. 
47 


N.G. 
48 


NOTE: 
Do not connect 
any external 
logic to any pins marked 
N.C. 


I 


Signal 
Pin 


N.G. 
49 


N.G. 
50 


N.G. 
51 


N.G. 
54 


N.G. 
58 


N.G. 
59 


N.G. 
60 


N.G. 
61 


N.G. 
62 


N.G. 
63 


N.G. 
64 


N.G, 
65 


N.G. 
66 


N.G. 
69 


N.G. 
72 


N.G. 
75 


N.G. 
76 


N.G. 
77 


N.G. 
78 


N.G. 
81 


N.G. 
89 


N.G. 
94 


N.G. 
95 


N.G. 
96 


N.G. 
97 


N.G. 
98 


N.G. 
131 


READY 
12 


RESET 
93 


VCC 
29 


Vcc 
30 


Vcc 
35 


Vcc 
36 


80960KB 


Signal 
Pin 


Vcc 
41 


Vcc 
55 


VCC 
56 


Vcc 
70 


VCC 
71 


VCC 
74 


VCC 
82 


Vcc 
83 


VCC 
92 


Vss 
9 


Vss 
19 


Vss 
22 


Vss 
23 


Vss 
26 


Vss 
27 


Vss 
32 


Vss 
33 


Vss 
42 


Vss 
52 


Vss 
53 


Vss 
57 


Vss 
67 


Vss 
68 


Vss 
73 


Vss 
79 


Vss 
80 


Vss 
84 


Vss 
90 


Vss 
99 


Vss 
103 


Vss 
114 


Vss 
125 


W/R 
11 
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_.• 
_ 
. 
_0 
0 
,.._ 
•••••, ••••:::.•••••••• 
"" 
I 
_'""'.., 
vJA 
\.Iall 
U'=' 1,t1UU"-'t1U 
by adding a heatsink. For the PQFP, however, a 
heatsink is not generally used since the device is 
intended to be surface mounted. 


Figure 26 shows the maximum allowable ambient 
temperature for the 20 MHz extended temperature 
TA80960KB at various airflows. The curve assumes 
an tee of 420 mA, VCC of 5.0V and a TCASEof 
+ 125°C. 


Table 
14 80960KB 
PGA Package 
Thermal 
Characteristics 


Thermal 
Reslstance-°C/Watt 


Airflow-ft./mln 
(m/sec) 


Parameter 
0 
50 
100 
200 
400 
600 
800 
(0) 
(0.25) 
(0.50) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 


(J Junctton-to-Case 
2 
2 
2 
2 
2 
2 
2 


(J Case-to-Ambient 
19 
18 
17 
15 
12 
10 
9 


(No Heatsink) 


8U• t 
(J Case-to-Ambient 
16 
15 
14 
12 
9 
7 
6 


(Omnidirectional 
I 


8J-P1N 0 
-1 
t 'JC 
1 
Heatsink) 
II 
ll'HAP 


(J Case-to-Ambient 
15 
14 
13 
11 
8 
6 
5 
UUL 
IUUU 
(Unidirectional 
Heatsink) 
270565-22 


NOTES: 
1. This table applies to 80960KB 
PGA plugged into socket or soldered direct- 
Iy to board. 
2. (JJA = (JJC + (JCA 
3. (JJ-CAP = 4°C/W 
(approx.) 
(JJ-PIN = 4°C/W 
(inner pins) (approx.) 


(JJ-PIN = 8°C/W 
(outer pins) (approx.) 
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Table 
15. 80960KB 
PQFP Package 
Thermal 
Characteristics 


Thermal 
Resistance-°C/Watt 


Alrflow-ft.lmln 
(m/sec) 


Parameter 
0 
50 
100 
200 
400 
600 
800 
(O) 
(O.25) 
(O.50) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 


8 Junction-to-Case 
9 
9 
9 
9 
9 
9 
9 


8 Case-to-Ambient 
22 
19 
18 
16 
11 
9 
8 
(No Heatsink) 


NOTES: 
1. This table applies 
to S0960KB 
PQFP soldered 
directly 
to board. 
2. 9JA = 9JC + 9CA 
3. 9JL = 1soC/W 
(approx.) 
9JB = 1S"C/W 
(approx.) 
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Figure 22. HOLD Timing 
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Figure 23. 16 MHz Maximum 
Allowable 
Ambient 
Temperature 
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Figure 24. 20 MHz Maximum 
Allowable 
Ambient 
Temperature 
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• 
PQfP 
0 
PGA with no 
• 
PGA with omni- 
0 
PGA with 
uni- 
heatsink 
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heatsink 
directional 
heotsink 


Figure 25. 25 MHz Maximum 
Allowable 
Ambient 
Temperature 
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I 


• 
PQfP 
0 
PGA with no 
• 
PGA with omni- 
heatsink 
directional 
heatsink 


Figure 26. Maximum 
Allowable 
Ambient 
Temperature 
for the 
Extended 
Temperature 
TA-8096,oKB 
20 MHz In PGA Package 
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Figure 27. Non-Burst 
Read and Write Transactions 
without 
Wait States 
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Figure 28. Burst Read and Write Transaction 
without 
Wait States 
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Figure 29. Burst Write Transaction 
with 2.1.1.1 
Wait States 
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Figure 30. Accesses Generated by Quad Word Read Bus Request, 
Misaligned Two Bytes from Quad Word Boundary (1, 0, 0, 0 Wait States) 
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NOTE: 
INTR can go low no sooner 
than the input hold time following 
the beginning 
of interrupt 
acknowledgment 
cycle 
1. For a 
second 
interrupt 
to be acknowledged. 
INTR must be low for at least three 
cycles 
before 
it can be reasserted. 


Figure 31. Interrupt 
Acknowledge 
Transaction 
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3.6 
Revision History 


No revision 
history was maintained 
in earlier revisions 
of this data sheet. All errata that has been identified 
to 
date is incorporated 
into this revision. 
The sections 
significantly 
changed 
since the previous 
revision 
are: 


Section 
Last 
Description 
Rev. 


Table 4. 80960KB 
Pin Description: 
L-Bus 
-006 
LOCK pin description 
rewritten 
for clarity. 


Signals (pg. 10) 


2.3. Connection 
Recommendations 
-006 
Changed 
suggested 
open-drain 
termination 
networks 
(pg. 13) 
to reflect more realistic operating 
conditions 
with 
reduction 
in DC power consumption. 


Figure 9. Typical Current vs. Frequency 
-006 
Added figure for typical power supply current at hot 
(Hot Temp) (pg. 15) 
temperature 
to aid thermal 
analysis. 


Figure 12. Test Load Circuit for Three- 
-006 
All outputs 
now specified 
with standard 
50 pF test 
State Output Pins (pg. 16) 
loads to agree with actual test methodology. 


Figure 13. Test Load Circuit for Open- 
Drain Output Pins (pg. 16) 


3.1. DC Characteristics 
(pg. 17) 
-006 
'cc max specification 
reduced: 


WAS: 
IS: 
AT: 
375mA 
315mA 
16MHz 
420mA 
360mA 
20MHz 
480mA 
420mA 
25MHz 


3.2. AC Specifications 
(pg. 18) 
-006 
25 MHz operation 
extended 
to product 
in PQFP 
package. 
T8 min. improved 
at all frequencies 
from 
o ns to 2 ns and T8 max. improved 
from 20 ns to 
18 ns. 


T8H max improvement: 


WAS: 
IS: 
AT: 
31 ns 
28 ns 
16MHz 
26 ns 
23 ns 
20MHz 
24ns 
23 ns 
25MHz 


Functional 
Waveforms 
-006 
Redrawn for clarity. CLK signal drawn with more likely 
phase relationship 
to CLK2. Open-drain 
output 
signals drawn to show correct 
inactive states. 


Various 
-006 
Deleted all references 
to 10 MHz. Intel no longer 
offers a 10 MHz 80960KB 
device. 
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80960CA 
PRODUCT OVERVIEW 


1.0 PURPOSE 


The 80960CA 
Product 
Overview is a summary of the 
features and operation of Intel's 80960CA Embedded 
Processor. The Product Overview is intended for those 
who are not familiar with the 80960 architecture or the 
8096OCA,a product built around this architecture. The 
80960CA Product Overview provides a programmer or 
a system designer with a quick, global view of software 
and hardware design considerations for the 8096OCA. 
For further information, refer to the following refer- 
ence documents: 
- 
The 80960CA 
User's Manual 
contains detailed tech- 
nical information and examples for designing em- 
bedded systems using the 8096OCA. 
- 
The 80960CA 
Data Sheet provides electrical specifi- 
cations for the device, such as the DC and AC pa- 
rameters, operating conditions, and packaging spec- 
ifications. 


2.0 80960CA 
32-BIT 
EMBEDDED 
PROCESSOR 


The 80960CA (Figure 2-1) is optimized for embedded 
processing applications. This product features the high- 
performance c-Series core plus built-in system periph- 
erals, effectively integrating a high-speed CPU and sys- 
tem components onto a single silicon die. The 80960CA 
is a member of Intel's 80960 embedded processor fami- 
ly. Each member of the 80960 family is based on a 
common architectural definition referred to as the core 
architecture. 


An 80960 family member, such as the 8096OCA, is 
made up of an implementation of the core architecture 
plus application-specific extensions. These extensions 
may consist of integrated peripherals, instruction-set 
extensions, or additional registers and caches beyond 
those defmed by the architecture. The common core 
architecture provides a basis for code compatibility for 
all 80960 family products, while application-specific ex- 
tensions optimize a particular product for a class of 
applications. 


The 80960 architectural target is the execution of mul- 
tiple instructions per clock (i.e., fractional clocks per 
instruction). 
By defming an architecture 
which sup- 


ports parallel instruction execution and out-of-order in- 
struction execution, performance advances are not con- 
strained by the system clock. 


The 80960CA is capable of launching and executing 
instructions in parallel. This is accomplished by the use 
of advanced silicon technology as well as innovative 
"microarchitectural" 
constructs. The term microarchi- 
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tecture refers to the implementation of the instruction 
set and programming resources. For example, different 
microarchitectures 
may have different pipeline con- 
struction, internal bus widths, register set porting, de- 
grees of parallelism, and cache parameterization (two- 
way, four-way, etc.). 


A principal objective of the 80960 architecture is to 
provide the framework to allow microarchitectutal 
ad- 
vances to translate directly into increased performance 
without architectural limitations. 


270669-2 


Figure 2-1. 80960CA 


2.1 80960 Architecture 


Embedded applications are cost sensitive, require a dif- 
ferent mix of instructions than reprogrammable appli- 
cations, have demanding interrupt 
response require- 
ments, and often use real-time executives rather than 
full-blown operating systems. The 80960 architecture 
was developed with these factors in mind. Several key 
optimizations which are provided by the architecture 
are explained below. 


Instruction Set: Powerful Boolean operations are pro- 
vided. Frequently executed functions are available as 
single instructions for greater code density and per- 
formance. Call, Return, Compare-and-Branch, 
Condi- 
tional-Compare, 
Compare-and-Increment 
or 
Decre- 
ment, and Bit-Field-Extract 
are each single instruc- 
tions. 


Interrupts: A priority interrupt structure simplifies the 
management of real-time events. With 31 discrete levels 
of priority and 248 possible interrupt-handling 
proce- 
dures, this structure provides the low latency and high 
throughput 
interrupt handling required in embedded 
processor applications. 


,Faults: A generalized fault-handling mechanism simpli- 
fies the task of detecting errant arithmetic calculations 
or other conditions that typically require a significant 
amount of in-line user code. 


Application-Specific Extensions: The core architecture 
is designed to accept application-specific extensions 
such as instruction set extensions (e.g., string functions, 
floating point), special purpose registers, larger caches, 
on-chip program and data memory, a memory manage- 
ment and protection unit, fault-tolerance support, mul- 
tiprocessing support, and real-time peripherals (DMA, 
serial ports, etc.). 


2.2 80960 C-Series Core 


The C-series core is an implementation of the 80960 
core architecture. The core can execute instructions at 
a sustained speed of 66 MIPS(I) with bursts of perform- 
ance up to 99 MIPS. To achieve this level of perform- 
ance, Intel 
has incorporated 
state-of-the-art 
silicon 
technology 
and 
innovative 
microarchitectural 
con- 
structs into the C-Series core. Factors which contribute 
to the core's performance are listed below. 
Parallel instruction decoding allows the 80960CA 
to start two instructions in every clock, with bursts 
of three instructions per clock. 
Most instructions execute in a single clock cycle. 
Multiple independent execution units enable over- 
lapping instruction execution. 
Advanced silicon technology allows operation with 
a 33 MHz internal clock. 
- 
Efficient instruction pipeline is designed to mini- 
mize pipeline break losses. 
Register and resource scoreboarding transparently 
manage parallel execution. 
Branch look-ahead feature enables branches to exe- 
cute in parallel with other instructions. 
Local register cache is integrated on-chip. 
1 Kbyte two-way set associative instruction cache is 
integrated on-chip. 


1 Kbyte Static Data RAM is integrated on-chip. 


These factors combine to make the 80960CA an ultra- 
high performance computing engine. 


NOTE: 
1. Single clock instructions at 33 MHz. 


2.3 
80960CA System Peripherals 


The 80960CA features several extensions to the core 
architecture 
in the form of integrated 
peripherals. 
These peripherals are intended to reduce the external 
system requirements needed for embedded applications. 
These peripherals are described below. 
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Bus Controller Unit: A 32-bit high-performance bus 
controller interfaces the 80960CA to external memory 
and peripherals. The bus controller transfers instruc- 
tions or data at a maximum rate of 132 Mbytes per 
second'(2) Internally programmable wait states and 16 
separately configurable memory regions allow the bus 
controller to interface with a variety of memory subys- 
tems with minimum system complexity and maximum 
performance. 


DMA Controller: A four channel DMA controller per- 
forms high speed data transfers between peripherals 
and memory. The DMA controller provides advanced 
features such as data chaining, byte assembly and disas- 
sembly, and a fly-by mode capable of transfer speeds of 
up to 66 Mbytes per second. The DMA controller fea- 
tures a performance and flexibility which is only.possi- 
ble 
by 
integrating 
the 
DMA 
controller 
and 
the 
80960CA core. 


Interrupt 
Controller: 
A priority interrupt 
controller 
manages 8 external interrupt inputs, 4 internal inter- 
• 
rupt sources from the DMA controller, and a single 
non-maskable interrupt input (NMI). A total of 248 
external interrupt sources are supported by the inter- 
rupt controller by configuring the 8 external interrupt 
pins as an 8-bit input port. The interrupt controller pro- 
vides the mechanism for the low latency and high 
throughput interrupt service featured by the 8096OCA. 
The interrupt latency for the 80960CA is typically less 
.than 1 I-I-s. 


3.0 
EXECUTION 
ENVIRONMENT 


The Execution Environment (Figure 3-1) refers to the 
resources which are available for executing code on the 
8096OCA.The following sections describe the elements 
of the execution environment. 


3.1 
Registers and Literals 


The 80960CA provides four types of working data reg- 
isters: Global Registers, Local Registers, Special Func- 
tion Registers (SFRs), and Control Registers. 


Global and local registers are general purpose 32-bit 
data registers. The SFRs and the control registers pro- 
vide a programmer's interface to the on-chip peripher- 
als (i.e., the DMA controller, interrupt controller, and 
bus controller). 


NOTE: 
2. 33 MHz internal clock, load or instruction fetch on 
o wait state, pipelined burst bus. 
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Figure 3-1. Execution Environment 


The 80960 architecture is a register-oriented architec- 
ture. That is, operands and results of instructions are 
placed in working data registers rather than in memory. 
Since the architecture 
is register oriented, an ample 
supply of registers is provided. The architecture's work- 
ing register set consists of 16, 32"bit global registers and 
16, 32-bit local registers. 


3.1.1 GLOBAL 
AND 
LOCAL 
REGISTERS 


The procedure call and return mechanism, which is 
part of the 80960 architecture, inspires the names given 
to the local and global registers. When a procedure call 
or return is executed, the contents of global registers 
are preserved across procedure boundaries. In other 
words, the same set of global registers is used for each 
procedure. A new set of local registers, however, is allo- 
cated for each procedure. The 80960's call and return 
mechanism is explained in Section 3.8. 


The 80960CA supplies 16, 32-bit global registers desig- 
nated gO through g15. Registers gO through g14 are 
general purpose global registers. Register g15 is re- 
served for the current Frame Pointer. This register is 
available in assembly language as the fp register. The fp 
contains the address of the first byte in the current 
stack frame. The fp register and the stack frame are 
described in Section 3.8. 
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The 80960CA supplies 16, 32-bit Local Registers desig- 
nated rOthrough r15. Registers r3 through r15 are gen- 
eral purpose local registers. Registers rO, rt, and r2 are 
reserved for special functions as follows: rOcontains the 
Previous Frame Pointer, rl contains the Stack Pointer, 
and r2 is reserved for the Return Instruction Pointer. 
These registers are available in assembly language as, 
respectively, the pfp, sp, and rip registers. The pfp, sp, 
and rip registers manage. stack frame linkage for the 
80960's procedure 
call and return mechanism. 
The 
function of these registers is decribed in Section 3.8. 


3.1.2 SPECIAL 
FUNCTION 
REGISTERS 
AND 
CONTROL 
REGISTERS 


The 80960CA uses 3 Special Function Registers (SFRs) 
for communicating 
with on-chip peripherals. 
These 
SFR's are an architectural 
extension specific to the 
8096OCA.The SFRs on the 80960CA are designated as 
sft), sft, and sfl. SFRs are accessed as source operands 
by most of the 8096OCA's instructions. The registers 
serve as part of the programmer's 
interface to the 
DMA and interrupt controller. 


Control registers, like SFRs are used to communicate 
with the on-chip peripherals. Configuration informa- 
tion for the peripherals is generally stored in these reg- 
isters. Control registers can only be accessed by using 
the system control (sysctI) instruction. The sysctI in- 
struction is used to load the internal control register 
from a table in external memory called the control ta- 
ble. In order to simplify the process of peripheral con- 
figuration, the control registers are automatically load- 
ed from this table at initialization. 


3.1.3 LITERALS 


The 80960CA provides literals which may be used in 
the place of source register operands in most instruc- 
tions. The literals range from 0 to 31 (5 bits). When a 
literal is used as an operand, the processor expands it to 
32 bits by adding leading zeros. If the instruction de- 
fines an operand larger than 32 bits, the processor zero 
extends the literal to the operand size. 
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3.2 Address Space and Memory 


The address space of the 80960CA (Figure 3-2) is con- 
sidered a subset of the execution environment since the 
code, data, data structures, and external peripherals for 
the processor reside here. The 80960 family has an ad- 
dress space which is 232 bytes (4 Gbytes) in size. This 
address space is linear (unsegmented); therefore, code, 
data, and peripherals may be placed anywhere in the 
usable space. For the 8096OCA, some memory loca- 
tions are reserved or are assigned special functions as 
shown in Figure 3-2. 


3.2.1 INTERNAL 
DATA 
RAM 


The 80960CA provides 1 Kbyte of internal static RAM 
for fast access of frequently used data. The data RAM 
allows time critical data storage and retrieval, with no 
dependence on the performance of the external bus. 
Any 
load 
or 
store, 
including 
quad-word 
opera- 


ADDRESS 


00000000 


0000003F 
00000040 


OOOOOOBF 
OOOOOOCO 


OOOOOOFF 
00000100 


000003FF 
00000400 


FEFf FFFF 
FFOO0000 


FFFF FEFF 
FFFf FFOO 


FFFf Ff2C 
FFFf FF2D 


FFFF FfFF 
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H 
Interrupt 
Vectors 
(optional) 


H 
(Internal 
SRAIol) 


H 
DIolARegisters 
(optional) 
(Internal 
SRAIol) 
H 
H 
Dota RAIol(Internal 
SRAIol, 


User Write Protected) 


H 
H 
Dota RAIol(Internal 
SRAIol. 


Programmable 
User 
Write Protection) 
H 
H 


Code/Dota 


OOC~ 
Archltecturally 
OOC~ 
Defined 
Dota 
Structures 
(External 
Iolemory) 


H 
H 


OOC~ 
Reserved 
·OOC~ 
H 
H 
Initialization 
Boot Record 


H 
(External 
Iolemory) 


H 
Reserved 
H 


o 


64 


192 


256 


1024 


232 -1 (4 Gbytes) 


Figure 3·2. Address 
Space 
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- 
- 
- 
structures 
are 
isolated 
from 
user 
code. 
As 
shown 
in 
Figure 
3-2, the first 256 bytes of data RAM 
are always 
write 
protected 
when 
a program 
is executing 
in user 
mode 
but 
may 
always 
be written 
when 
executing 
in 
supervisor 
mode. 
The remainder 
of the data RAM 
can 
be programmed 
for this 
protection 
feature. 
The 
user 
and supervisor 
modes 
are described 
further 
in Section 
3.7. 


3.2.2 RESERVED 
ADDRESS 
SPACE 


The 
upper 
16 
Mbytes 
of 
memory 
(FFOOOOOOH- 
FFFFFFFFH) 
are reserved 
for specific 
functions 
and 
extensions 
to the 80960 architecture. 
The 
12 words 
in 
reserved 
space (FFFFFFOOH-FFFFFF2CH) 
are used 
to start 
up the processor 
when 
it comes 
out of reset. 
These 
12 words 
are called the initialization 
boot record. 


3.2.3 ARCHITECTURALLY 
DEFINED 
DATA 
STRUCTURES 


To execute 
a program 
on the 8096OCA, data structures 
specific 
to the 
80960 
architecture 
must 
reside 
in the 
processor's 
address 
space. 
Architecture-defined 
data 
structures 
include 
stacks, 
initialization 
structures, 
and 
various 
procedure 
entry 
tables. 
These 
data 
structures 
may 
generally 
be 
located 
anywhere 
in 
the 
address 
space. 
Pointers 
to each 
data 
structure 
are 
specified 
when 
the 
80960CA 
is initialized. 
The architecture-de- 
fined data structures 
include: 


Interrupt 
Table 


System-Procedure 
Table 


Fault 
Table 


User Stack 


Interrupt 
Stack 


Supervisor 
Stack 


In addition 
to the data 
structure 
defined 
by the archi- 
tecture, 
the 80960CA 
requires 
several 
implementation- 
specific data 
structures 
which 
are used for configuring 
peripherals 
and initialization. 
These 
data structures 
in- 
clude: 


Control 
Table 


- 
Process 
Control 
Block 


- 
Initialization 
Boot Record 


Each 
data 
structure 
will be explained 
in more 
detail 
later in this product 
overview. 
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the 
MhMH 
tormat, 
the 
ottset 
IS an Integer 
(called 
a 
displacement) 
ranging 
from 
-231-1 
to 231. An assem- 
bler will choose the MEMA 
or MEMB 
format 
based on 
the size of the offset. 


Register-indirect 
addressing 
modes 
use a 32-bit ordinal 
value in a register 
as the base for the address 
calcula- 
tion. Offsets and indexes 
are added 
to this address 
base 
depending 
on the particular 
addressing 
mode. 
The reg- 
ister-indirect-with-index 
addressing 
mode adds a scaled 
index 
to the address 
base. The 
index 
is specified 
as a 
value in a register. 
The scale value may be selected 
as I, 
2, 4, 8, or 16. 


The 
index-with-displacement 
addressing 
mode 
uses 
a 
scaled 
index plus an integer 
displacement. 
No address 
base is used in this address 
calculation. 


The IP-with-displacement 
addressing 
mode is used with 
load and store instructions 
to make them IP relative. 
In 
this mode, an integer 
displacement 
plus a constant 
of 8 
is added 
to the 
IP of the 
instruction 
to calculate 
the 
next address. 


Table 3-1. Memory 
Addressing 
Modes 


Mode 
Description 


Absolute 
Offset 
Offset 


Absolute 
Displacement 
Displacement 


Register 
Indirect 
Abase 


Register 
Indirect with 
Abase + Offset 
Offset 


Register 
Indirect with 
Abase + (Index'Scale) 
Index 


Register 
Indirect with 
Abase + (Index' Scale) 
Index and Displacement 
+ Displacement 


Index with Displacement 
(Index' Scale) + 
Displacement 


Register 
Indirect with 
Abase + Displacement 
Displacement 


IP with Displacement 
IP + Displacement 
+ 8 
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3.4 Data Types 


The 80960CA operates on the following data types (Figure 3-3): 
- 
Integer (8, 16, 32, and 64 bits) 
- 
Ordinal (8, 16, 32, and 64 bits) 


- 
Bit 
- 
Bit Field 


- 
Triple Word (96 bits) 
- 
Quad Word (128 bits) 


~ 
31 
L-...J 
0 


LENGTH 
(1 TO 32 
BITS) .-t 
16 I 
BITS 
15 


B~IBYTEI 


7 
0 


SHORT I 
o 


32 I 
WORDI 
BITS 
••••• 
-"", 


31 
0 
;~I~ 
L_O_NG~I 


~~I~ 
~I~ 
~ 
ffi_IP_L_E_W_O_RD~1 
~~I~ 
.....II 
..••.. 
O_U_A_D_W_O~RDI 
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Class 
Data Type 
Length 
Range 


Numeric 
Byte Integer 
8 bits 
-27 
to 27 - 
1 
(Integer) 
Short Integer 
16 bits 
-215 
to 215 - 
1 
Integer 
32 bits 
-231 
to 231 - 
1 
Long Integer 
64 bits 
-263 
to 263 - 
1 


Numeric 
Byte Ordinal 
8 bits 
Oto28 
- 
1 
(Ordinal) 
Short Ordinal 
16 bits 
Oto216 
- 
1 
Ordinal 
32 bits 
Oto232 
- 
1 
Long Ordinal 
64 bits 
Oto264 
- 
1 


Non-Numeric 
Bit 
t-bit 
Bit Field 
1-32 
bits 
N/A 
Triple Word 
96 bits 
Quad Word 
128 bits 


Figure 3-3. Data Types 
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The following sections .describe the data types support- 
ed by the 8096OCA. 


3.4.1 NUMERICDATA TYPES 


Integers and ordinals are considered numeric 
data 
types since the processor performs arithmetic opera- 
tions with this data. The integer data type is a signed 
binary value in standard 2's complement representa- 
tion. The ordinal data type is an unsigned binary value. 


3.4.2 NON-NUMERICDATA TYPES 


The remaining data types (bit field, triple word, and 
quad word) represent groupings of bits or bytes that the 
processor can operate on as a whole, regardless of the 
nature of the data contained in the group. These data 
types facilitate the moving of blocks of bits or bytes. 


3.5 Instruction Set 


The 80960CA features a comprehensive instruction set 
(Table 3-2). Much of the instruction set is that of a 
RISC architecture. Unlike pure RISC machines, how- 
ever, the 80960CA provides an extension to the RISC 
instruction set with instructions that perform complex 
functions such as procedure calls and returns, high- 
speed multiplies, and other complex control, arithme- 
tic, and logical operations. The instruction set allows 
functionally complex yet highly compact code to be 
written for embedded control applications where mem- 
ory is a valuable commodity. 


3.5.1 INSTRUCTIONGROUPS 


The 80960CA instruction set is most easily described if 
grouped by the functions listed below: 


- 
Data Movement 
- 
Address Computation 
- 
Logical and Arithmetic 
- 
Bit and Bit Field 
Comparison 
Branch 
Call and Return 
Fault 
Debug 
Processor Management 


The instructions which make up each of these groups 
are described in the following sections. 
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3.5.1.1 Data Movement Instructions 


The data movement instructions move data from mem- 
ory to registers, from registers to memory, and between 
registers. The load instructions copy bytes, words, or 
multiple words from memory to a selected register or 
group of registers. Conversely, the store instructions 
copy bytes, words, or groups of words from a selected 
register or group of registers to memory. The move in- 
structions copy data between registers. 


Load Instructions 
loadword 
loadordinalbyte 
loadordinalshort 
load integerbyte 
load integershort 
load long 
loadtriple 
loadquad 


-Id 


, -Idob 
-ldoS 
-Idlb 
-Idls 
-Idl 
-Idt 
-Idq 


Store Instructions 
- st 
storeword 
- stob 
store ordinalbyte 
- stos 
store ordinalshort 
- stlb 
store integerbyte 
- stls 
store integershort 
- stl 
store long 
- stt 
storetriple 
- stq 
store quad 


Move Instructions 
- mov 
moveword 
- movl 
movelong 
- movt 
movetriple 
- movq 
movequad 


3.5.1.2 Address Computation Instructions 


The load address (lda) instruction causes a 32-bit ad- 
dress to be computed and placed in a destination regis- 
ter. The address is computed based on the addressing 
mode selected. The load and store instructions perform 
a function identical to that of the lda instruction when 
calculating a source or destination address. The lda in- 
struction is useful for loading a 32-bit constant into a 
register. 


3.5.1.3 Logical and Arithmetic Instructions 


Logical instructions 
perform bitwise Boolean opera- 
tions on operands in registers. Since this group of in- 
structions performs only bitwise manipulations of data. 
separate logical instructions 
for. integer and ordinal 
data types do not exist. In the table below, srcl and 
src2 represent processor registers or literals which are 
the operands for these instructions. 
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Table 3-2. Instruction 
Set Summary 


Data 
Arithmetic 
Logical 
Bit and 
Movement 
Bit Field 


Load 
Add 
And 
Set Bit 


Store 
Subtract 
Not And 
Clear Bit 
Move 
Multiply 
And Not 
Not Bit 
Divide 
Or 
Check Bit 
Remainder 
Exclusive 
Or 
Alter Bit 
Modulo 
Not Or 
Scan for Bit 
Scan for Byte 
Shift 
Or Not 
Span over Bit 
Extended 
Nor 
Extract 
Shift 
Exclusive 
Nor 
Modify 
Extended 
Not 
Multiply 
Nand 
Extended 
Rotate 
Divide 
Add with 
Carry 
Subtract 
with 
Carry 
( 


Comparison 
Branch 
Call and 
Fault 
Return 


Compare 
Unconditional 
Call 
Conditional 
Condition 
Branch 
Call Extended 
Fault 
Compare 
Conditional 
Call System 
Synchronize 
Compare 
and 
Branch 
Return 
Faults 
Increment 
Branch and 
. 


Compare 
and 
Link 
Decrement 
Condition 
Condition 
Test 
Compare 
and Conditional 
Branch 


Debug 
Processor 
Address 
Atomic 
Management 
Computation 


Modify Trace 
Modify 
Load Address 
Atomic Add 
Controls 
Process 
Atomic 
Modify 
Mark 
Controls 
Force Mark 
Modify 
Arithmetic 
Controls 
System Control 
Update DMA 
Setup DMA 
Flush Local 
Registers 
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ply, divide, and shift operations on integer or ordinal 
operands in registers. 


Arithmetic Instructions 
- addI 
addinteger 
- addo 
addordinal 
- sublsubtract 
integer 
- subo 
subtractordinal 
- mull 
multiplyinteger 
- mulo 
multiplyordinal 
- dlvl 
divideinteger 
- dlvo 
divideordinal 
- reml 
remainderinteger 
- remo 
remainderordinal 
- modi 
modulointeger 
- rotate 
rotatebitleft 
- shll 
shiftleftinteger 
- shlo 
shiftleftordinal 
- shrl 
shiftrightinteger 
- shro 
shiftrightordinal 
- shrdl 
shiftrightdividinginteger 


Extended arithmetic instructions facilitate computation 
on ordinals and integers which are longer than 32 bits. 
In add with carry and subtract with carry instructions, 
the carry out from the previous arithmetic instruction 
is used in the computation. The extended multiply in- 
struction multiplies two, ordinal source operands pro- 
ducing a long ordinal result (64 bits). The extended 
divide instruction divides a long ordinal dividend by an 
ordinal divisor and produces a 64-bit result. The ex- 
tended shift right instruction shifts a 64-bit source val- 
ue and produces the lower order 32 bits of the shifted 
value. 


ExtendedArithmetic Instructions 
- addc 
addordinalwithcarry 
- subc 
subtractordinalwithcarry 
- emuI 
extendedmultiply 
- ediv 
extendeddivide 
- eshro 
shiftrightextendedordinal 
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3.5.1.4Bit and Bit Field Instructions 


The bit instructions operate on a specified bit in a regis- 
ter. 


Bit Instructions 
- setblt 
setbit 
- clrblt 
clearbit 
- notblt 
notbit 
- alterblt 
alterbit 
- scanbit 
scanfor bit 
- spanbit 
spanoverbit 


Bit field instructions operate on a specified contiguous 
group of bits in a register. This group of bits can be 
from 0 to 32 bits in length. 


Bit Field Instructions 
- extract 
extractfield 
- modify 
modifyfield 
- scanbyte 
scanfor byte 


3.5.1.5Branch Instructions 


The branch instructions allow the direction of program 
flow to be changed by explicitly modifying the Instruc- 
tion Pointer 
(JP). The target IP in a branch instruction 
is generally specified as a displacement to be added to 
the current IP. The extended branch instructions allow 
IP calculation using any addressing mode. 


The unconditional branch instructions always alter pro- 
gram flow when executed. 


Unconditional Branch 
Instructions 
- b 
branch 
- bx 
branchextended 


The RISe branch-and-link 
instructions automatically 
save a Return 
Instruction 
Pointer (RIP) before the 
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jump 
is taken. 
The RIP is the address 
of the instruction 
following 
the branch 
and link. 


Branch and Link Instructions 


- bal 
branch and link 
- balx 
branch and link extended 


Conditional 
branch 
instructions 
alter 
program 
flow 
only if the condition 
code flags in the arithmetic 
control 
register 
match 
a value specified 
in the instruction. 
The 
condition 
code flags indicate 
conditions 
of equality 
or 


inequality 
between 
two operands 
in a previously 
execut- 
ed instruction. 
The arithmetic 
control 
register 
and con- 
dition 
code flags are described 
in Section 
3.6. 


Based 
on a branch 
prediction 
flag 
located 
in the ma- 


chine 
level instruction, 
the 80960CA 
will assume 
that 
an instruction 
usually 
takes 
or does not take 
a condi- 
tional branch. 
By executing 
along the predicted 
path of 
program 
flow, delays 
due to breaks 
in the instruction 
stream 
are often avoided. 
This feature 
of the 80960CA 


is referred 
to as branch prediction. 
The 80960CA 
incor- 
porates 
the branch 
prediction 
feature 
because 
code us- 
ing a conditional 
branch 
instruction 
usually 
favors 
a 
single direction 
of program 
flow. 


The branch 
prediction 
flag is specified 
at the assembly 
level by appending 
a .t or .f to a conditional 
branch 


instruction 
meaning, 
respectively, 
"assume 
branch 
tak- 
en" 
or "assume 
branch 
not taken". 
For 
example, 
the 
assembler 
mnemonic 
be.t means that the processor 
will 
assume 
that 
this 
branch-if-equal 
instruction 
usually 
branches 
when 
encountered. 
In the following 
table 
.p 
represents 
the branch 
prediction 
flag. 


Conditional 
Branch Instructions 


- be.p 
branch if equal 
- bne.p 
branch if not equal 
- bl.p 
branch if less 
- ble.p 
branch if less or equal 
- bg.p 
branch if greater 
- bge.p 
branch if greater or equal 
- bo.p 
branch if ordered 
- bno.p 
branch if unordered 


Compare 
and conditional 
branch 
instructions 
compare 
two operands, 
then branch 
according 
to the immediate 
results. 


Conditional 
Compare 
and 
Conditions 
Branch Instructions 


- cmplbe.p 
compare 
integer 
and branch if 
equal 


- cmplbne.p 
compare 
integer 
and branch if 
not equal 
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-cmplbl.p 
compare 
integer 
and branch if less 
-cmplble.p 
compare 
integer 
and branch if less 
or equal 
-cmplbg.p 
compare 
integer 
and branch if 
greater 
-cmplbge.p 
compare 
integer 
and branch if 
greater or equal 
-cmplbo.p 
compare 
integer 
and branch if 
ordered 
-cmplbno.p 
compare 
integer 
and branch if 
unordered 
-cmpobe.p 
compare 
ordinal 
and branch if 
equal 
-cmpobne.p 
compare 
ordinal 
and branch if 
not equal 
-cmpobl.p 
compare 
ordinal 
and branch if less 
-cmpoble.p 
compare 
ordinal 
and branch if less 
or equal 
-cmpobg.p 
compare 
ordinal 
and branch if 
greater 
-cmpobge.p 
compare 
ordinal 
and branch if 
greater or equal 
- bbs.p 
check bit 
and branch 
if set 
- bbc.p 
check bit 
and branch 
if clear 


3.5.1.6 
Compare 
and Condition 
Test 
Instructions 


The 
80960CA 
provides 
several 
types 
of instructions 


that are used to compare 
two operands. 
The condition 
code flags in the arithmetic 
control 
register 
are set to 


indicate 
whether 
one operand 
is less than, 
equal 
to, or 
greater 
than 
the other 
operand. 


Compare 
Instructions 


- cmpl 
compare 
integer 
- cm po 
compare 
ordinal 
- chkblt 
check bit 


Conditional 
compare 
instructions 
test 
the 
existmg 


status 
of the condition 
code flags before 
a compare 
is 


• 
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performed. These conditional compare instructions are 
provided to optimize two-sided range comparisons (i.e. 
to test if a value is less than one number but greater 
than another). 


Conditional 
Compare 
Instructions 


- concmpl 
conditional 
compare 
integer 
- concmpo 
conditional 
compare 
ordinal 


The compare and increment and compare and decre- 
ment instructions set the condition code flags based on 
a comparison of two register sources, decrements or 
increments one of the sources, and finally stores this 
result in a destination register. 


-cmplncl 
-cmplnco 
-cmpdecl 
-cmpdeco 


compare 
and increment 
integer 
compare 
and increment 
ordinal 
compare 
and decrement 
integer 
compare 
and decrement 
ordinal 


The condition test instructions allow the state of the 
condition code flags to be tested. Based on the outcome 
of the comparison, a true or false code is stored in a 
destination register. The branch prediction flag is used 
in this instruction to reduce the execution time of the 
instruction when the test outcome is predicted correct- 
ly. For example teste.t (test if equal) will execute in a 
shorter time if the condition code flags test true for the 
equal condition. 
Analogous to the function 
of the 
branch prediction flag in the conditional compare and 
branch instructions, 
the prediction flag in this case 
eliminates breaks in the micro-instruction 
sequence 
which is used to implement the condition test instruc- 
tions. 


Condition 
Test Instructions 


- teste.p 
test if equal 
- testne.p 
test if not equal 
-testl.p 
test if less 
- testle.p 
test if less or equal 
- testg.p 
test if greater 
- testge.p 
test if greater or equal 
- testo.p 
test if ordered 
- testno.p 
test if not ordered 


3.5.1.7 Call and Return 
Instructions 


The 80960CA features an on-chip call and return 
mechanism for making procedure calls to local and sys- 
tem procedures. The call instructions and the call and 
return mechanism is described in Section 3.8. 


Call and Return Instructions 


- call 
call 
- calix 
call extended 
- calls 
call system 
- ret 
return 
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3.5.1.8 Fault Instructions 


The 80960CA will fault automatically as the result of 
certain errant operations which may occur when exe- 
cuting code. Fault procedures are then invoked auto- 
matically to handle the various types of faults. In addi- 
tion, the fault instructions permit a fault to be generat- 
ed explicitly based on the value of the condition code 
flags. The branch prediction flag in these instructions is 
used to reduce the execution time of these instructions 
when the state of the condition code flags are guessed 
correctly. 


Conditional 
Fault Instructions 


- faulte.p 
fault if equal 
- faultne.p 
fault if not equal 
- faultl.p 
fault if less 
- faultle.p 
fault if less or equal 
- faultg.p 
fault it greater 
- faultge.p 
fault if greater or equal 
- faulto.p 
fault if ordered 
- faultno.p 
fault if unordered 


The syncf instruction causes the processor to wait for 
all faults to be generated which are associated with any 
prior uncompleted instructions. 


- syncf 
synchronize 
faults 


3.5.1.9 Debug Instructions 


The processor supports debugging and monitoring of 
program activity through the use of'.trace events. The 
debug instructions support debugging and monitoring 
software. 


Debug Instructions 


~modtc 
modify trace controls 
- mark 
mark 
- fmark 
force mark 


3.5.1.10 
Processor 
Management 
Instructions 


The 80960CA provides several instructions for direct 
control of processor functions and for configuring the 
8096OCA's peripherals. A brief description of the proc- 
essor management instructions is given below. 


Processor 
Management 
Instructions 


- mod pc 
modify process 
controls 
- modac 
modify arithmetic 
controls 
- sysctl 
system control 
instruction 
- udma 
update DMA SRAM 
- sdma 
setup DMA 
- flushreg 
flush local registers 


intel~ 


3.6 Arithmetic Controls 


The Arithmetic 
Control (A C) Register is a 32-bit on-chip 
register (Figure 3-4). The AC register is used primarily 
to monitor and control the execution of 80960CA arith- 
metic instructions. The processor reads and modifies 
bits in the AC register when performing many arithme- 
tic operations. The AC register is also used to control 
the faulting conditions for some instructions. The mo- 
dac instruction allows the user to directly read or modi- 
fy the AC register. 


The processor sets the condition code flags (bits 0-2) to 
indicate equality or inequality as the result of certain 
instructions (such as the compare instructions). Other 
instructions, 
such as the conditional branch instruc- 
tions, take action based on the value of the condition 
code flags. Table J·3 shows the functional assignment 
for each condition code flag. 


Table 3·3. Arithmetic Condition Codes 


Condition 
Condition 
Code 


001 
Greater Than 
010 
Equal 
100 
less Than 
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The integer overflow flag (bit 8) and the integer over- 
flow mask (bit 12) are used in conjunction with the 
arithmetic integer overflow fault. The mask bit masks 
the integer overflow fault. When the fault is masked, 
and an integer overflow occurs, the integer overflow 
flag is set but no fault handling action is taken. If the 
fault is not masked, and an integer overflow occurs, the 
integer overflow fault is taken and the integer overflow 
flag is not set. 


The no imprecise faults flag (bit 15) determines if im- 
precise faults are allowed to occur. Fault handling and 
precise and imprecise faults in the 80960CA are dis- 
cussed in Section 3.10. 


3.7 Process ,Management 


Process management 
refers to the monitoring and con- 
trol of certain properties of an executing process. The 
following sections describe the mechanisms available on 
the 80960CA to perform this function . 


t t_t===== 
Reserved 
(Initialize 
to 0) 


Figure 3·4. Arithmetic Control Register 
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State Flag-This flag determines the executing state of 
the processor. The processor state is either executing 
state (0) or interrupted state (1). 


Trace Enable Bit and Trace Fault Pending Flags- 
These fields control and monitor trace activity in the 
processor. The Trace Enable Bit enables fault genera- 
tion for trace events. The Trace Fault Pending Flag 
indicates that a trace event has been detected. 


The process controls can be modified by software with 
the modify process controls (modpc) instruction. The 
modpc instruction may only write the PC register when 
the processor is in supervisor mode. 


3.7.2 PRIORITIES 


The 80960 architecture defines a means to assign priori- 
ties to executing programs and interrupts. The current 
priority of the processor is stored in the priority field of 
the PC register. This priority is used to determine if an 
interrupt will be serviced and in which order multiple 
pending interrupts will be serviced. Setting the priority 
of an executing program above that of interrupts allows 
critical code to be prioritized and executed without in- 
terruption. 


The priority field of the PC register can be modified 
directly using the modpc instruction. The priority field 
is also modified to reflect the priority of serviced inter- 
rupts. On a return from an interrupt routine, the priori- 


protect cooe and data trom possible corruption by pro- 
grams executing in user mode. Some instructions and 
functions of the 80960CA are also insulated from code 
executing in user mode. 


The processor has two operating states: executing and 
interrupted. In executing state, the processor can exe- 
cute in user or supervisor mode. In the interrupted 
state, the processor always executes in supervisor mode. 


3.8 Call and Return Mechanism 


The 80960 architecture features a built-in call and re- 
turn mechanism. This mechanism is designed to make 
procedure calls simple and fast, and to provide a flex- 
ible method for storing and handling variables that are 
local to a procedure. A call automatically allocates a 
new set of local registers and a new stack frame. All 
linkage information is maintained by the processor, 
making procedure calls and returns virtually transpar- 
ent to the user. A system call instruction is provided as 
a method for calling privileged procedures such as a 
kernel service. The call and return model supports effi- 
cient translation of structured high level code (such as 
C, or ADA) to 80960 machine language. 


The procedure call and return mechanism provides a 
number of significant benefits which contribute to the 
performance and ease of use of the 80960CA. 
I) The call and return instructions are implemented en- 


tirely on-chip, resulting in an extremely high per- 
formance implementation of these commonly used 
functions. 


~R.s.rv.d 
L---.J 
(lnfUollze 
to 0) 


tt Trace 
Enable 
L.: Execution 
Mode 


--------- 
Trace 
rbult 
Pending 


----------- 
State 
------------- 
Priority 
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Figure 3-5. Process 
Control 
Register 
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2) A single instruction to implement each call or return 
operation results in code density improvements com- 
pared to processors which require multiple instruc- 
tions to encode these functions. 


3) By implementing the call and return functions as 
single instructions, the 80960 architecture is open for 
further 
optimization 
of these instructions, 
while 
maintaining assembly-level compatibility. 


4) A program does not have to explicitly save or restore 
the variables stored in the local registers when a call 
or return is executed. The processor does this implic- 
itly on procedure calls and on returns. 


5) The call and return mechanism provides a structure 
for storing a virtually unlimited number of local 
variables for each procedure: the on-chip local regis- 
ters provide quick access to often used variables and 
the stack provides space for additional variables. 


3.8.1 LOCAL 
REGISTERS 
AND THE STACK 
FRAME 


At any point in a program, the 80960 has access to a 
local register set and a section of the procedure stack 
referred to as a stack frame. When a call is executed, a 
new stack frame 'is allocated for the called procedure. 
Additionally, the current local register set is saved by 
the processor, freeing these registers for use by the new- 
ly called procedure. In this way, every procedure has a 
unique stack and unique set of local registers. When a 


80960CA 
PRODUCT 
OVERVIEW 


return is executed, the current local register set and 
current stack frame are dea1located. The previous local 
register set and previous stack frame are restored. This 
call and return mechanism is illustrated in Figure 3-6 
where n is procedure depth for the currently executing 
procedure. 


The procedure stack structure is defined by the 80960 
architecture. 
The procedure stack always grows up- 


ward (i.e. towards higher addresses) and the stack 
pointer (SP) always points to the next available byte of 
the stack frame. The 80960CA requires that each stack 
frame begins on a l6-byte boundary. Due to this align- 
ment requirement, a padding space of 0 to 15bytes may 
exist between adjacent stack frames in memory. When 
a stack frame is allocated, the first 16 words are always 
assigned as storage for the local registers; therefore, the 
SP initially points to the 17th word in the stack frame. 
It should be noted that although each stack frame is 
assigned storage space for the local registers, these loca- 
tions in the stack are not guaranteed to contain the 
values of the saved local registers. This is because sever- 3 
al sets of local registers are cached on-chip rather than 
written to the stack in external memory. This caching 
mechanism is described in detail later in this section. 


3.8.2 PROCEDURE 
LINKING 


The 80960 architecture automatically manages proce- 
dure linkage. One global register and three local regis- 
ters are reserved for procedure linkage information. 


Coli Nesting 
: 
STACK 


call procedurel 
call procedure2 
call procedure3 
••• 
Executing 
call procedure(n-l) 
Here ---+ 
call procedure(n) 


Stack 
Growth 
(towards 
higher 
addresses) 
+ 


Stack 
Pointer 


~ 
= Saved 
Registers 
c::J 
= Current 
Register. 
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Figure 3-6. Call and Return Mechanism 
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Figure 3-7 describes the pointer structure used to link 
frames and to provide a unique SP for each frame. Reg- 
ister g15 is the Frame Pointer (FP). The FP is the ad- 
dress of the first byte of the current (topmost) stack 
frame. The FP is always updated to point to the current 
frame when calls and returns are executed. Register rO 
is the Previous Frame Pointer (PFP). The PFP is the 
address of the first byte of the stack frame which was 
created prior to the frame containing this PFP. Register 
rl is the Stack Pointer (SP). The SP points to the next 
available byte of the stack frame. Register r2 is reserved 
for the Return Instruction Pointer (RIP). The RIP is 
the address of the instruction which follows a call in- 
struction, this is also the target address for the return 
from that procedure. The RIP is automatically stored 
in register r2 of the calling procedure when a call is 
executed. 


3.8.3 PARAMETER 
PASSING 


Parameters may be passed by value or passed by refer- 
ence between procedures. 
The global registers, the 
stack, or predefined data structures in memory may be 
used to pass these parameters. 


•• 
(towa 
• 
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The global registers provide the fastest method for pass- 
ing parameters. The values to be passed into a proce- 
dure reside in the global registers of the calling proce- 
dure. When a procedure is called, the values in the 
global registers are preserved. If more parameters are to 
be passed than will fit in the global registers, additional 
parameters may be passed in the stack of the calling 
procedure, or in a data structure which is referenced by 
a pointer passed in the global registers. 


3.8.4 LOCAL 
REGISTER 
CACHE 


The 80960CA provides an on-chip cache for saving and 
restoring the local registers on calls and returns. This 
cache greatly enhances performance of the call and re- 
turn mechanism on the 8096OCA. Movement of data 
between the local registers and the register cache is typ- 
ically accomplished in only 4 processor clocks with no 
external bus traffic. When this cache is filled, the regis- 
ters associated with the oldest stack frame are moved to 
the area reserved for those registers on the physical 
stack (Figure 3-7). 
• 


STACK 


Stack 
Growth 
rd higher 
addres.es) 


j 


Reserved 
for Local Registers 
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Figure 
3·7. Stack 
Frame 
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The local register cache is a physical extension of the 
internal data RAM. The part of the data RAM used for 
this cache is not visible to the user and is large enough 
to hold up to 5 sets oflocal registers. The register cache 
may be extended to hold up to 15 sets of local registers. 
When extended, each new register set consumes 16 
words of the user's data RAM, beginning at the highest 
address and growing downward. The size of the local 
register cache is selected when the processor is initial- 
ized. 


In some cases, the contents of the cached local register 
sets may require examination or modification (e.g. for 
fault handling). Since the local registers are cached, the 
flushreg instruction is provided to flush the local regis- 
ter cache to the locations reserved for the registers on 
the stack. This insures that the values in external mem- 
ory are consistent with the values held in the local reg- 
ister cache. 


3.8.5 LOCAL AND SYSTEM CALLS 


The 80960CA provides two methods for making proce- 
dure calls: local calls and system calls. Local and sys- 
tem calls differ in their operation and use in an applica- 
tion. 
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The local call instructions initiate a procedure call us- 
ing the call and return mechanism described earlier. 
The stack frames for these procedure calls are allocated 
on the localprocedure stack. A local call is made using 
either of two local call instructions: call or calix. The 
call instruction specifies the address of the called proce- 
dure using an IP plus displacement addressing mode 
with a range of - 223 to 223 - 4 bytes from the current 
IP. The calix (call extended) instruction specifies the 
address of the calling procedure 
using any of the 
80960's addressing modes. 


A system call is made using the calls instruction. This 
call is similar to a local call except that the processor 
gets the IP for the called procedure from a data struc- 
ture called the system procedure table. The calls in- 
struction requires a procedure number operand. This 
procedure number serves as an index into the system 
procedure table, which contains lP's for specific proce- 
dures. The system procedure table is shown in Figure 
3-8. 


The system call mechanism supports two types of pro- 
• 
cedure calls: system-local calls and system-supervisor 
calls (also referred to as supervisor calls). The system- 
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J.Ul.U"",,J.Ulau 
a \)j-KNun; 
J.C, appucauon sonware 
ooes not 
have to be changed each time the implementation of the 
kernel service is modified. Additionally, the ability to 
switch to a different execution mode and stack allows 
kernel procedures and data to be insulated from appli- 
cation code. 


3.8.6 IMPLICIT 
PROCEDURE 
CALLS 


The call and return mechanism described for procedure 
calls applies to several classes of call instructions as 
well as to the context switching initiated by interrupts 
and faults. When an interrupt or fault condition occurs, 
an implicit call is performed that saves the current state 
of the processor before branching to the interrupt or 
fault handling procedure. When this context switch oc- 
curs, the local registers are saved and a new stack frame 
is allocated. Additionally, the values of the AC register 
and PC register are saved when the implicit call occurs. 
These values are restored on the return from the inter- 
rupt or fault handler. 


3.9 
Interrupts 


An interrupt is a temporary break in the control stream 
of a program so that the processor can handle another 
task. Interrupts 
may be triggered by the instruction 
stream or by hardware sources internal and external to 
the 8096OCA.An interrupt request is associated with a 
vector (i.e. an address) of an interrupt handling proce- 
dure. The processor will branch to the handling proce- 
dure when an interrupt is serviced. When the handling 
action is completed, the processor is restored to its state 
prior to the interrupt. 


3.9.1 INTERRUPT 
VECTORS 
AND PRIORITY 


Interrupt 
vectors are simply instruction pointers (ad- 
dresses) to interrupt handling procedures. The 80960 
architecture defines 248 interrupt vectors. This means 
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The 80960CA compares its current priority and the pri- 
ority of an interrupt request to determine whether to 
service an interrupt immediately or to delay service. If a 
requested interrupt priority is greater than the proces- 
sor's current priority or equal to 31, the processor serv- 
ices the interrupt immediately; otherwise, the processor 
saves (posts) the interrupt request as a pending inter- 
rupt so that it can be serviced later. When the proces- 
sor's priority falls below the priority of a pending inter- 
rupt, the pending interrupt is serviced. With the mecha- 
nism described, interrupts with a priority of 0 will nev- 
er be serviced. For this reason, vectors numbered 0 to 7 
are not defined. 


3.9.2 INTERRUPT 
TABLE 


The interrupt table (Figure 3-9) is an architecturally 
defined data structure which holds the interrupt vectors 
and information on pending interrupts. 
The first 36 
bytes of the table are used to post interrupts. The 31 
most significant bits in the 32-bit pending priorities 
field represent a possible priority (1 to 31) of a pending 
interrupt. When the processor posts an interrupt in the 
interrupt table, the bit corresponding to the interrupt's 
priority is set. For example, if an interrupt with a prior- 
ity of 10 is posted in the interrupt table, bit 10 is set in 
the pending priorities field. 


The pending interrupts field contains a 256-bit string in 
which each bit represents an interrupt vector. When the 
processor posts an interrupt in the interrupt table, the 
bit corresponding to the vector number of that inter- 
rupt is set. 


Portions of the interrupt table are cached on-chip in a 
non-transparent 
fashion. This caching is implemented 
to minimized interrupt latency by reducing the number 
of accesses to the table in external memory when an 
interrupt is serviced. 
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Figure 3-9. Interrupt 
Table 


3.9.3 INTERRUPT 
STACK 


Stack frames for interrupt handling procedures are allo- 
cated on a separate interrupt stack. The interrupt stack 
can be located anywhere in the processor's address 
space. The beginning address of the interrupt stack is 
specified when the processor is initialized. 


3.9.4 INTERRUPT 
HANDLING 
ACTION 


When an interrupt is serviced, the processor saves the 
processor state and calls the interrupt procedure. The 
processor state is restored upon return from the inter- 
rupt procedure. 


This interrupt service mechanism is handled by an im- 
plicit call operation. When the interrupt is serviced, the 
current local registers are saved. A new local register 
set and stack frame are allocated on the interrupt stack 
for the interrupt handler procedure and the processor 
switches to supervisor execution mode. In addition to 
the local registers, the current value of the AC and PC 
registers are saved as an interrupt record on the inter- 
rupt stack. 


3.9.5 PENDING 
INTERRUPTS 


Any of the 248 interrupts can be requested by software. 
The system control instruction (sysctl) is provided to 
support this feature. When the system control instruc- 
tion requests an interrupt, one of two actions may oc- 
cur depending on the priority of the requested interrupt 


80960CA 
PRODUCT 
OVERVIEW 


and the current process priority. 1) The interrupt is 
serviced immediately, or 2) the interrupt is posted (the 
pending priorities field and the pending interrupts field 
are modified to reflect a pending interrupt). 


Interrupts may also be requested by hardware sources 
internal and external to the 8096OCA. Managing the 
hardware sources and posting these interrupts is han- 
dled by the interrupt controller. Interrupts requested by 
hardware are posted in an internal register, not in the 
interrupt table. A mask register enables or disables in- 
terrupts from each hardware source. Requesting and 
posting hardware interrupts is described in Section 4.4 
Interrupt Controller. 


3.9.6 INTERRUPT 
LATENCY 


The time required to perform an interrupt task switch 
is referred to as the interrupt latency. The latency is the 
time measured between the activation of an interrupt 
source and the execution of the first instruction for the 
interrupt-handling procedure for the source. 


Interrupt latency for the 80960CA varies depending on 
conditions such as: 
- 
Complex instructions are executing when the inter- 
rupt occurs (e.g. sysctl, call, ret, etc.). 


- 
Outstanding loads to a local register are pending, 
delaying the interrupt context switch. 
Division, multiplication, 
or other multi-cycle in- 
structions with a local register as destination are 
executing. 


The 80960CA has been designed to optimize latency 
and throughput for interrupts. Two processor features 
are designed for this purpose: 


First, in the interrupt table, all interrupt vectors with 
an index whose least significant four bits are 00102 can 
be cached in internal data RAM. The processor will 
automatically read these vectors from data RAM when 
the interrupt is serviced. This feature reduces the added 
latency due to an external access of the interrupt table 
for that vector. The NMI vector is always cached in 
data RAM. 


Second, an instruction cache locking mechanism allows 
interrupt procedures or segments of interrupt 
proce- 
dures to be stored in the instruction cache. These rou- 
tines are always executed from the internal cache, elim- 
inating external code fetches and reducing latency and 
increasing throughput for the interrupt. 


• 
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3.10 
Fault Handling and Instruction 
Tracing 


The 80960CA is able to detect various conditions in 
code or in its internal state that could cause the proces- 
sor to deliver incorrect or inappropriate results or that 
could cause it to head down an undesirable control 
path. These conditions are referred to as faults. The 
80960 architecture provides fault handling mechanisms 
to detect and, in most cases, fully recover from a fault. 


The 80960CA provides on-chip debug support by trig- 
gering trace events and servicing the trace fault. A trace 
event is activated when a particular instruction or type 
of instruction is encountered in an instruction stream. 
The trace event optionally signals a fault. A fault han- 
dling procedure for the trace fault can act as a debug 
monitor and analyze the state of the processor when the 
trace event occurred. 


3.10.1 FAULT 
TYPES 
AND SUBTYPES 


All of the faults that the processor detects are pre- 
defined. These faults are divided into types and sub- 
types, each of which is given a number. Table 3-4 lists 
the faults that the processor detects arranged by type 
and subtype. 


Table 3-4. Fault Types and Subtypes 


Fault Type 
Fault Subtype 
Fault Record 


Parallel 
XXOOOOXX 


Trace 
Instruction Type 
XX010002 
Branch Trace 
XX010004 
Call Trace 
XX010008 
Return Trace 
XX010010 
Prereturn Trace 
XX010020 
Supervisor Trace 
XX010040 
Breakpoint Trace 
XX010080 


Operation 
Invalid Opcode 
XX020001 
Unimplemented 
X0020002 
Invalid Operand 
XX020004 


Arithmetic 
Integer Overflow 
XX030001 
Arithmetic Zero-Divide 
XX030002 


Constraint 
Range 
XX050001 
Privileged 
XX050002 


Protection 
Length 
XX070001 


Type 
Mismatch 
XXOAOO01 


NOTE: Xrefers to preserved locations in the fault record. 


Parallel Fault Entry 
0 
Trace Fault Entry 
8 
Operation Fault Entry 
16 
Arithmetic Fault Entry 
24 


.:....::.:.:.:.:.....:.::.:.:.;.::.:.:.:.::.:.:.:.:-:.:.:.:.:.:.: 
....:.:.:::...:.:.:-:.: ..:.:.:.:.:.:.:.:.:.:::.:.:.:.: 


Constraint Fault Entry 
40 


Protection Fault Entry 
56 


Type Fault Entry 
80 
1IIIlIIIIII252 


Reserved 
(initialize to 0) 
270669-13 


System Procedure Table Fault Table Entry 
31 
1 0 
Index 
1 0 
Set to 0000027F 16 


Figure 3-10. Fault Table 
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31 
o 


Process Controls 


Arithmetic 
Control 


Fault Flags I Fault Type 
t.::·:.-::·:·:·:·:·: I 
Fault Subtype 


Addres. 
of Faulting 
Instruction 


(':-:-:-:-:-:-.1 
Reserved 
270669-14 


o 


'4 


8 
12 


Figure 3-11. Fault Record 


3.10.2 
FAULT 
TABLE 


The fault table (Figure 3·10) provides the processor 
with a pathway to fault handling procedures. The fault 
table is an architecture-defined data structure, which 
may be located anywhere in the processor's address 
space. The location of the fault table is specified at ini- 
tialization. When a fault occurs, an entry in the table is 
selected based on the type of fault that occurs. The 
entry in the fault table contains a pointer to a specific 
fault handler. 


The fault table can contain two types of entries (Figure 
3-10). The first type of entry is simply a pointer to the 
address of the fault-handling procedure. The second 
type of entry is an index into the system-procedure ta- 
ble. Fault-handling 
procedures accessed through the 
system-procedure table may be executed in user or su- 
pervisor execution mode. 


3.10.3 
FAULT 
HANDLING 
ACTION 


When a fault occurs, the processor performs an implicit 
call operation to the procedure specified in the fault 
table. In addition to performing the implicit call opera- 
tion, the processor creates a fault record in its newly 
allocated stack frame. This fault record contains infor- 
mation on the state of the processor when the fault 
occurred and the fault type and subtype (Figure 3-11). 


Some faults can be recovered from easily. When recov- 
ery from a fault is possible, the processor's fault han- 
dling mechanism allows the processor to automatically 
resume work where the fault was signalled. The re- 
sumption action is initiated with the ret instruction. If 
simple recovery from a fault is not possible, then the 
fault handling procedure may call a debug monitor, ini- 
tiate a reset, or take other actions to recover from the 
fault. 


3.10.4 TRACING 
AND DEBUG 


The 80960CA provides a facility for monitoring the ac- 
tivity of the processor by tracing the instruction stream, 
A trace event occurs at points in a program where cer- 
tain types of instructions are encountered or a certain 


IP or data address is encountered. When a trace event 
occurs, a trace fault can be generated and a trace-fault 
handler called which displays or analyzes the state of 
the processor. 


3.10.4.1 
Trace 
Events 


The Trace Control (TC) Register (Figure 3-12) is used 
to specify the types of instructions which cause trace 
3 


events. When a mode bit in the TC register is set, spe- 
cific instructions will generate trace events. For exam- 
ple, if the branch trace mode bit is enabled and a 
branch instruction is executed, a branch trace event will 
be signalled. An event flag is used to record trace 
events. A single event flag is provided for each mode 
bit. Any trace event generates a trace fault when the 
trace enable bit in the process control register is set. 


The 80960CA recognizes 7 trace events. These events 
are described below. 


Instruction 
Trace Event-Signalled 
each time an in- 
struction is executed. This trace event can be used with 
a debug monitor to single step the processor. 


Branch Trace Event-Signalled 
each time a branch in- 
struction is executed. For conditional branch instruc- 
tions, this event is only signalled when the branch is 
taken. Branch-and-link, call, and return instructions do 
not signal this trace event. 


Call Trace Event-Signalled 
each time a branch-and- 
link or call instruction is executed. Implicit calls, such 
as those used in interrupt or fault handling, signal this 
event. When a call trace event occurs, the prereturn 
trace flag (bit 3 in local register rO)is set by the proces- 
sor to indicate a prereturn trace pending. 


Pre-Return 
Trace Event-Signalled 
just prior to any ret 
instruction. This event is only signalled if the pre-return 
trace flag in register rOis set. Since the pre-retum trace 
flag is set when a call trace event occurs, the call trace 
mode must be enabled before a pre-return trace event 
can be signalled. 


Return Trace Event-Signalled 
each time a ret instruc- 
tion is executed. 
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Trace Event flog.: 


'---- 


__ 
Instruction 
Trace 
'------------------- 
Branch Troce 
Call Trace 


'--------------------- 
Return Trace 
Pr. return 
Trace 


'---------------------- 
Supervisor 
Troce 
L... 
Breakpoint 
Trace 


'----------------------- 
Data Addre •• Breakpoint 
0 
'------------------------ 
Data Addre.s 
Breakpoint 
1 
L.. 
Instruction 
Addre •• Breakpoint 
0 


'-------------------------- 
Instruction 
Addre •• Breakpoint 
1 


Reserved 
(Initialized to 0) 
270669-15 


Figure 3-12. Trace Control 
Register 


Supervisor Trace Event--Signalled each time a calls in- 
struction is executed where the selected entry type is 
supervisor, or when a ret from supervisor mode is exe- 
cuted. 


Breakpoint Trace Events--Signalled 
each time a mark 
instruction, fmark instruction, or specified address is 
encountered in the instruction stream. The mark in- 
struction signals an event when the breakpoint trace 
mode is enabled, the fmark (force mark) instruction 
will generate a breakpoint trace event regardless of the 
value of the breakpoint trace mode bit. 


Two IP breakpoint registers and two internal data ad- 
dress 
breakpoint 
registers 
are 
provided 
on 
the 
8096OCA. These breakpoints are loaded with an in- 
struction 
or data address using the system control 


(sysctl) instruction. When the address is encountered 
and the breakpoint trace mode bit is set, a breakpoint 
trace event occurs. A corresponding instruction or data 
address event flag is set in the 'J'C register when the 
address is encountered. 


3-178 


3.10.5 PROCESSOR 
INITIALIZATION 


The Initial Memory Image (IMI) are the data struc- 
tures needed to initialize the 80960CA (Figure 3-13). 
The initialization boot record, in reserved memory be- 
ginning at FFFFFFOOH, contains a pointer to the Proc- 
essor Control Block (PRCB). The PRCB in turn holds 
pointers to the data structures which are necessary to 
execute code on the 8096OCA. The PRCB also holds 
several fields which contain information 
to initially 
configure the 8096OCA. 


Processor initialization begins by asserting the RESET 
pin. At initialization the processor optionally performs 
an internal self-test. A bus confidence test is also per- 
formed by calculating a checksurn of 8 words read from 
external memory. If either of these self-tests fails, the 
FAIL pin indicates the failure and the processor aborts 
initialization. If the self-test passes, the 80960CA con- 
tinues with initialization and branches to the first ad- 
dress of the user's code. 
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Address 


ffrFTfOOH 


ffffff10H 


ffffff14H 


ffffff18H 


ffffff2CH 


fixed 
Data Structures 


Initialization 
Boot Record: 


Relocatable 
Data Structures 


Byte Offset 


OH 


User Code: 
- 
Bus 
..~ 
..~ 
Configuration 
(least 
Significant 
Byte) 


first 
Instruction 
Pointer 
- 
Prceess 
Control 
BI"ck (PRCB): 


PRCB Pointer 
fault 
Table 
Ba.e 
Address 


~ 
Control 
Table 
Ba.e 
Address 


AC Register 
Initial Image 


6 Check 
Word. 
(for 
bus 
confidence 
self-test) 
fault 
Configuration 
Word 


~ 
Interrupt 
Table 
Base Address 


~ 
System 
Procedure 
Table 
Base Addre.s 


~ 
Interrupt 
Stock 
Pointer 


Instruction 
Cache 
Configuration 
Word 


Register 
Cache 
Configuration 
Word 


~ 
Control 
Table 


..~ 
..~ 
- 
Interrupt 
Table 


..~ 
..~ 
- 
System 
Procedure 
Table 


..~ 
..~ 
- Other 
Archltecturally 
Defined 
Data Structures 
(not 
required 
os port 
of IMI) 


8H 


CH 


10H 


18H 


1CH 


20H 


270669-19 


Figure 3-13: Initial Memory 
Image 
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4.0 80960CA 
SYSTEM 
IMPLEMENTATION 


This section 
is an overview 
of the peripherals 
integrated 
with the 80960CA 
core. The features 
and operation 
of 
the Bus Controller, 
DMA 
Controller, 
Interrupt 
Con- 
troller, 
and the interfaces 
between 
these peripherals 
and 
the core are described. 


4.1 
Peripheral 
Interface 


A program 
communicates 
with the on-chip 
peripherals 
by reading 
or modifying 
the special 
function 
registers 
(SFRs) 
or by loading 
control 
registers. 
The SFRs 
gen- 
erally serve to transfer 
status 
information 
and data be- 
tween 
a peripheral 
and the core, and the control 
regis- 
ters 
serve 
to configure 
the 
peripherals. 
SFRs 
are ac- 
cessed 
directly 
as instruction 
operands. 
The 
control 
registers 
are loaded by using the system control 
(sysctl) 
instruction. 


4.2 
Bus Controller 
Unit 


The Bus Controller 
Unit (BCU) 
manages 
the data 
and 
instruction 
path 
between 
the 
80960CA 
and 
external 
memory. 
Data 
operations 
and instruction 
fetches 
share 
a 32-bit data 
bus. Memory 
addresses 
are output 
on a 
separate 
32-bit address 
bus. The BCU incorporates 
sev- 
eral advanced 
features 
to simplify 
the bus interface 
to 
external 
memory. 
A programmable 
memory region con- 
figuration table allows 
the characteristics 
of the exter- 
nal bus to be programmed 
differently 
for 
16 separate 
regions 
in memory. 
The attributes 
of the external 
bus 
which 
are programmable 
include 
wait states and exter- 
nal ready 
control, 
data 
bus width 
(8, 16, or 32 bits), 
burst 
mode, address 
pipe1ining, 
and byte ordering. 
The 
region 
programmable 
bus options 
are described 
in this 
section. 


4.2.1 BUS TRANSFERS, 
ACCESSES, 
AND 
REQUESTS 


The 
distinction 
between 
transfer, bus access, and bus 
request, as these terms 
apply to the 8096OCA, must be 
presented 
before beginning 
a discussion 
of the BCU. 


Transfer-A 
bus transfer is defined 
simply 
as a move- 
ment of code or data between 
a memory 
system and the 
8096OCA. 
A write 
transfer 
occurs 
when 
the memory 
system 
is the destination 
of a data 
movement. 
A read 
transfer 
occurs 
when 
the 
80960CA 
is the destination 
for a data 
or a code fetch from memory. 


Bus Access-A 
bus accessis defined 
as an address 
cycle 
and one or more transfers. 
In burst mode, an access can 
consist 
of a single address 
cycle and 
I to 4 transfers. 
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Bus Request-A 
bus request is issued 
by the core and 
directed 
to the Bus Controller. 
A bus request 
is sent to 
the BCU when a load, store, or an atomic 
instruction 
is 
executed, 
or when 
an instruction 
fetch 
is needed. 
Bus 
requests 
are also issued by the core to perform 
DMA 
transfers. 
A bus request 
can consist 
of one or more bus 
accesses. 
For example, 
an aligned 
word (32-bit) 
request 
to an 
8-bit 
memory 
region 
will 
result 
in four 
byte- 
length 
accesses. 


4.2.2 BUS CONTROL 
CO PROCESSOR 


The 8096OCA's 
peripherals 
are often referred 
to as eo- 
processors, 
since their 
operation 
is decoupled 
from the 
execution 
of the instruction 
stream. 
As an integrated 
coprocessor, 
the BCU 
receives 
bus requests 
and inde- 
pendently 
carries 
out the action 
of moving 
data or code 
between 
the processor 
and external 
memory. 
The BCU 
uses a three 
deep queue 
to store pending 
bus requests. 


The 
queue 
decouples 
the core from 
the BCU, 
since a 
series of adjacent 
requests 
may be issued faster than the 
BCU can service 
each request. 
Two of the three 
queue 
entries 
store 
requests 
from 
a user's 
program 
(loads, 
stores, 
fetches, 
etc.). The third 
queue 
entry 
is used by 
requests 
originating 
from 
a 
DMA 
operation. 
This 
queue 
entry 
takes 
user 
requests 
-when 
the 
DMA 
is 
turned 
off. The 80960CA 
alternates 
service 
of requests 
issued 
by the user 
program 
and 
requests 
issued 
by a 
DMA 
operation. 


4.2.3 SIGNAL 
DESCRIPTIONS 


The external 
bus signals consist 
of 30 address 
signals, 4 
byte enables, 
32 data lines, and various 
control 
signals. 


D31-DO 
32-bit 
Data 
Bus (bi-directional}--32-, 
16-, 
and 
8-bit 
values 
are 
transmitted 
and 
re- 
ceived 
on 
these 
lines. 
The 
8- and 
16-bit 
quantities 
are transferred 
on the low order 
data lines when a memory 
region 
is config- 
ured respectively 
for an 8- or 16-bit bus. 


A31-A2 
30-bit 
Address 
(outputs}--The 
30-bit 
ad- 
dress bus identifies 
all external 
addresses 
to 
word 
(4-byte) 
boundaries. 
The byte enable 
lines 
indicate 
the 
selected 
byte 
in 
each 
word. 


BEJ-BEO 
Byte Enables 
(outputs}--The 
byte 
enables 
select which of 4 addressed 
bytes are active 
in a memory 
access. 
When 
a memory 
re- 
~ 
is co~red 
for an 8-bit bus width, 
BEl 
and BEO act as the lower 
two bits of 
the 
address. 
For 
a 16-bit memory 
region, 


BEl, 
BE3, and BEO are encoded 
to provide 
AI, 
BHE, 
and BLE respectively. 


W/R 
Write 
or Read (output}--This 
signal is low 
for read accesses 
and high for write access- 
es. 


ADS 
Address 
Strobe 
(output}--Indicates 
valid 
address 
and the start 
of a new bus access. 


DTfR 


DIC 
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Data Transmit or Receive (output)- 
Direction control for data transceivers; 
similar to WfR. 
Data Enable (output)-Low 
during a 
bus request after the first address cy- 
cle. This signal is used to control data 
transceivers and to indicate the end of 
a bus request. 
Wait 
(output)-Indicates 
that 
wait 
states are being inserted by the internal 
wait state generator. 
Ready (input)-Signals 
that data is 
valid for a read transfer or ends data 
hold for a write transfer. This function 
can be disabled for a memory region. 
Burst Terminate (input)-Terminates 
a burst access.Another address is gen- 
erated to complete the request when 
the signal is deasserted. This function 
can be disabled for a memory region. 
Data or Code (output)-Indicates 
a 
data transfer or a code fetch. 
DMA Access (output)-Indicates 
that 
a bus request was initiated by either 
the user program or the DMA. 
Supervisor Access (output)-Indicates 
that a bus access originated from a bus 
request issued in supervisor mode. 
This signal can be used to protect sys- 
tem data structures, or peripherals 
from errant modification by the user 
code. 


HOLD 


Lock 
(output)-Indicates 
that 
an 
atomic memory operation is in prog- 
ress. This signal can be used to inhibit 
external agents from modifying memo- 
ry which is atomically accessed. 
Burst Last (output)-Indicates 
the last 


transfer in a burst access. 
Hold (input)-HOLD 
can be used by 


a bus requester to request access to the 
bus. The processor asserts HLDA af- 
ter the current bus request or locked 
requests have completed. 
Hold 
Acknowledge (output)-Indi- 


cates to a bus requester that the proc- 
essor has relinquished control of the 
bus. 
Bus Request (output)-Indicates 
that 


requests are queued in the bus control- 
ler and are waiting to be serviced. 
BREQ can be used for external bus ar- 
bitration 
logic in conjunction with 
HOLD and HLDA to regain bus mas- 
tership. 


HOLDA 


BREQ 


Figure 4-1 shows the timing for a simple, non-burst, 
non-pipelined read and write access. The timing rela- 
tions for the key control signals are shown in this fig- 
ure. 
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FIgure 4-1. Basic Read and Write Request 
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structions at sequential addresses starting at the address 
which began the burst access (Figure 4-3). In a burst 
memory system, the upper 28 bits of an address remain 
fixed while the lower two bits A2 and A3 increment to 
access subsequent locations. 


- 
Aooress l'lpeunmg 
(on/off) 
Byte Ordering 
(Big/Little Endian) 


Wait States 
(5 parameters) 
Bus Width 
(8-, 16-, or 32-bit) 


The flexibility of region programming simplifies the bus 
interface in applications where a memory system is 
made up of a variety of sub-systems, such as SRAM, 
DRAM, ROM, and memory mapped peripherals. Each 
memory sub-system can be mapped into a different re- 
gion in memory, and that region can be configured spe- 
cifically for the requirements of the particular memory 
sub-system. 


Wait state timing for the first access of a burst request 
is controlled independently from the timing for subse- 
quent accesses. A memory sub-system using static col- 
umn mode or page mode DRAMs, for example, can 
take advantage of the short column access times for 
these devices by using burst mode. Interleaved ROM or 
EPROM systems can also be constructed which simul- 
taneously access several words and then use burst mode 
to multiplex the multi-word array onto the data bus. 


WEWORY REGION 


Region 
CONFIGURATION TABLE 


o 


1 
2 
3 
4 
5 
6 


7 
8 
9 
10 


11 
12 
13 
14 
15 


REGION TABLE ENTRY 


31 
212019181716151413121110987 
65432 
1 0 
1--rrm[]·II.11 
I11 
111111 
I 
1111111 
11 


EE:] 
Reserved 
"- 
. ·:::i:·· 
(InKlollze 
To 0) 
'-- 
'--- 


BURST ENABLE 


READY ENABLE 


PIPELINE ENABLE 


NR.\D 


NROO 


NXOA 


NWAO 
NWOO 
BUS WIDTH 


BYTE ORDER 
270669-22 


Figure 4-2. Memory 
Region Configuration 
Table 
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Read Request: 


Clock 
• 
• 
• • 
• 
• • • • 
• • 


Address 
~ 
X 
X 
X 
rt/.Y#ItlI.~_ 


-------------<::)-----<::)-----<:)-----<::)--------------- 


r::::J 


270669-23 


270669-24 


Doto 


Doto Tronsfer 
~ 
. 
~ 


,:- 
,,:,;,,::::::::·-·::::1 
,::.. 
Bus Access 


Figure 4-3. Burst Memory 
Request 


Read Request: 


Wait Stot. 
Counter 
o 
3 
2 
0 
2 
10 
0 
2 
o 
2 
o 
3 
2 
0 
3 
2 


Clock 
.• 


Addres. 
X?X 
X 
X 
X 
XXXXXXXXXX 


Octo 
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Octo Tron.for 
D 
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D 
E:l 


Bus Access 


Write Request: 


I 
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NWOO=21 
Wait Stot. 
Counter 
o 
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0 
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X 
X 


X 
X 
,.' 
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320 
3 
2 


Clock 


Addres. 
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.A. 
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,JI,X••••• 
__ 
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~ 
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Octo Tron.for 
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:.:) 
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Bus Access 
:, 
(.-:.--.. 


Figure 4-4. Programmable 
Wait States 
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4.2.4.2 
Programmable 
Wait State Generation 


The 80960CA may be interfaced with a variety of mem- 
ory sub-systems and peripherals with a minimum sys- 
tem cost and complexity. To achieve this interface flexi-- 
bility, the 80960CA implements an internal program- 
mable wait state generator. Internally generated wait 
states eliminate the potential system delays which come 
from generating wait states with external logic. 


Wait states are programmed for each region in the 
memory region configuration table. The number of wait 
states is programmable over a range which allows effi- 
cient control of memory devices ranging from ultra-fast 
SRAMs to slow peripherals. An external ready signal is 
also provided for external wait state control. 


The 
wait 
states 
which 
can 
be generated 
by the 


80960CA are shown in Figure 4-4. In this table N is the 
number of wait states inserted. The wait states for read 
accesses and for write accesses are described by three 
parameters each. For read accesses, NRAD is the num- 
ber of states between the address cycle and the first 
data cycle and NRDO is the number of states between 
consecutive data cycles in a burst access. For writes, 
Nw AD is the number of states that data is held after an 
address cycle, and Nwoo 
is the number of states that 
data is held for consecutive data cycles in a burst write. 
For both reads and writes, NXOA is the number of 
dead cycles after the last data cycle and before the next 
address. 


4.2.4.3 
READY Control 


I 


The memory 
region configuration 
table allows the 
ready input (READY) to be enabled or disabled for 
each region. If the ready input is disabled, the external 
input has no effect on the wait states generated for a 
memory access; all wait states are generated internally. 
If the ready input is enabled, it works in conjunction 
with the programmable wait state generator. In this 


intel~ 


case, the ready input has no effect until the number of 
programmed wait states has expired. When the wait 
state counter reaches 0, the ready input is sampled, and 
wait states continue or are terminated based on the val- 
ue of the ready input. In order to gain complete exter- 
nal control over wait states, all wait state parameters 
for a region can be set to O. 


4.2.4.4 
Pipe lined 
Reads 


The 80960CA BCU provides an address pipelining 
mode (Figure 4-5) to optimize the performance of in- 
struction 
and 
data 
fetches from external 
memory. 


When the pipelined read mode is enabled, an address 
cycle overlaps with the last data cycle in each access, 
effectively reducing the total time needed for each ac- 
cess. Pipelining mode is selected in each region by pro- 
gramming the memory region configuration table. 


4.2.4.5 
Byte Ordering 


One of two configurations for byte ordering, often re- 
ferred to as little endian or big endian, is selected for 
each region by programming the memory region con- 
figuration table. The byte ordering options make the 
80960CA capable of sharing memory with a processor 
which uses either byte ordering scheme. Byte ordering 
refers to the way that the 80960CA relates internal data 
to the way that data is stored or fetched from memory. 
The little endian configuration orders the bytes in a 
short-word or word so that the least significant byte of 
the quantity is positioned at the lowest address and the 
most significant byte at the highest address in memory. 
Conversely, for the big endian configuration, the least 
significant byte is positioned at the highest address, and 
the most significant byte at the lowest address. For ex- 
ample, for little endian ordering, byte 0 for word data 
would be found in memory at an address of the form 
XXXX XXXOH and, for big endian, at address XXXX 
XXX3H. 
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4.2.4.6 Data Alignment 


The 80960CA can service any aligned or non-aligned 
bus request. Aligned requests are directed to their natu- 
ral boundary in memory. In other words, the addresses 
for aligned requests are even multiples of the length of 
the data transferred. Non-aligned requests are not serv- 
iced directly by the BCU but are assisted by microcode. 
Microcode automatically breaks non-aligned requests 
into multiple aligned requests which are then reissued 
to the BCU. Depending on the degree of non-alignment 
and the length of the original request, the resulting re- 
quests by microcode will consist of a combination of 
byte, short-word, and double-word requests. The BCU 
is able to generate an operation-unaligned fault when a 
non-aligned bus request is first received. This fault can 
be selectively masked at initialization. 


4.3 DMA Controller 


The DMA controller is a high-performance, full-func- 
tioned integrated peripheral. The DMA controller can 
manage 4 channels of DMA transfer concurrent with 
program execution. Separate external control for each 
channel is provided. Each channel supports high-per- 
formance 
memory 
to memory 
transfers 
where the 
source and destination can be any combination of inter- 
nal data RAM or external memory. The DMA Con- 
troller supports various types of transfers such as high- 
speed fly-by transfers and data chaining with the use of 
linked descriptor lists in memory. 


The 8096OCA's DMA controller is implemented using 
dedicated hardware and microcode. Because of the effi- 
ciency of the core, it is possible for the microcode to 
execute DMA transfers at high speeds. DMA transfers 
are performed by the core concurrently with execution 
of the user's program. Internal DMA logic is used for 
sampling requests, synchronizing transfers with exter- 
nal devices, and handling the service of multiple active 
channels. 


4.3.1 SIGNAL DESCRIPTIONS 


Twelve pins are dedicated to the DMA controller. 
Three pins are associated with each DMA channel. 
These pins are described below. In this description, the 
pin number corresponds to the channel number. for 
example, 
the 
DREQO pin is the 
request 
pin for 
channel O. 


DMA Request (input}--This 
input in- 
dicates that an external device is re- 
questing 
a DMA 
transfer. 
A DMA 
transfer refers to the complete transfer 
of one byte, short-word, word, or quad- 
word, depending on the transfer data 
width selected for the channel. 
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DMA 
Acknowledge 
(output}--This 


output 
becomes active when the re- 


questing device is accessed. 


EOP3/TC3- 
End of Process (input) 
or Terminal 
EOPO/TCO Count (output}--This 
pin functions ei- 
ther as an input (EOPx) or as an output 
(TCx). When programmed as an out- 
put, the pin is driven active for one 
clock after byte count reaches zero and 
a DMA terminates. When programmed 
as an input, 
an external device can 


cause the DMA operation to terminate. 


4.3.2 DMA TRANSFERS 


The 80960CA DMA controller supports a variety of 
transfer modes and variations of these modes, allowing 
the DMA to adapt to a number of hardware systems 
and the performance requirements of these systems. 


. 4.3.2.1 Standard Block and Demand Mode 
• 
Transfers 


A standard DMA transfer is made up of multiple bus 
requests. Loads from a source address are followed by 
stores to a destination address. The DMA controller 
issues the proper combination of these bus requests to 
execute the DMA 
transfer. For example, a typical 
DMA transfer between memory and an 8-bit peripheral 
could appear as a single byte load request directed to 
the source memory, followed by a single byte store re- 
quest directed to the 8-bit peripheral. 


The DMA controller has two basic transfer modes: 
block mode (unsynchronized) and demand mode (syn- 
chronized). Any DMA transfer will be serviced by one 
of these basic transfer modes. 


A block mode DMA is initiated by software. Block 
mode DMAs are generally between memory. Block 
mode DMA transfers are not synchronized with any 
type of request from an external device. Once the DMA 
begins, it will continue until the entire block is com- 
plete or until it is suspended. The source and destina- 
tion addresses for block mode transfers can be incre- 
mented or held constant for a DMA. 


A demand mode DMA is controlled by an external 
device. Demand mode DMAs are generally between an 
external device and memory. In demand mode, each 
individual DMA transfer can be synchronized with a 
request. The request is signalled when an external de- 
vice activates a DMA channel request pin (DREQ3- 
DREQO). The DMA controller acknowledges this re- 
quest with the DMA 
acknowledge pin (DACK3- 


DACKO) when the requesting device is accessed. A de- 
mand mode transfer may be synchronized with either 
the source or the destination device. 
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the number of bytes in the byte count field in the de- 
scriptor is transferred. At this time, another linked-list 
descriptor 
may be executed. The next descriptor 
is 
specified by the next-pointer field in the current de- 
scription. Data chaining continues until a null pointer 
is encountered in the next-pointer field. Data chaining 
can be designated as source chaining, destination chain- 
ing, or both. 


In data chaining mode, an option exists which allows 
chaining descriptors to be updated while the DMA is 
running. When this option is enabled, the DMA sets a 
bit in the DMA's special function register after loading 
a descriptor and then checks this bit before loading the 
next descriptor. If the bit has been cleared by the user, 
the DMA continues; otherwise, the DMA waits for the 
next descriptor to be set up and for the user to clear the 
bit. An interrupt can be generated when each buffer is 
complete or when the DMA is terminated with a null 
pointer or the EOP pin. 


4.3.3 TRANSFER 
CHARACTERISTICS 


The DMA controller provides the programmer with a 
number of options for configuring the characteristics of 
a DMA transfer. Intelligent selection of transfer char- 
acteristics works to balance DMA performance and 
functionality with performance of the user program 
when the DMA is in progress. 
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4.3.2.2 Fly-by Transfers 


A fly-by transfer mode is provided for the most per- 
formance-critical DMA applications. Fly-by mode also 
makes very efficient use of the external bus during a 
DMA. Standard DMA transfers involve multiple bus 
requests: load requests directed to the source and a 
store request directed to the destination. Fly-by trans- 
fers only require a single bus request. For a fly-by trans- 
fer, memory sees a load or a store on the bus while the 
requesting device is selected by the DMA acknowledge 
pin. The data is never actually read from or written to 
the 8096OCA. For memory to device transfers, the 
processor issues a load, and, while reading the memory, 
accesses the external device with the DMA acknowl- 
edge pin. The data is then written directly to the desti- 
nation device with a single bus request. For a device to 
memory transfer, the reverse operation is performed. 
The DMA issues a store, and, while writing the memo- 
ry, accesses the source device with the DMA acknowl- 
edge pin. In this case, the processor floats the data bus 
and the device's data is written directly into memory. 


4.3.2.3 Data Chaining 


Each DMA channel can be programmed 
in a data 
chaining mode. In this mode, all transfer information is 
taken from a linked-list descriptor in memory (Figure 
4-6). Data chaining is started by specifying a pointer to 
a descriptor in memory. The transfer continues until 


Internal 
Register 


BC= Byte Count 
SA= Source Address 
DA= Destination 
Address 
NPTR= Next Pointer 


Not Used For Source Chaining 


Terminate 


Source Buffers 


Figure 4-6. Source Data Chaining 
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The DMA 
controller 
provides features to opnmize 
transfers by moving a maximum amount of data for 
each bus request issued. This is controlled by specifying 
the width of the source and destination directed bus 
requests for a DMA transfer, and by on-chip assembly 
or disassembly of the transfer when source and destina- 
tion are not of equal widths. 


Data 
alignment 
is performed 
automatically 
by the 
DMA controller when the source and destination of a 
transfer. are not aligned. The alignment algorithm is 
optimized for many transfers, providing a performance 
comparable to the aligned transfer cases. 


4.3.3.1 Transfer 
Data Length 


The transfer data length specifies the length of bus re- 
quests directed to the source and destination in a stan- 
dard DMA transfer. Byte, short, word, or quad-word 
loads and stores are selected for either source or desti- 
nation when a DMA channel is set up. Assembly and 
disassembly of data is automatically performed when 
the source and destination widths are different. This 
feature provides the most efficient use of the bus when 
DMA transfers occur between a source and a destina- 
tion with different external bus widths. 
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The DMA controller provides the option of using quad 
word transfers to enhance DMA performance. When 
quad transfers are specified, the DMA will request a 
four-word load request and four-word store request for 
each DMA transfer. The trade-off for the added DMA 
performance is latency on the external bus, preventing 
requests by the core, or by another DMA channel from 
being immediately serviced. 


4.3.3.2 Data Alignment 


The DMA controller supports transfer of source and 
destination data aligned to different byte boundaries in 
memory. The DMA implements microcode algorithms 
to transfer some non-aligned data with a performance 
level approaching that for aligned transfers. The DMA 
accomplishes this by attempting to issue the maximum 
number of aligned bus requests during a DMA (Figure 
4-7). As shown, most of the overhead due to non- 
aligned ~MAs is incurred at the beginning and end of 
the DMA. DMAs with low byte counts, therefore, do 
not benefit as much from the data alignment features of • 
the DMA. The alignment feature is optimized for 8-bit 
to 8-bit, 32-bit to 32-bit and for 8-bit and 32-bit combi- 
nations of source and destination lengths. 
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Figure 4-7. DMA Data Alignment 
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4.3.3.4 
Performance 
and Latency 
Considerations 


DMA operations and the user program share the re- 
sources of the core and of the external bus. DMA per- 
formance and the performance of the user program are 
coupled directly to the balance of load sharing between 
these two processes. The core resources necessary to 
perform a DMA transfer vary depending on the way a 
channel has been configured. For example, byte assem- 
bly and disassembly requires more processor overhead 
per byte of transfer than does a transfer in which the 
source and destination transfer lengths are equal. The 
performance of a DMA is also tightly coupled to the 
user program's use of the external bus. If the user pro- 
gram does not make frequent bus requests, the requests 
by the DMA controller will be serviced with little or no 
delay. 


The user can enhance performance of the DMA with 
trade-offs in system complexity and flexibility. Aligned 
transfers eliminate the microcode overhead needed to 
perform the internal alignments. DMAs between re- 
gions of equal transfer widths eliminate overhead for 


instruction. 


The DMAC register (Figure 4-8) is described below. 


The channel enable field 
enables a DMA once the 
channel is set up. Clearing these bits will also cause a 
DMA transfer to be suspended. 


The terminal count field signals that byte count has 
reached zero and a DMA has ended. 


Tile channel activefield indicates that a channel is idle 
or active. If set, this bit indicates that the channel is 
active. This implies that the channel is servicing a 
transfer or has a request pending. The active bits are 
status information only. 


The channel donefield indicates that a DMA operation 
is complete. The done bits are status information only. 


The channel wait field is used for handshaking with a 
user program in data chaining mode. The DMA sets 
these bits when a new linked-list descriptor is read. The 
DMA will not read the next descriptor until this bit is 
cleared by the user. The user can set up the next de- 
scriptor and then clear the channel wait bits to dynami- 
cally change descriptors. 


Reserved 
(Initialize To 0) 
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Figure 4-8. DMA Command 
Register 
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A priority mode bit selects rotating or fixed priority 
mode. 


The throttle bit selects the maximum amount of core 
resources that the DMA microcode will receive in rela- 
tion to the execution of the user program. 


4.3.5 DMA INTERRUPTS 


The DMA controller is the source of 4 hardware inter- 
rupts in the 8096OCA. The DMA Controller can be 
programmed to request an interrupt when a DMA is 
complete, or when a butTer transfer is completed in 
chaining mode. Each channel requests a ditTerent inter- 
rupt. 


4.4 Interrupt Controller 


The 80960CA Interrupt Controller manages interrupts 
which are requested by external agents or by the DMA 
Controller. The interrupt controller manages 4 internal 
DMA interrupt sources, a single NMI (Non-Maskable 
Interrupt) pin, and 8 external interrupt pins. Up to 248 
external interrupt sources can be supported by the in- 
terrupt controller. The interrupt controller handles the 
prioritization 
of software interrupts, 
hardware inter- 
rupts, and the process priority, and signals the core 
when interrupts are to be serviced. The interrupt con- 
troller provides the low-latency interrupt service fea- 
tured on the 8096OCA. 


4.4.1 EXTERNAL INTERRUPTS 


The 80960CA provides 8 interrupt pins and one NMI 
pin for detecting external requests. The interrupt con- 
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troller allows the 8 interrupt pins to be configured as 
dedicated inputs capable of requesting 8 interrupts, or 
as a vectored input capable of requesting up to 248 
interrupts. The NMI pin is always a dedicated input. 
The interrupt controller pins are described below. 
XINT7- 
External 
Interrupts 
(inputs)-These 
pins 
XINTO can be used as dedicated inputs, or acting 
together as an 8-bit number, request any in- 
terrupt. The inputs are edge or level detect- 
ed, and are optionally debounced internally. 


NMI 
Non-Maskable Interrupt (input)-NMI 
re- 
quests the highest priority interrupt. NMI 
is always taken and is not maskable (as the 
name implies), and not interruptable. 


4.4.2 INTERRUPT MODES 


The 8 external interrupt pins can be configured in one 
of three modes: dedicated mode, expanded mode, or 
mixed mode (Figure 4-9). 


4.4.2.1 Dedicated Mode Interrupts 


In dedicated mode, each of the 8 interrupt pins acts as a 
dedicated input. When an external event is detected on 
an interrupt pin, a unique interrupt is requested for that 
pin. It is possible to map each dedicated pin to one of a 
number of possible interrupt vectors. This is accom- 
plished by programming 
the interrupt 
map (IMAP) 
control registers with an interrupt vector number for 
each pin. (Recall that interrupt 
vector numbers are 
8-bit values which reference the 248 vectors in the in- 
terrupt table.) 
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Figure 4-9. Interrupt Modes 
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Only the upper four bits of the vector number can be 
programmed for a dedicated mode interrupt. The lower 
four bits are fixed at the value OOI~. With four pro- 
grammable bits, one of 15 interrupt vectors is available 
for each dedicated pin. These interrupt vectors span the 
even priority levels from priority 2 to 30. The vector at 
priority 0 is not defined. 


The 15 interrupt vectors available to dedicated sources 
can be cached in internal data RAM. If this interrupt 
vector caching feature is selected, the processor will au- 
tomatically fetch the vector from data RAM, eliminat- 
ing the latency caused by a bus request for a vector in 
external memory. 


The DMA Controller can request four interrupts to sig- 
nal the end of a DMA for each of four channels. The 
four interrupt signals from the DMA are handled by 
the interrupt controller in the same way as an interrupt 
pin configured as a dedicated input. Each of the four 
DMA sources may request one of 15 interrupts by pro- 
gramming the IMAP for that source. 


4.4.2.2 
Expanded 
Mode Interrupts 


In expanded mode, external hardware considers the in- 
terrupt pins (XINTO-XINT7) 
as an 8-bit binary num- 
ber. This number is used directly as the interrupt vector 
number. Each of the 248 possible interrupt vectors can 
be referenced in this way, allowing a separate external 
source for each vector. External hardware is responsi- 
ble for recognizing individual hardware sources and 
then driving the interrupt vector number corresponding 
to that source onto the interrupt pins. 


4.4.2.3 
Mixed Mode Interrupts 


In mixed mode, the 8 interrupt pins are divided into 
two functional sets. One set functions in dedicated 


3-190 


mode, the other in expanded mode. In mixed mode, 
three 
pins 
are 
dedicated 
interrupt 
pins 
(XINT7- 
XINT5). A programmable vector 'number is associated 
with each of these pins. The remaining five interrupt 
pins (XINT4-XINTO) 
are treated as the most signifi- 
cant five bits of the expanded mode vector number. The 
lower order bits are internally forced to 0102 to form 
the full 8-bit value for the vector number. 


4.4.3 INTERRUPT 
CONTROLLER 
SETUP 


The interrupt controller uses two special function regis- 
ters to.manage interrupt requests by hardware sources. 
The hardware interrupt pending register (IPND) and 
the hardware interrupt mask register (IMSK) are ad- 
dressed as sill and sf! respectively. A single bit in each 
register corresponds to each of the 8 possible external 
sources and 4 DMA sources for hardware interrupts. 
The IMSK register performs the function of masking 
hardware interrupts and the IPND register implements 
posting of interrupts 
requested by hardware. 
When 
configured for expanded or mixed mode interrupts, bit 
o of the IMSK register globally masks the expanded 
mode interrupts. 


4.4.4. NON-MASKABLE 
INTERRUPT 


In addition to the maskable hardware interrupts, a sin- 
gle Non-Maskable 
Interrupt (NMI) is provided. A dedi- 
cated NMI pin is used to request this interrupt. NMI is 
defined as' a higher priority than any hardware inter- 
rupt, software interrupt, or process priority. The NMI 
procedure, 
therefore, can never be interrupted 
and 
must execute the return instruction before other proce- 
dures can execute. The NMI 
procedure 
is entered 
through vector 248. This vector is cached in internal 
data RAM at initialization to reduce latency for the 
NMI. 
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APPENDIX 
A 
80960CA CORE IMPLEMENTATION 


The 80960CA Core is a high-performance implementa- 
tion of the 80960 Core Architecture. This section brief- 
ly describes the microarchitecture of the 80960CA core 
and the key constructs used to achieve parallel instruc- 
tion execution. 


The 80960CA core can be divided into the 6 main sub- 
units listed below. 


Instruction Sequencer 
Register File 
Execution Unit 
Multiply and Divide Unit 
Address Generation Unit 
Static Data RAM and Local Register Cache 


Figure A-I is a simple block diagram of the 8096OCA. 
The nucleus of the processor is the Instruction 
Se- 
quencer and Register File. The other subunits of the 
core, referred to as coprocessors, radiate from these 
units, connecting to either the register (REG) side or 
the memory (MEM) side of the processor. The Instruc- 
tion Sequencer issues directives, via the REG 
and 
MEM interfaces, which target a specific coprocessor. 
That coprocessor then executes an express function vir- 
tually decoupled from the IS and the other coproces- 


sors. The REG and MEM data busses shown in Figure 
A-I are used to transfer data between the common 
Register File and the coprocessors. 


A.1 Instruction Sequencer 


The Instruction 
Sequencer 
(IS) decodes the instruction 
stream and drives the decoded instruction stream onto 
the coprocessor interfaces. In a single clock, the IS de- 
codes up to 4 instruction and issues up to three of these 
instructions to the on-chip coprocessors or to the IS 
itself. One register (REG) format, one memory (MEM) 
format, and one control or control and branch (CTRL 
or COBR) format instruction can be issued at one time. 
These instructions are directed respectively to the REG 
coprocessors, the MEM coprocessors, or to the IS. The 
ability to issue multiple instructions in parallel can re- 
sult in the simultaneous execution of many instructions 
at once. An optimizing compiler or hand optimization 
of assembly code can easily produce an instruction 
stream which takes full advantage of the parallel execu- 
tion of the core. 


A technique known as resource scoreboarding 
is used to 
manage the parallel execution of instructions and the 
common resources of the processor. A coprocessor, for 
example, can scoreboard itself, indicating that it cannot 
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are issued to the rest of the machine. In the third stage 
(Pipe 2), the instruction computation is started, and for 
single cycle instructions, a result is returned. 


Several microarchitectural 
features of the core are de- 
signed to minimize performance loss due to pipeline 
breaks. 


Branch Prediction-To 
minimize pipeline breaks due to 
branching, the user can specify the direction that a con- 
ditional branch instruction 
will usually follow. The 
processor will execute along the specified instruction 
path with no pipeline break. If the branch direction 
specified was the direction actually selected by execu- 
tion of the conditional branch, no pipeline break oc- 
curs. The direction of the branch guess is determined 
by a bit value in the CfRL format instructions. 


Register Bypassing-Register 
bypassing is a feature 
which forwards the result of an instruction for immedi- 
ate use as the source of another instruction. This for- 
warding occurs at the same time that the value is writ- 
, 


A.1.1 INSTRUCTION 
CACHE 


The IS includes a I Kbyte two-way set associative in- 
struction cache capable of delivering up to four instruc- 
tions each clock to the Instruction 
Sequencer. The 
cache allows inner loops of code to execute with no 
external instruction fetches. 


A.1.2 MICROCODE 
ROM 


The 80960CA uses microcode ROM to implement com- 
plex instructions and functions. This includes calls, re- 
turns, DMA transfers, and initialization sequences. Mi- 
crocode provides an inexpensive and simple method for 
implementing complex instructions in the mostly RIse 
environment of the 8096OCA.When the IS encounters 
a microcoded instruction, it automatically branches to 
the microcode routine. The 80960CA performs this mi- 
crocode branch in 0 clocks. 


State 


Pipe 0 
decode 


Pipe 1 
xxxxx 


2 
3 


decode 
decode 


issue 
issue 


xxxxx 
xxxxx 
execute & 
return 
Pipe 2 


Figure A·2. Instruction 
Pipeline 
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A.2 Register File 


The Register File (RF) contains the 16 local and 16 
global registers. The register file has six ports (Figure 
A-3), allowing parallel access of the register set by sev- 
eral S0960CA coprocessors, This parallel access results 
in an ability to execute one simple logic or arithmetic 
instruction, 
one memory operation (load/store), 
and 
one address calculation per clock. 


MEM coprocessors interface to the RF with a 12S-bit 
wide load bus and a 12S-bit wide store bus. These bus- 
ses enable movement of up to 4 words per clock to and 
from the RF. These busses also allow LOAD data from 
a previous read access and STORE data from a current 
write access to be processed in the register file simulta- 
neously. An additional 32-bit port allows an address or 
address reduction operand to be simultaneously fetched 
by the Address Generation Unit. 


REG coprocessors interface to the RF with two 64-bit 
source busses and a single 64-bit destination bus. With 
this bus structure, two source operands are simulta- 
neously issued to a REG coprocessor when an instruc- 
tion is issued. A 64-bit destination bus allows the result 
from the previous operation to be written to the RF at 
the same time that the current operation's source oper- 
ands are issued. 


A.3 
Execution 
Unit 


The Execution Unit is the 32-bit Arithmetic and Logic 
Unit of the 80960CA Core. The EU can be viewed as a 
self-contained REG coprocessor with its own instruc- 
tion set. As such, the EU is responsible for executing or 
supporting the execution of all the integer and ordinal 
arithmetic instructions, the logic and shift instructions, 
the move instructions, the bit and bit field instructions, 
and the compare operations. The EU performs any 
arithmetic or logical instructions in a single clock. 


80960CA 
PRODUCT 
OVERVIEW 


A.4 
Multiply Divide Unit 


The Multiply and Divide Unit (MDU) is a REG coproc- 
essor which performs integer and ordinal multiply, di- 
vide, remainder, and modulo operations. The MDU de- 
tects integer overflow and divide by zero errors. The 
MDU is optimized for multiplication, performing 32- 
bit multiplies in 4 clocks. The MDU performs multi- 
plies and divides in parallel with the main execution 
unit. 


A.5 
Address Generation 
Unit 


The Address Generation Unit (AGU) is a MEM coproc- 
essor which computes the effective addresses for memo- 
ry operations. It directly executes the load address in- 
struction (lda) and"calculates addresses for loads and 
stores based on the addressing mode specified in these 
instructions. The address calculations are performed in 
parallel with the main execution unit (EU). 


A.6 
Data RAM and Local Register 
• 
Cache 


The Data RAM and Local Register Cache is part of a 
1.5 Kbyte block of on-chip Static RAM 
(SRAM). 


1 Kbyte of this SRAM is mapped into the 8096OCA's 
address 
space 
from 
location 
()()()()()()()(H to 


000003FFH. A portion of the remaining 512 bytes is 
dedicated to the Local Register Cache. This part of 
internal SRAM is not directly visible to the user. Loads 
and Stores, including quad-word accesses, to the inter- 
nal SRAM are typically performed in only one clock. 
The complete local register set, therefore, can be moved 
to the local register cache in only four clocks. 


~////////////////////~ 
128 


604, 
~/~~~~;/o/~/~~~~~/~/;!~e 
~ 
. 


604\ 
I 
16 Local Registers 
I 
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Figure A-3. Six-Port 
Register 
File 
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32-BIT HIGH PERFORMANCE 
EMBEDDED PROCESSOR 


• Two Instructions/Clock 
Sustained Execution 
• Four 59 Mbytes/s DMAChannels with Data Chaining 
• Demultiplexed 32-Bit Burst Bus with Pipelining 


• 
32-bit Parallel Architecture 
- 
Two Instructions/clock 
Execution 
- 
Load/Store Architecture 
- 
Sixteen 32-bit Global Registers 
- 
Sixteen 32-bit Local Registers 
- 
Manipulate 64-bit Bit Fields 
- 
11 Addressing Modes 
- 
Full Parallel Fault Model 
- 
Supervisor Protection Model 


• 
Fast Procedure Call/Return Model 
- 
Full Procedure Call in 4 clocks 


• 
On-Chip Register Cache 
- 
Caches Registers on Call/Ret 
- 
Minimum of 6 Frames provided 
-Up 
to 15 Programmable Frames 


• 
On-Chip Instruction Cache 
- 
1 Kbyte Two-Way Set Associative 
-128-blt 
Path to Instruction Sequencer 
- 
Cache-Lock Modes 
- 
Cache-Off Mode 


• 
High Bandwidth On-Chip Data RAM 
- 
1 Kbyte On-Chip RAM for Data 
- 
Sustain 128 bits per clock access 


• 
Four On-Chip DMA Channels 
- 
59 Mbytes/s Fly-by Transfers 
- 
32 Mbytes/s Two-Cycle Transfers 
- 
Data Chaining 
- 
Data Packing/Unpacking 
- 
Programmable Priority Method 


• 
32-Bit Demultiplexed Burst Bus 
-128-bit 
Internal Data Paths to and 
from Registers 
- 
Burst Bus for DRAM Interfacing 


- 
Address Pipelining Option 
- 
Fully Programmable Wait States 
- 
Supports 8, 16 or 32-blt Bus Widths 
- 
Supports Unaligned Accesses 
- 
Supervisor Protection Pin 


• 
High-Speed Interrupt Controller 
- 
Up to 248 External Interrupts 
- 
32 Fully Programmable Priorities 
- 
Multi-mode 8-bit Interrupt Port 
- 
Four Internal DMA Interrupts 
- 
Separate, Non-maskable Interrupt Pin 
- 
Context Switch in 750 ns Typical 


Figure 
1. 80960CA 
Die Photo 
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1.0 
PURPOSE 


This document provides electrical characteristics for 
the 33, 25 and 16 MHz versions of the 80960CA. For 
a detailed description of any 80960CA functional 
topic, other than parametric performance, consult 
the latest 80960CA Product Overview (Order No. 
270669), or the i960 CA Microprocessor 
Reference 
Manual (Order No. 270710). 


2.0 
80960CA OVERVIEW 


The 80960CA is the second-generation member of 
the 80960 Family of embedded processors. The 
80960CA is object code compatible with the 32-bit 
80960 Core Architecture while including Special 
Function Register extensions to control on-chip pe- 
ripherals, and instruction set extensions to shift 64- 
bit operands and configure on-chip hardware. Multi- 
ple 128-bit internal busses, on-chip instruction cach- 
ing and a sophisticated instruction scheduler allow 
the processor to sustain execution of two instruc- 
tions every clock, and peak at execution of three 
instructions per clock. 
. 
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A 32-bit demultiplexed and pipelined burst bus pro- 
vides a 132 Mbyte/s bandwidth to a system's high- 
speed external memory sub-system. In addition, the 
80960CA's on-chip caching of instructions, proce- 
dure context and critical program data substantially 
decouples system performance from the wait states 
associated with accesses to the system's slower, 
cost sensitive, main memory sub-system. 


The 80960CA bus controller also integrates full wait 
state and bus width control for highest system per- 
formance with minimal system design complexity. 
Unaligned access and Big Endian byte order support 
reduces the cost of porting existing applications to 
the 80960CA. 


The processor also integrates four complete data- 
chaining DMA channels and a high-speed interrupt 
controller on-chip. The DMA channels perform: sin- 
gle-cycle or two-cycle transfers, data packing and 
unpacking, and data chaining. Block transfers, in ad- 
dition to source or destination synchronized trans- 
fers, are provided.( 
The interrupt controller provides full programmability 
of 248 interrupt sources into 32 priority levels with a 
typical interrupt task switch ("latency") 
time of 
750 ns. 


FOUR-CHANNEL 
DMA CONTROLLER 


INSTRUCTION PREFETCH QUEUE 


CONTROL 
Memory 
Region 


Configuration 


PROGRAMMABLE 
INTERRUPT CONTROLLER 


INSTRUCTION CACHE 


(1K byte. Two-way set associative) 
BUS CONTROLLER 


ADDRESS 


MULTIPLV/DIVIDE 
UNIT 


DATA 
128-BIT CACHE BUS 


SIX-PORT 
REGISTER FILE 


32-BI1' 
BASj 
BUS 


128-BIT 
LOAD BUS 
I 


ST~~fll~us--"""'''''''---- 
I 


270727-2 


Figure 2. 80960CA 
Block Diagram 
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2.1. 
The C-Series Core 


The C-Series core is a very high performance micro- 
architectural implementation of the 80960 Core Ar- 
chitecture. The C-Series core can sustain execution 
of two instructions per clock (66 MIPs at 33 MHz). 
To achieve this level of performance, Intel has incor- 
porated state-of-the-art silicon technology and inno- 
vative microarchitectural constructs into the imple- 
mentation of the C-Series core. Factors that contrib- 
ute to the core's performance include: 
- 
Parallel instruction decoding allows issue of up 
to three instructions per clock. 
- 
Most instructions execute in a single clock. 
- 
Parallel instruction 
decode 
allows sustained, 
simultaneous execution of two single-clock in- 
structions every clock cycle. 
- 
Efficient instruction pipeline minimizes pipeline 
break losses. 
- 
Register 
and 
resource 
scoreboarding 
allow 
simultaneous multi-clock instruction execution. 
- 
Branch look-ahead and prediction allows many 
branches to execute with no pipeline break. 
- 
Local Register Cache integrated on-chip caches 
Call/Return context. 
- 
Two-way set associative, 1 Kbyte integrated in- 
struction cache. 
- 
1 Kbyte integrated Data RAM sustains a four- 
word (128-bit) access every clock cycle. 


'2.2: 
Pipelined, 
Burst Bus 


A 32-bit high performance bus controller interfaces 
the 80960CA to external memory and peripherals. 
The Bus Control Unit features a maximum transfer 
rate of 132 Mbytes per second (at 33 MHz). Internal- 
ly programmable wait states and 16 separately con- 
figurable memory regions allow the processor to in- 
terface with a variety of memory subsystems with a 
minimum of system complexity and a maximum of 
performance. The Bus Controller's main features in- 
clude: 


3-198 


- 
Demultiplexed, Burst Bus to exploit most efficient 
DRAM access modes. 
- 
Address Pipelining to reduce memory cost while 
maintaining performance. 
- 
32-, 16-'and 8-bit modes for 110 interfacing ease. 


- 
Full internal wait state generation to reduce sys- 
tem cost. 


- 
Little and Big Endian support to ease application 
development. 


- 
Un'alignedaccess support for code portability. 


- 
Three-deep request queue to decouple the bus 
from the core. 


2.3. 
Flexible DMA Controller 


A four channel DMA controller provides high speed 
DMA control for data transfers involving peripherals 
and memory. The DMA provides advanced features 
such as data chaining, byte assembly and disassem- 
bly, and a high performance fly-by mode capable of 
transfer speed of up to 59 Mbytes per second at 
33 MHz. The DMA controller features a performance 
and flexibility which is only possible by integrating 
the DMA controller.and the 80960CA core. 


2.4. 
Priority Interrupt Controller 


A programmable-priority interrupt controller man- 
ages up to 248 external sources through the 8-bit 
external interrupt port. The Interrupt Unit also han- 
dles the four internal sources from the DMA control- 
ler, and a single non-maskable interrupt input. The 
8-bit interrupt port can also be configured to provide 
individual interrupt sources that are level or edge 
triggered. 


Interrupts in the 80960CA are prioritized and sig- 
naled within 270 ns of the request. If the interrupt is 
of higher priority than the processor priority, the con- 
text switch to the interrupt routine typically is com- 
plete in another 480 ns. The interrupt unit provides 
the mechanism for the low latency and high through- 
put interrupt service which is essential for embedded 
applications. 


I 
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2.5. 
Instruction 
Set Summary 


The following 
table summarizes 
the 80960CA 
instruction 
set by logical groupings. 
See the i960 CA Microproc- 


essor Reference 
Manual 
for a complete 
description 
of the instruction 
set. 


Data 
, 
Arithmetic 
Logical 
Bit, Bit Field 
Movement 
and Byte 


Load 
Add 
And 
Set Bit 
Store 
Subtract 
Not And 
Clear Bit 
Move 
Multiply 
And Not 
Not Bit 
Load Address 
Divide 
Or 
Alter Bit 
Remainder 
Exclusive 
Or 
Scan for Bit 
Modulo 
Not Or 
Span over Bit 
Shift 
Or Not 
Extract 


• Extended 
Nor 
Modify 
Shift 
Exclusive 
Nor 
Scan Byte for Equal 
Extended 
Not 
Multiply 
. 
Nand 
Extended 
Divide 
Add with 
Carry 
Subtract 
with 
Carry 
Rotate 


Comparison 
Branch 
Call and Return 
Fault 


Compare 
Unconditional 
Call 
Conditional 
Conditional 
Branch 
Call Extended 
Fault 
Compare 
Conditional 
Call System 
Synchronize 
Compare 
and 
Branch 
Return 
Faults 
Increment 
Compare 
and 
Branch and Link 
Compare 
and 
Branch 
Decrement 
Test Condition 
Code 
Check Bit 


Debug 
Processor 
Atomic 
Management 


Modify Trace 
Modify 
Atomic Add 
Controls 
Process 
' Atomic 
Modify 
Mark 
Controls 
Force Mark 
Modify 
Arithmetic 
Controls 
'System 
Control 
'DMA 
Control 
Flush Local 
Registers 


NOTE: 
Instructions 
marked 
by (") are 80960CA 
extensions 
to the 80960 
instruction 
set. 


I 
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3.2. Pin uescnpnons 


The 80960CA 
pins are described 
in this section. 
Ta- 
ble 1 presents 
the legend for interpreting 
the pin de- 


scriptions 
in the following 
tables. 


Pins associated 
with the 32-bit demultiplexed 
proc- 
essor bus are described 
in Table 2. Pins associated 
with 
basic 
processor 
configuration 
and control 
are 
described 
in 
Table 
3. 
Pins 
associated 
with 
the 
80960CA 
OMA Controller 
and Interrupt 
Unit are de- 
scribed 
in Table 4. 


Figure 
3 provides 
an example 
pin description 
table 
entry. "1/0" signifies 
that data pins are input-output. 


"S" 
indicates 
the pins are synchronous 
to PCLK2:1. 


"H(Z)" 
indicates 
that these pins float while the proc- 
essor 
bus is in a Hold Acknowledge 
state. 
"R(Z)" 
notation 
indicates 
that 
the 
pins 
also 
float 
while 
RESET is low. 


All pins float 
while 
the 
processor 
is in the 
ONCE 
mode. 


-.I" 
--_ 
.. 
- .--- .- - -_ .._... 


S(E) 
Edge sensitive 
input 
S(L) 
Level sensitive 
input 


A( ... 
) 
Asynchronous. 
Inputs may be 
asynchronous 
to PCLK2:1. 
A(E) 
Edge sensitive 
input 
A(L) 
Level sensitive 
input 


H( ... 
) 
While the processor's 
bus is in the 
Hold Acknowledge 
or Bus Backoff 
state, the pin: 
H(1) 
is driven to vcc 
H(O) 
is driven to Yss 
H(Z) 
floats 
H(Q) 
continues 
to be a valid output 


R( ... 
) 
While the processor's 
RESET pin is 
low, the pin 
R(1) 
is driven to vcc 
R(O) 
is driven to Vss 
R(Z) 
floats 
R(Q) 
continues 
to be a valid output 


Name 
Type 
Description 


031:0 
1/0 
OAT A BUS carries 32, 16 or 8-bit data quantities 
depending 
on bus width configuration. 
The 
S(L) 
least significant 
bit of the data is carried on DO and the most significant 
on 031. When the 


H(Z) 
bus is configured 
for 8-bit data, the lower 8 data lines, 07:0 are used. For 16-bit bus widths, 


R(Z) 
015:0 are used. For 32-bit bus widths the full data bus is used. 


Figure 
3. Example 
Pin Description 
Entry 
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Table 
2. 80960CA 
Pin Descrlptlon-External 
Bus Signals 


Name 
Type 
Description 


A31:2 
0 
ADDRESS 
BUS carries the upper 30 bits of the physical address. 
A31 is the most 
S 
significant 
address 
bit and A2 is the least significant. 
During a bus access, A31:2 


H(Z) 
identify all external 
addresses 
to word (4-byte) boundaries. 
The byte enable 


R(Z) 
signals indicate the selected 
byte in each word. During burst accesses, 
A3 and A2 
increment 
to indicate successive 
data cycles. 


D31:0 
I/O 
DATA BUS carries 32, 16 or 8-bit data quantities 
depending 
on bus width 
S(L) 
configuration. 
The least significant 
bit of the data is carried on DO and the most 


H(Z) 
significant 
on D31. When the bus is configured 
for 8 bit data, the lower 8 data 


R(Z) 
lines, D7:0 are used. For 16-bit bus widths, D15:0 are used. For 32-bit bus widths 
the full data bus is used. 


BE3 
0 
BYTE ENABLES 
select which of the four bytes addressed 
by A31:2 are active 
BE2 
S 
during an access to a memory region confi~d 
for a 32-bit data-bus 
width. BE3 


BE1 
H(Z) 
applies to D31 :24; BE2 applies to D23:16; BE1 applies to D15:8; and BEO applies 


BEO 
R(1) 
to D7:0. 


32-bit bus: 
BE3 
-Byte 
Enable 3 
-enable 
D31 :24 
BE2 
-Byte 
Enable 2 
-enable 
D23:16 
BE1 
-Byte 
Enable 1 
-enable 
D15:8 
BEO 
-Byte 
Enable 0 
-enable 
D7:0 


For accesses 
to a memory r~n 
configured 
for a 16-bit data-bus 
width, ~ 
processor 
directly encodes 
BE3, BE1 and BEO to provided 
BHE, A 1 and BLE 
respectively. 


16-bit bus: 
BE3 
-Byte 
High Enable (BHE) 
-enable 
D15:8 
BE2 
-Not 
used (is driven high or low) 
BE1 
-Address 
Bit 1 (A 1) 
BEO 
-Byte 
Low Enable (BLE) 
-enable 
D7:0 


For accesses 
to a memory r~n 
co~red 
for an 8-bit data bus width, the 
processor 
directly encodes 
BE1 and BEO to provide A 1 and AO respectively. 


8-bitbus: 
BE3 
-Not 
used (is driven high or low) 
BE2 
-Not 
used (is driven high or low) 
BE1 
-Address 
Bit 1 (A 1) 
BEO 
-Address 
Bit 0 (AO) 


W/R 
0 
WRITE/READ 
is asserted 
for read requests and deasserted 
for write requests. 
S 
The WIR signal changes 
in the same clock cycle as ADS. It remains valid for the 


H(Z) 
entire access in non-pipelined 
regions. In pipelined 
regions, W/R 
is not 


R(O) 
guaranteed 
to be valid in the last cycle of a read access. 


ADS 
0 
ADDRESS 
STROBE 
indicates 
valid address and the start of a new bus access. 


S 
ADS is asserted 
for the first clock of a bus access. 


H(Z) 
R(1) 


READY 
I 
READY 
is an input which signals the termination 
of a data transfer. 
READY is 
S(L) 
used to indicate that read data on the bus is valid, or that a write-data 
transfer 
has 
H(Z) 
completed. 
The READY signal works in conjunction 
with the internally 


R(Z) 
prpgrammed 
wait-state 
generator. 
If READY is enabled 
in a region, the pin is 
sampled 
after the programmed 
number of wait-states 
has expired. If the READY 
pin is deasserted, 
wait states continue 
to be inserted 
until READY becomes 
asserted. 
This is true for the NRAD, NRDD, NWAD, and NWDD wait states. The 
NXDA wait states cannot be extended. 


I 
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Table 2. 80960CA 
Pin Description-External 
Bus Signals (Continued) 


Name 
Type 
Description 


BTERM 
I 
BURST TERMINATE-The 
burst terminate 
signal breaks up a burst access and 
S(L) 
causes another address cycle to occur. The STERM signal works in conjunction 
H(Z) 
with the internally 
programmed 
wait-state 
generator. 
If READY and STERM are 
R(Z) 
enabled 
in a region, the STERM pin is sampled 
after the programmed 
number of 
wait states has expired. When STERM is asserted, 
a new ADS si~enerated 
and the access is completed. 
The READY input is ignored when STERM is 
asserted. 
STERM must be externally 
synchronized 
to satisfy the STERM setup 
and hold times. 


WAIT 
0 
WAIT indicates 
internal wait state generator 
status. WAIT is asserted 
when wait 
S 
states are being caused by the internal wait state generator 
and not by the __ 
H(Z) 
READY or STERM inputs. WAIT can be used to derive a write-data 
strobe. WAIT 


R(1) 
can also be thought 
of as a READY output that the processor 
provides 
when it is 
inserting wait states. 


BLAST 
0 
BURST LAST indicates 
the last transfer 
in a bus access. 
SLAST is asserted 
in the 
S 
last data transfer 
of burst and non-burst 
accesses 
after the wait state counter 
H(Z) 
reaches zero. SLAST remains asserted 
until the clock following 
the last cycle of 
R(O) 
the last data transfer 
of a bus access. 
If the READY or STERM input is used to 
extend wait states, the SLAST signal remains asserted 
until READY or STERM 
terminates 
the access. 


DT/R 
0 
DATA TRANSMIT IRECEIVE 
indicates 
direction 
for data transceivers. 
DT IR is 
S 
used in conjunction 
with DEN to provide control 
for data transceivers 
attached 
to 
H(Z) 
the external bus. When DTlA is asserted, 
the signal indicates 
that the processor 
R(O) 
receives 
data. Conversely, 
when deasserted 
the processor 
sends data. DTlA 
changes 
only while DEN is high. 


DEN 
0 
OAT A ENABLE indicates 
data cycles in a bus request. 
DEN is asserted 
at the 
S 
start of the bus request first data cycle and is deasserted 
at the end of the last 
H(Z) 
data cycle. DEN is used in conjunction 
with DTlA to provide control for data 
R(1) 
transceivers 
attached 
to the external 
bus. DEN remains asserted 
for sequential 
reads from pipe lined memory regions. DEN is deasserted 
when DTlA changes. 


LOCK 
0 
BUS LOCK indicates 
that an atomic read-modify-write 
operation 
is in progress. 
S 
LOCK may be used to prevent external 
agents from accessing 
memory which is 
H(Z) 
currently 
involved 
in an atomic operation. 
LOCK is asserted 
in the first clock of an 
R(1) 
atomic operation, 
and de asserted 
in the clock cycle following 
the last bus access 
for the atomic operation. 
To allow the most flexibility 
for a memory system 
enforcement 
of locked accesses, 
the processor 
acknowledges 
a bus hold request 
when LOCK is asserted. 
The processor 
performs 
DMA transfers 
while LOCK is 
active. 


HOLD 
I 
HOLD REQUEST 
signals that an external agent requests access to the external 
S(L) 
bus. The processor 
asserts HOLDA after completing 
the current bus request. 


H(Z) 
HOLD, HOLDA and SREQ are used together 
to arbitrate 
access to the 
R(Z) 
processor's 
external 
bus by external bus agents. 


BOFF 
I 
BUS BACKOFF 
-The 
backoff 
pin, when asserted, 
suspends 
the current access 
S(L) 
and causes the bus pins to float. When deasserted, 
the ADS signal is asserted 
on 
H(Z) 
the next clock cycle and the access is resumed. 
R(Z) 
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Table 2. 80960CA 
Pin Description-External 
Bus Signals (Continued) 


Name 
Type 
Description 


HOLDA 
0 
HOLD ACKNOWL~DGE 
indicates 
to a bus requestor 
that the processor 
has 
S 
relinquished 
control 
of the external bus. When HOLDA is asserted, 
the external 


H(1) 
address bus, data bus and bus control 
signals are floated. 
HOLD, BOFF, HOLDA 


R(Q) 
and BREQ are used together 
to arbitrate 
access to the processor's 
external 
bus . 


by external 
bus agents. Since the processor 
grants HOLD requests 
and enters the 
Hold Acknowledge 
state even while RESET is asserted, 
the state of the HOLDA 
pin is independent 
of the RESET pin. 


BREQ 
0 
BUS REQUEST 
is asserted when the bus controller 
has a request pending. 
BREQ 
S 
can be used by external 
bus arbitration 
logic in conjunction 
with HOLD and 


H(Q) 
HOLDA to determine 
when to return mastership 
of the external bus to the 


R(O) 
processor. 


DIC 
0 
DATA OR CODE is asserted 
for a data request and deasserted 
for instruction 
S 
requests. DIe has the same timing as W/R. 


H(Z) 
R(Z) 


DMA 
0 
DMA ACCESS indicates 
whether 
the bus request was initiated by the DMA 
S 
controller. 
DMA is asserted 
for any DMA request. DMA is deasserted 
for all other 


H(Z) 
requests. 


R(Z) 


SUP 
0 
SUPERVISOR 
ACCESS indicates 
whether 
the bus request is issued while in 
S 
supervisor 
mode. SUP is asserted 
when the request has supervisor 
privileges, 
and 


H(Z) 
is deasserted 
otherwise. 
SUP can be used to isolate supervisor 
code and data 


R(Z) 
structures 
from non-supervisor 
requests. 


Table 3. 80960CA 
Pin Description-Processor 
Control 
Signals 


Name 
Type 
Description 


RESET 
I 
RESET causes the chip to reset. When RESET is asserted, 
all external 
signals return 
A(L) 
to the reset state. When RESET is deasserted, 
initialization 
begins. When the 2-x clock 


H(Z) 
mode is selected, 
RESET must remain asserted 
for 16 PCLK2:1 cycles before being 


R(Z) 
deasserted 
in order t$9uarantee 
correct 
processor 
initialization. 
When the t-x clock 


N(Z) 
mode is selected, 
RE 
ET must remain asserted 
for 10,000 PCLK2:1 cycles before 
being deasserted 
in order to guarantee 
correct initialization 
of the processor. 
The 
CLKMODE 
pin selects 
1-x or 2-x input clock division of the CLKIN pin. 


The processor's 
Hold Acknowledge 
bus state functions 
while the chip is reset. If the 
processor's 
bus is in the Hold Acknowledge 
state when RESET is asserted, 
the 
processor 
will internally 
reset, but maintains 
the Hold Acknowledge 
state on external 


pins until the Hold request is removed. 
If a Hold request is made while the processor 
is 
in the reset state, the processor 
bus will grant HOLDA and enter the Hold 
Acknowledge 
state. 


FAIL 
0 
FAIL indicates 
failure of the processor's 
self-test 
performed 
at initialization. 
When 
S 
RESET is deasserted 
and the processor 
begins initialization, 
the FAIL pin is asserted. 


H(Q) 
An internal self-test 
is performed 
as part of the initialization 
process. 
If this self-test 


R(O) 
passes, the FAIL pin is deasserted; 
otherwise 
it remains asserted. 
The FAIL pin is 
reasserted 
while the processor 
performs 
and external 
bus self-confidence 
test. If this 
self-test 
passes, the processor 
deasserts 
the FAIL pin and branches 
to the users 
initialization 
routine, otherwise 
the FAIL pin remains asserted. 
Internal self-test 
and the 
use of the FAIL pin can be disabled with the STEST pin. 


I 
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Table 3. 80960CA 
Pin Description-Processor 
Control 
Signals (Continued) 


Name 
Type 
Description 


STEST 
I 
SELF TEST causes the processor's 
internal self-test 
feature to be enabled 
or disabled 
S{L) 
at initialization. 
STEST is read on the rising edge of RESET. When asserted, 
the 


H{Z) 
processor's 
internal self-test 
and external 
bus confidence 
tests are performed 
during 


R{Z) 
processor 
initialization. 
When deasserted, 
only the bus confidence 
tests are 
performed 
during initialization. 


ONCE 
I 
ON CIRCUIT 
EMULATION 
causes all outputs to be floated when asserted. 
ONCE is 
A{L) 
continuously 
sampled while RESET is low and is latched on the rising edge of RESET. 


H{Z) 
To place the processor 
in the ONCE state: 


R{Z) 
(1) 
assert RESET and ONCE (order does not matter) 
(2) 
wait for at least 16 CLKIN periods in 2-x mode, or 10,000 CLKIN periods in 1-x 
mode, after VCC and CLKIN are within operating 
specifications 


(3) 
deassert 
RESET 
(4) 
wait at least 32 CLKIN periods 
(The processor 
will now be latched in the ONCE state as long as RESET is high.) 


To exit the ONCE state, bring vcc and CLKIN to operating 
conditions, 
then assert 
RESET and bring ONCE high prior to deasserting 
RESET. 


CLKIN must operate within the specified 
operating 
conditions 
of the processor 
until 
step 4 above has been completed. 
The CLKIN may then be changed 
to DC to achieve 
the lowest possible ONCE mode leakage current. 


ONCE can be used by emulator 
products 
or for board testers to effectively 
make an 
installed 
processor 
transparent 
in the board. 


CLKIN 
I 
CLOCK INPUT is an input for the external clock needed to run the processor. 
The 
A{E) 
external 
clock is internally 
divided as prescribed 
by the CLKMODE 
pin to produce 


H{Z) 
PCLK2:1. 


R{Z) 


CLKMODE 
I 
CLOCK MODE selects the division factor applied to the external clock input (CLKIN). 


A{L) 
When CLKMODE 
is high, CLKIN is divided by one to create PCLK2:1 and the 


H{Z) 
processor's 
internal clock. When CLKMODE 
is low, CLKIN is divided by two to create 
, 


R{Z) 
PCLK2:1 and the processor's 
internal clock. CLKMODE 
should be tied high, or low in a 
system, as the clock mode is not latched by the processor. 
If left unconnected, 
the 
processor 
will internally 
pull the CLKMODE 
pin low, enabling the 2-x clock mode. 


PCLK2 
0 
PROCESSOR 
OUTPUT 
CLOCKS 
provide a timing reference 
for all inputs and outputs 
PCLK1 
S 
of the processor. 
All inputs and output timings are specified 
in relation to PCLK2 and 


H{Q) 
PCLK1. PCLK2 and PCLK1 are identical 
signals. Two output pins are provided 
to allow 


R{Q) 
flexibility 
in the system's 
allocation 
of capacitive 
loading on the clock. PCLK2:1 may 
also be connected 
at the processor 
to form a single clock signal. 


Vss 
- 
GROUND 
connections 
consist of 24 pins which must be connected 
externally 
to a Vss 


board plane. 


Vcc 
- 
POWER connections 
consist of 24 pins which must be connected 
externally 
to a vcc 
board plane. 


VCCPLL 
- 
VCCPLL is a separate vcc supply pin for the phase lock loop used in 1-x clock mode. 
Connecting 
a simple lowpass filter to VCCPLL may help reduce clock jitter (TCP) in 
noisy environments. 
Otherwise, 
VCCPLL should be connected 
to VCC. The VCCPLL pin 


is not implemented 
until the D-stepping. 
See Section 3.6 for stepping 
register 
information. 


NIC 
- 
NO CONNECT 
pins must not be connected 
in a system. 
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Table 
4. 80960CA 
Pin Description-DMA 
and Interrupt 
Unit Control 
Signals 


Name 
Type 
Description 


DREQ3 
I 
DMA REQUEST 
causes a DMA transfer 
to be requested. 
Each of the four signals 
DREQ2 
A(L) 
requests 
a transfer 
on a single channel. 
DREQO requests 
channel 
0, DREQ1 


DREQ1 
H(Z) 
requests 
channel 
1, etc. When two or more channels 
are requested 
simultaneously, 


DREQO 
R(Z) 
the channel 
with the highest priority is serviced first. The channel 
priority mode is 
programmable. 


DACK3 
0 
DMA ACKNOWLEDGE 
indicates 
that a DMA transfer 
is being executed. 
Each of the 


DACK2 
S 
four signals acknowledges 
a transfer for a single channel. 
DACKO acknowledges 


DACK1 
H(1) 
channel 
0, DACK1 acknowledges 
channel 
1, etc. DACK3:0 
are asserted 
when the 


DACKO 
R(1) 
requesting 
device of a DMA is accessed. 


EOP3/TC3 
I/O 
END OF PROCESS/TERMINAL 
COUNT can be programmed 
as either an input 
EOP2/TC2 
A(L) 
(EOP3:0) or as an output (TC3:0), but not both. Each pin is individually 


EOP1/TC1 
H(Z/Q) 
programmable. 
When programmed 
as an input, EOPx causes the termination 
of a 


EOPO/TCO 
R(Z) 
current DMA transfer for the channel 
corresponding 
to the EOPx pin. EOPO 
corresponds 
to channel 
0, EOP1 corresponds 
to channel 
1, etc. When a channel 
is 
configured 
for source and destination 
chaining, the EOP pin for that channel 
causes 
termination 
of only the current buffer transferred 
and causes the next buffer to be 
transferred. 
EOP3:0 are asynchronous 
inputs. 


When programmed 
as an output, the channel's 
TCx pin indicates 
that the channel 
byte count has reached 
0 and a DMA has terminated. 
TCx is driven with the same 
timing as DACKx during the last DMA transfer for a buffer. If the last bus request 
is 
executed 
as multiple bus accesses, 
TCx will stay asserted 
for the entire bus request. 


XINT7 
I 
EXTERNAL 
INTERRUPT 
PINS cause interrupts 
to be requested. 
These pins can be 
XINT6 
A(E/L) 
configured 
in three modes. 


XINT5 
H(Z) 
In the Dedicated 
Mode, each pin is a dedicated 
external 
interrupt 
source. Dedicated 


'XINT4 
R(Z) 
inputs can be individually 
programmed 
to be level (Iow) or edge (falling) activated. 


XINT3 
In the Expanded 
Mode, the a pins act together 
as an a-bit vectored 
interrupt 
source. 


XINT2 
The interrupt 
pins in this mode are level activated. 
Since the interrupt 
pins are active 


XINT1 
low, the vector number requested 
is the one's complement 
of the positive logic value 


XINTO 
place on the port. This eliminates 
glue logic to interface 
to combinational 
priority 
encoders 
which output negative 
logic. 


In the Mixed Mode, XINT7:5 are dedicated 
sources and XINT4:0 act as the 5 most 
significant 
bits of an expanded 
mode vector. The least significant 
bits are set to 010 
internally. 


NMI 
I 
NON-MASKABLE 
INTERRUPT 
causes a non-maskable 
interrupt 
event to occur. 
A(E) 
NMI is the highest priority interrupt 
recognized. 
NMI is an edge (falling) activated 


H(Z) 
source. 


R(Z) 


I 
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A28 ........ 
014 
028 ........ 
004 
BEO ........ 
R09 
OREOO ..... 
B05 


A27 ........ 
516 
027 ........ 
R02 
5TE5T 
........ 
B02 


A26 ........ 
R15 
026 ........ 
003 
W/R 
....... 
510 
OACK3 
..... 
A10 


A25 ........ 
517 
025 
........ 
501 
ONCE 
........ 
C03 
OACK2 
..... 
A09 


A24 ........ 
015 
024 ........ 
R01 
A05 
....... 
R06 
OACK1 
..... 
A08 


A23 ........ 
R16 
023 ........ 
002 
CKLlN 
........ 
C13 
OACKO ..... 
B08 


A22 ........ 
R17 
022 
........ 
P03 
REAOY ..... 
503 
CLKMOOE 
.... 
C14 


A21 ........ 
016 
021 ........ 
001 
BTERM ..... 
R04 
PCLK1 ........ 
B14 
EOP/TCO 
... A11 


A20 
........ 
P15 
020 
........ 
P02 
PCLK2 ........ 
B13 
EOP/TC1 
... A12 


A19 
........ 
P16 
019 
........ 
P01 
WAIT ....... 
512 
EOP/TC2 
... A13 


A18 ........ 
017 
018 ........ 
N02 
BLAST 
..... 
508 
VSS 
EOP/TC3 
... A14 


A17 
........ 
P17 
017 ........ 
N01 
Location 


A16 ........ 
N16 
016 ........ 
M01 
OT/R ....... 
511 
C07, C08, C09, 
XINT7 
...... 
C17 


A15 ........ 
N17 
015 
........ 
L01 
OEN 
....... 
509 
C10, C11, C12, 
XINT6 
...... 
C16 


A14 ........ 
M17 
014 
........ 
L02 
F15, G03, G15, 
XINT5 
...... 
B17 
H03, H15, J03, 


A13 
........ 
L16 
013 
........ 
K01 
LOCK 
...... 
514 
J15, K03, K15, 
XINT4 
...... 
C15 


A12 
........ 
L17 
012 
........ 
J01 
L03, L15, M03, 
XINT3 
...... 
B16 
M15, 007, 008, 


A11 ........ 
K17 
011 ........ 
H01 
HOLO ...... 
R05 
009, 010, 011 
XINT2 
...... 
A17 


A10 
........ 
J17 
010 ........ 
H02 
HOLOA ..... 
504 
Vcc 
XINT1 
...... 
A15 


A9 ..... 
, ... H17 
09 ......... 
G01 
BREO ...... 
R13 
Location 
XINTO ...... 
B15 


A8 ......... 
G17 
08 
......... 
F01 
B07, B09, 


A7 ......... 
G16 
07 
......... 
E01 
O/C ........ 
513 
B11, B12, C06, 
NMI ........ 
015 


A6 
......... 
F17 
06 
......... 
F02 
OMA ....... 
R12 
E15, F03, F16, 
G02, H16, J02, 
A5 ......... 
E17 
05 ......... 
001 
5UP 
....... 
012 
J16, K02, K16, M02, 


A4 ...... 
: .. E16 
04 
......... 
E02 
M16, N03, N15, 
006, 
R07, R08, 
R10,R11 


VCCPLL 
....... 
B10 


A3 ......... 
017 
03 ......... 
C01 
BOFF 
...... 
B01 
No Connect 


A2 ......... 
016 
02 ......... 
002 
Location 


01 ......... 
C02 
A01, A03, A04, A05, 
B03, B04, C04, C05, 
003 


00 
......... 
E03 
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Table 6. PGA Pin Name with Package 
Location 
(Pin Order) 


Address 
Bus 
Data Bus 
Bus Control 
Processor 
Control 
1/0 


Location 
.. Name 
Location 
.. Name 
Location 
.. Name 
Location 
.... 
Name 
Location 
.. Name 


A01 ......... 
NC 
C01 ......... 
03 
G01 ......... 
09 
M01 
......... 
016 
R01 ........ 
024 


A02 
....... 
FAIL 
C02 ......... 
01 
G02 ........ 
VCC 
M02 
......... 
VCC 
R02 ........ 
027 


A03 ......... 
NC 
C03 ...... 
ONCE 
G03 ........ 
Vss 
M03 .......... 
Vss 
R03 ........ 
031 


A04.: 
....... 
NC 
C04 ......... 
NC 
G15 ........ 
Vss 
M15 .......... 
Vss 
R04 ..... 
BTERM 


A05 ......... 
NC 
C05 ......... 
NC 
G16 ......... 
A7 
M16 
......... 
VCC 
R05 ...... 
HOLO 


A06 ..... 
OREQ1 
C06 ........ 
VCC 
G17 ......... 
A8 
M17 .......... 
A14 
R06 
....... 
AOS 


A07 ..... 
ORE03 
C07 ........ 
Vss 
R07 ........ 
vcc 


AM 
..... 
OACK1 
C08 ........ 
Vss 
H01 ........ 
011 
N01 .......... 
017 
R08 ........ 
vcc 


A09 ..... 
OACK2 
C09 ........ 
Vss 
H02 ........ 
010 
N02 .......... 
018 
R09 ........ 
BEO 


A10 ..... 
OACK3 
C10 ........ 
Vss 
H03 ........ 
vss 
N03 .......... 
VCC· 
R10 ........ 
vcc 


A11 ... EOP/TCO 
C11 ........ 
Vss 
H15 ........ 
Vss 
N15 .......... 
vex: 
R11 ........ 
vcc 


A12 ... EOP/TC1 
C12 ........ 
Vss 
H16 ........ 
vcc 
N16 .......... 
A16 
R12 ....... 
OMA 


A13 ... EOP/TC2 
C13 ...... 
CLKIN 
H17 ......... 
A9 
N17 .......... 
A15 
R13 ...... 
BREO 


A14 ... EOP/TC3 
C14 .. CLKMOOE 
R14 ........ 
A29 


A15 ...... 
XINT1 
C15 ...... 
XINT4 
J01 
........ 
012 
P01 .......... 
019 
R15 ........ 
A26 


A16 ..... 
RESET 
C16 ...... 
XINT6 
J02 ........ 
vcc 
P02 .......... 
020 
R16 ........ 
A23 


A17 ...... 
XINT2 
C17 ...... 
XINT7 
J03 
........ 
Vss 
P03 .......... 
022 
R17 ........ 
A22 


J15 
........ 
Vss 
P15 .......... 
A20 
, 


B01 
...... 
BOFF 
001 
......... 
05 
J16 ........ 
vcc 
P16 .......... 
A19 
S01 ........ 
025 


B02 
..... 
STEST 
002 
......... 
02 
J17 
........ 
A10 
P17 .......... 
A17 
S02 ........ 
029 


B03 ......... 
NC 
003 ......... 
NC 
S03 ..... 
REAOY 


B04 ......... 
NC 
015 ........ 
NMI 
K01 ........ 
013 
001 .......... 
021 
S04 ..... 
HOLOA 


B05 ..... 
OREOO 
016 
......... 
A2 
K02 ........ 
vcc 
002 .......... 
023 
S05 ........ 
BE3 


B06 ..... 
ORE02 
017 
......... 
A3 
K03 ........ 
Vss 
003 .......... 
026 
S06 ........ 
BE2 


B07 ........ 
vcc 
K15 ........ 
Vss 
004 .......... 
028 
S07 ........ 
BE1 


B08 ..... 
OACKO 
E01 ......... 
07 
K16 ........ 
vcc 
005 .......... 
030 
S08 
..... 
BLAST 


B09 ........ 
vcc 
E02 ......... 
04 
K17 ........ 
A11 
006 .......... 
vcc 
S09 
....... 
OEN 


B10 ..... 
VCCPLL 
E03 ......... 
00 
007 
.......... 
Vss 
S10 
....... 
WIR 


B11 ........ 
VCC 
E15 ........ 
VCC 
L01 ........ 
015 
008 
.......... 
Vss 
S11 ....... 
OT/R 


B12 ........ 
vcc 
E16 ......... 
A4 
L02 ........ 
014 
009 
.......... 
Vss 
S12 ....... 
WAIT 


B13 
..... 
PCLK2 
E17 ......... 
A5 
L03 ........ 
Vss 
010 
.......... 
Vss 
S13 ........ 
0/C 


B14 
..... 
PCLK1 
L15 ........ 
Vss 
011 
.......... 
Vss 
S14 
...... 
LOCK 


B15 ...... 
XINTO 
F01 ......... 
08 
L16 
........ 
A13 
012 
......... 
SUP 
S15 ........ 
A31 


B16 ...... 
XINT3 
F02 ......... 
06 
L17 
........ 
A12 
013 .......... 
A30 
S16 ........ 
A27 


B17 ...... 
XINT5 
F03 ........ 
VCC 
014 .......... 
A28 
S17 ........ 
A25 


F15 ........ 
Vss 
015 .......... 
A24 


F16 ........ 
VCC 
016 .......... 
A21 


F17 
......... 
A6 
017 .......... 
A18 
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s 
R 
a 
p 
N 
M 
L 
K 
J 
H 
G 
F 
E 
D 
C 
B 
A 
r 
c 
021 
0,7 
0,6 
0,5 
0,3 
0,1 
08 
03 em 
Ne 
025 
024 
019 
012 
09 
07 
05 


2 
c 
~ 
2 
029 
027 
023 
020 
018 
v~C 
0,. 
v~ 
v~ 
0,0 
vcc 
06 
04 
02 
01 
STEST 
FAIL 


3 
~ 
3 
~ 
031 
026 
022 
vCC 
vss 
v~ 
vss 
vss 
vss 
v~ 
vcc 
Do 
~ 
ONCE 
NC 
~NC 


4 
~ 
4 
HOLOA ~ 
028 
NC 
~ 
~NC 


5 
BE3 
HOLD 
030 
Ne 
oREao 
NC 
5 


6 
BE2 
~ 


v;c 
c 
~E02 
~EQ1 
6 
ADS 
vcc 
7 
rn 
v~ 
'"'-- 
7 
vcc 
vss 
vss 
DRE03 


8 
-----'"'- 


v~ 
v~ 
OACtoo 
~ 
8 
BLAST 
vcc 
9 
~ 
9 
om 
lira 
v~ 
vss 
v~ 
5Aoo 


10 
w¥. 
~ 
~ 
~ 
V~PLL 
~ 
10 
vcc 
Vss 
Vss 
11 
OTIR 
v~ 
v~ 
v~ 
v~ 
EOPIffij 
11 


12 
wiJf 
-'-'-- 
sUP 
EoPtrC1 
12 
OMA 
Vss 
Vcc 
13 
o~ 
BREa 
A30 
CLKIN PCLK2 
EOPITC2 
13 


14 
cotR 
A29 
CLKMODE 
PCLKI 
EOPITC3 
14 
A28 


15 
A31 
A26 
A24 
A20 
v~ 
v~ 
Vss 
Vss 
v~ 
Vss 
v~ 
v~c 
/;NI 
-""- 
""""'--- 
XINTI 
15 
vss 
XlNT4 
XINTO 


16 
A27 
A23 
A21 
A19 
A16 
v;; 
A13 
Vcc 
Vcc 
vcc 
A7 
Vcc 
A4 
A2 
XM6 
XINn 
ReSET 
16 


17 
~ 
17 
A25 
A22 
M8 
A17 
A15 
A14 
A12 
All 
Al0 
Ail 
Kg 
A6 
A5 
A3 
XINT7 
xms 
~ 
~ 


S 
R 
a 
p 
N 
M 
L 
K 
J 
H 
G 
F 
E 
D 
C 
B 
A 
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Figure 4a. 80960CA 
PGA Plnout (View from Top Side) 
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A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
a 
R 
s 


0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
NC 
IIrn'F 
03 
05 
07 
08 
D9 
011 
012 
013 
015 
016 
017 
019 
021 
024 
025 


2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
,0 
10 
0 
0 
2 
FAl[ 
STEST 
01 
02 
04 
06 
Vcc 
010 
Vcc 
"cc 
014 
Vcc 
018 
020 
023 
027 
029 


3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 
NC 
NC 
ONCE 
NC 
DO 
Vcc 
Vss 
Vss 
Vss 
Vss 
Vss 
Vss 
Vcc 
022 
026 
031 
REAllY 


4 
0 
0 
0 
0 ~ 
0 
4 
NC 
NC 
NC 
028 
HOLOA 


5 
0 
0 
0 
0 
HgO' 
<fu 
5 
NC 
l5mlii 
NC 
030 


6 
DR~ 
DR~ 
0 
METAL LID 
0 
..Q. 
0 
6 
Vcc 
Vcc 
ADS 
Im 


7 
~ 
0 
0 
0 
0 
0 
7 
vcc 
Vss 
vss 
vcc 
ID 
8 
~ 
~ 
0 
0 
0 
0 
8 
Vss 
Vss 
vcc 
IITAST 


9 
~ 
0 
0 
0 
0 
0 
9 • 


DACK2 
Vcc 
Vss 
Vss 
m 
~ 


10 
0 
0 
0 
0 
0 
0_ 
10 
~ 
VCCPLL 
Vss 
Vss 
Vcc 
WIR 


11 
0 
0 
0 
0 
0 
0_ 
11 
EOPITCO vcc 
Vss 
Vss 
Vcc 
OTIR 


12 
0 
0 
0 
~ 
0 
(L 
12 
EOP~ 
Vcc 
Vss 
l5MA 
WMT 


13 
0 
0 
0 
0 
0 
0_ 
13 
EOP/1'C2 PCLK2 
CLKIN 
A30 
BREa 
DIC 


14 
0 
0 
0 
0 
0 
o.. 
14 
EOP~ 
PCLKl 
CLKMOOE 
A28 
A29 
LOCK 


15 
.s: .o, 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
15 
XINTl 
XlNTO 
XINT4 
]IQ 
vcc 
Vss 
Vss 
Vss 
Vss 
vss 
Vss 
vss 
vcc 
A20 
A24 
A26 
A31 


16 
~ 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
RESET 
XlNT3 XiN'T6 
A2 
A4 
Vcc 
A7 
Vcc 
Vcc 
Vcc 
A13 
Vcc 
A16 
A19 
A21 
A23 
A27 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
17 
XiNf2 
XiiiiT5 XNT7 
A3 
A5 
A6 
A8 
A9 
Al0 
All 
A12 
A14 
A15 
'A17 
A18 
A22 
A25 
•• 


A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
a 
R 
s 
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Figure 4b. 80960CA PGA Plnout (View from Bottom Side) 
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3.3.2 
80960CA 
PQFP Pinout 


Tables 
7 and 
8 list the 
80960CA 
pin names 
with 
package 
location. 


See 
Section 
4.0, 
Electrical 
Specifications 
for 
specifications 
and recommended 
connections. 


Table 7. PQFP Pin Name with Package 
Location 
(Signal Order) 


Address 
Bus 
Data Bus 
Bus Control 
Processor 
Control 
1/0 


Name 
.. Location 
Name 
.. Location 
Name 
.. Location 
Name ........ 
Location 
Name 
.. Location 


A31 
........ 
153 
D31 ........ 
186 
BE3 ........ 
176 
RESET 
........... 
091 
DREQ3 
..... 
060 


A30 
........ 
152 
D30 ........ 
187 
BE2 ........ 
175 
DREQ2 
..... 
059 


A29 
........ 
151 
D29 
........ 
188 
BE1 ........ 
172 
FAIL 
............. 
045 
DREQ1 
..... 
058 


A28 
........ 
145 
D28 ........ 
189 
BEO ........ 
170 
DREQO ..... 
057 


A27 
........ 
144 
D27 
........ 
191 
STEST 
........... 
046 


A26 
........ 
143 
D26 ........ 
192 
W/R ........ 
164 
DACK3 
..... 
065 


A25 
........ 
142 
D25 ........ 
194 
ONCE ............ 
043 
DACK2 
..... 
064 


A24 
........ 
141 
D24 ........ 
195 
ADS ........ 
178 
DACK1 
..... 
063 


A23 
........ 
139 
D23 ........ 
003 
CLKIN ............ 
087 
DACKO ..... 
062 


A22 
........ 
138 
D22 ........ 
004 
READy 
..... 
182 
CLKMODE 
........ 
085 


A21 
........ 
137 
D21 ........ 
005 
BTERM ..... 
184 
PCLK1 
........... 
078 
EOP ITC3 
... 069 


A20 
........ 
136 
D20 
........ 
006 
PCLK2 
........... 
074 
EOP ITC2 
... 068 


A19 
........ 
134 
D19 ........ 
008 
WAIT ....... 
162 
EOP ITC1 ... 067 


A18 
........ 
133 
D18 
........ 
009 
BLAST ...... 
169 
Vss 
EOP ITCO ... 066 


A17 
........ 
132 
D17 
........ 
010 
Location 


A16 
........ 
130 
D16 ........ 
011 
DT/R ....... 
163 
2,7,16,24,30,38, 
XINT7 
...... 
107 


A15 
........ 
129 
D15 
........ 
013 
DEN ........ 
167 
39, 49, 56, 70, 75, 
XINT6 
...... 
106 


A14 
........ 
128 
D14 ........ 
014 
77,81,83, 
88, 89, 
XINT5 
...... 
102 
92,98,105,109,110, 
A13 
........ 
124 
D13 
........ 
015 
LOCK 
...... 
156 
121,125,131,135, 
XINT4 
...... 
101 


A12 
........ 
123 
D12 ........ 
017 
147,150,161,165, 
XINT3 
...... 
100 
173,174, 
185, 196 
A11 
........ 
122 
D11 ........ 
018 
HOLD 
...... 
181 
XINT2 
...... 
095 


A10 
........ 
120 
D10 ........ 
019 
HOLDA 
..... 
179 
Vcc 
XINT1 
...... 
094 


A9 
· ........ 
119 
D9 ......... 
021 
BREQ 
...... 
155 
Location 
XINTO ...... 
093 


A8 
......... 
118 
D8 ......... 
022 
1,12,20,28, 


A7 
· ........ 
117 
D7 ......... 
023 
D/C ........ 
159 
32,37,44, 
50, 
NMI 
........ 
108 


A6 
......... 
116 
D6 ......... 
025 
DMA 
....... 
160 
61,71,79, 
82,96,99, 
103, 
A5 
......... 
114 
D5 ......... 
026 
SUP ........ 
158 
115,127,140,148, 


A4 
· ........ 
113 
D4 ......... 
027 
154,168,171,180,190 


VCCPLL ............ 
72 


A3 
· ........ 
112 
D3 ......... 
033 
BOFF ....... 
040 
No Connect 


A2 
· ........ 
1j 1 
D2 ......... 
034 
Location 


D1 ......... 
035 
29,41,42,47, 
48, 51, 52, 53, 
54, 55, 73, 76, 
80,84,86,90,97, 
104,126,146,149,157, 
166, 177, 183, 193 


DO ......... 
036 
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Table 8. PQFP Pin Name with Package Location (Pin Order) 
Pin 
Signal 
1 
Vcc 
2 
Vss 
3 
023 
4 
022 
5 
021 
6 
020 
7 
Vss 
8 
019 
9 
018 
10 
017 


11 
016 
12 
Vcc 


13 
015 
14 
014 
15 
013 


16 
Vss 


17 
012 
18 
011 
19 
010 
20 
Vcc 
21 
09 
22 
08 
23 
07 
24 
Vss 
25 
06 
26 
05 
27 
04 
28 
Vcc 
29 
NC 
30 
Vss 
31 
NC 
32 
Vcc 
33 
03 
34 
02 
35 
01 
36 
00 
37 
Vcc 
38 
Vss 
39 
Vss 
40 
BOFF 
41 
NC 
42 
NC 
43 
ONCE 
44 
Vcc 
45 
FAIL 
46 
STEST 
47 
NC 
48 
NC 
49 
Vss 
I 


Pin 
Signal 
50 
Vcc 
51 
NC 
52 
NC 
53 
NC 
54 
NC 
55 
NC 
56 
Vss 
57 
OREQO 
58 
OREQ1 
59 
OREQ2 
60 
OREQ3 
61 
Vcc 
62 
OACKO 
63 
OACK1 
64 
OACK2 
65 
OACK3 
66 
EOPO/TCO 
67 
EOP1/TC1 
68 
EOP2/'fC2 
69 
EOP3/TC3 
70 
Vss 
71 
Vcc 
72 
VCCPLL 
73 
NC 
74 
PCLK2 
75 
Vss 
76 
NC 
77 
Vss 
78 
PCLK1 
79 
Vcc 
80 
NC 
81 
Vss 
82 
VCC 
83 
Vss 
- 


84 
NC 
85 
CLKMOOE 
86 
NC 
87 
CLKIN 
88 
Vss 
89 
Vss 
90 
NC 
91 
RESET 
92 
Vss 
93 
XINTO 
94 
XINT1 
95 
XINT2 
96 
Vcc 
97 
NC 
98 
Vss 


Pin 
Signal 
99 
VCC 
100 
XINT3 
101 
XINT4 
102 
XINT5 
103 
Vcc 
104 
NC 
105 
Vss 
106 
XINT6 
107 
XINT7 
108 
NMI 
109 
Vss 
110 
Vss 
111 
A2 
112 
A3 
113 
A4 
114 
A5 
115 
VCC 
116 
A6 
117 
A7 
118 
A8 
119 
A9 
120 
A10 
121 
Vss 
122 
A11 
123 
A12 
124 
A13 
125 
Vss 
126 
NC 
127 
VCC 
128 
A14 
129 
A15 
130 
A16 
131 
Vss 
132 
A17 


133 
A18 


134 
A19 


135 
Vss 
136 
A20 
137 
A21 


138 
A22 
139 
A23 
140 
Vcc 
141 
A24 
142 
A25 
143 
A26 
144 
A27 
145 
A28 
146 
NC 
147 
Vss 


Pin 
Signal 


148 
VCC 
149 
NC 
150 
Vss 
151 
A29 
152 
A30 
153 
A31 
154 
Vcc 
155 
BREQ 
156 
LOCK 
157 
NC 
158 
SUP 
159 
O/G 
160 
OMA 
161 
Vss 
162 
WAIT 
163 
OT/R 
164 
W/R 
165 
Vss 
166 
NC 
167 
OEN 
168 
Vcc 
169 
BLAST 
170 
BEO 
171 
VCC 
172 
BE1 
173 
Vss 
174 
Vss 
175 
BE2 
176 
BE3 
177 
NC 
178 
AOS 
179 
HLOA 
180 
VCC 
181 
HOLO 
182 
REAOY 
183 
NC 
184 
BTERM 
185 
Vss 
186 
031 
187 
030 
188 
029 
189 
028 
190 
VCC 
191 
027 
192 
026 
193 
NC 
194 
025 
195 
024 
196 
Vss 
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Figure 4c. 80960CA 
PQFP Plnout (View from Top Side) 
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PIN 
DETAil 


270727-52 


Family: 
Ceramic 
Pin Grid Array 
Package 


Symbol 
Mllllmeters 
Inches 


Mln 
Max 
Notes 
Mln 
Max 
Notes· 


A 
3.56 
4.57 
0.140 
0.180 


A1 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
23 
0.30 
SOLID LID 
0.110 
0.140 
SOLID LID 


A3 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


0 
44.07 
44.83 
1.735 
1.765 


01 
40.51 
40.77 
1.595 
1.605 


e1 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
168 
168 


S1 
1.52 
2.54 
0.060 
0.100 


ISSUE 
IWS 
REVX 
7/15/88 


I 


Figure 5. 168-Lead Ceramic P~A Package Dimensions 
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Table 9. Ceramic 
PGA Package 
Dimension 
Symbols 


Letter or 
Description 
of DlmensJons 
Symbol 


A 
Distance 
from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


A2 
Distance from base plane to highest point of body 


A3 
Distance 
from seating plane to bottom of body 


B 
Diameter 
of terminal 
lead pin 


0 
Largest overall package 
dimension 
of length 


01 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing between 
true lead position 
centerlines 


L 
Distance 
from seating plane to end of lead 


SI 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. ContrOlling 
dimension: 
millimeter. 


2. Dimension 
"e1" 
("e") 
is non-cumulative. 


3. Seating 
plane (standoff) 
is defined 
by P.C. board 
hole size: 0.0415-0.0430 
inch. 


4. Dimensions 
"S", 
"S1" 
and "C" 
are nominal. 


5. Details 
of Pin 1 identifier 
are optional. 
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3.4.2 
PLASTIC 
QUAD 
FLAT 
PACKAGE 


270727-54 
mm (inch) 


Figure 6. Principal Dimensions 
and Data 


~~ 


~ 
B.25 
<'B1B)@ 
C A® 
B® 
D®lA 


.J.. 
. BB2 MM/MM<IN/IN) 
A-B 


01 
1~ 
B.25 
<'SlS)@ 
C A® 
B® 
0(5) lA 
1.J..1 . SS2 MM/MM(IN/IN) 
IA-BI 


j 


E2 
El -W 


3.S1 
( .15S) 
MAX TYP 
, 


[,SEE 
DETAIL 
M 
I 
'-' 
- I- 1. 91 
c , S75) 
MAX TYP 


~ls.25 
<'SlB)@TcIA®-B®IDFsl] 


.LT. BB2 MM/MM<IN/IN) 
101 


~ 
B.25 
(.B1B)@ 
C A®-B®ID®lA 


. BB2 MM/MM (IN/IN) 
0 


270727-55 
mm (inch) 


Figure 7. Molded 
Details 


1.32 
1.22 


t 


1.32 
<'B52) 
1.22 
<'B4S) 


B.9B 
(.B35) 
MIN. j 


2.S3 
(.SSS)-l 
1. 93 
(.B7/,) 


-~-- 
02 -----l 
270727-58 
mm (inch) 


Figure 8. Detail M 
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-U-j0.b35 
(0.025)1 


11 
y 


11 
It.:',- 


04/E4 


O/E 


SEE OETAIL 
L 


SEE DETAIL 
J 


270727-56 
mm (inch) 


. Figure 
9. Terminal 
Details 


270727-57 
DetallJ 
Detail 
L 
mm (inch) 


Figure 
10. Typical 
Lead 


Table 
10. PQFP Package 
Dimension 
Symbols 


Symbol 
Description 
Mln 
Max 
Mln 
Max 


N 
Leadcount 
196 
196 


A 
Package 
Height 
0.160 
0.170 
4.06 
4.32 


A1 
Standoff 
0.020 
0.030 
0.51 
0.76 


D,E 
Terminal 
Dimension 
1.475 
1.485 
37.47 
37.72 


01, E1 
Package 
Body 
1.347 
1.353 
34.21 
34.37 


D2,E2 
Bumper Distance 
1.497 
1.503 
38.02 
38.18 


D3,E3 
Lead Dimension 
1.200 REF 
30.48 REF 


D4,E4 
Foot Radius Location 
1.423 
1.437 
36.14 
36.49 


L1 
Foot Length 
0.020 
0.030 
0.51 
0.76 


Dimension 
INCH 
mm 


NOTES: 
1. All dimensions 
and tolerances 
conform 
to ANSI Y14.5M-1982. 


2. Datum 
plane ·H· located 
at the mold parting 
line and coincident 
with the bottom 
of the lead where 
lead exits plastic 
body. 


3. Datums 
A·S and ·D· to be determined 
where 
center 
leads exit plastic 
body at datum 
plane ·H·. 


4. ContrOlling 
Dimension, 
Inch. 
5. Dimensions 
D1. D2. E1 and 
E2 are measured 
at the 
mold 
parting 
line. 
D1 and 
E1 do not 
include 
an allowable 
mold 
protrusion 
of 0.18 mm (0.007 
in) per side. D2 and E2 do not include 
a. total allowable 
mold protrusion 
of 0.18 mm (0.007 
in) 


at maximum 
package 
size. 


6. Pin 1 identifier 
is located 
within 
one of the two zones 
indicated. 


7. Measured 
at datum 
plane 
·H·. 
8. Measured 
at seating 
plane datum 
·C·. 
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3.5. 
Package Thermal Specifications 


The 
80960CA 
is specified 
for 
operation 
when 
TC 
(the case 
temperature) 
is within 
the range 
of O·C- 
100·C. TC may be measured 
in any environment 
to 
determine 
whether 
the 80960CA 
is within 
specified 
operating 
range. 
The 
case 
temperature 
should 
be 
measured 
at the center 
of the top surface, 
opposite 
the pins. Refer to Figure 13. 


TA 
(the 
ambient 
temperature) 
can 
be 
calculated 
from (JCA(thermal 
resistance 
from case to ambient) 
with the following 
equation: 


Table 11 shows the maximum 
TA allowable 
(without 


exceeding 
Tcl at various 
airflows 
and operating 
fre- 


quencies 
(fpCLK)' 


Note that TA is greatly improved 
by attaching 
fins or 
a heat sink to the package. 
P (the maximum 
power 


consumption) 
is calculated 
by using the typical 'cc 


as tabulated 
in Section 
4.4, 
DC 
Characteristics, 


and VCC of 5V. 


Table 
11. Maximum 
TA at Various 
Airflows 
In ·C (PGA Package 
Only) 


Alrflow-ft/mln 
(m/sec) 


fpCLK 
0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


TA 
33 
51 
66 
79 
81 
85 
87 
with 
25 
61 
73 
83 
85 
88 
89 
Heat Sink' 
16 
74 
82 
89 
90 
92 
93 


TA 
33 
36 
47 
59 
66 
73 
75 
without 
25 
49 
58 
67 
73 
78 
80 
Heat Sink 
16 
66 
72 
78 
82 
86 
87 
. . 
• 0.285" high unidirectional 
heat sink (AI alloy 6061, 50 mil fin width, 150 mil center-ta-center 
fin spacing) . 


PGA Thermal 
Reslstance-·C/Watt 


Alrflow-ft.lmln 
(m/sec) 


Parameter 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.07) 
(4.06) 
(5.07) 
'.. 


(J Junction-to-Case 
'Jpl" r">. 
'k 
(Case Measured 
1.5 
1.5 
1.5 
1.5 
1.5 
1.5 
as shown in Figure 13) 
I 
'J cop I 


(J Case-to-Ambient 
17 
14 
11 
9 
7.1 
6.6 
IJIJU 
mm 
(No Heatsink) 


(J Case-to-Ambient 


270727-59 


(with Unidirectional) 
13 
9 
5.5 
5.0 
3.9 
3.4 


Heatsink)' 


NOTES: 
1. This table 
applies 
to 80960CA PGA plugged 
into socket 
or soldered 
directly 


into board. 
2. 8JA = 8JC + 8CA· 
3.8J-CAP 
= 4°C/W (approx.) 
8J-PIN = 4°C/W (inner pins) (approx.) 
8J-PIN = 8°C/W (outer 
pins) (approx.) 
• 0.285" high unidirectional 
heat sink (AI alloy 6061.50 
mil fin width. 150 mil 
center-to-center 
fin spacing). 


Figure 
11. 80960CA 
PGA Package 
Thermal 
Characteristics 


I 
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PQFPThermal Resistance-°C/Watt 


Airflow-ft.lmin 
(m/sec) 


Parameter 
0 
50 
100 
200 
400 
600 
800 
(0) (0.25) (0.50) (1.01) (2.03) (3.04) (4.06) 


(J Junction-to-Case 
(Case Measured) 
5 
5 
5 
5 
5 
5 
5 
as shown in Figure 13) 


(J Case-to-Ambient 
19 
18 
17 
15 
12 
10 
9 
(No Heatsink) 


NOTES: 
1. This table applies 
to B0960CA 
PQFP soldered 
directly 
into board. 


2. 8JA = 8JC + 8CA. 
3. 8JL = 1B"C/Walt 
8JB = 1B"C/Walt 


270727-00 


Figure 12. 80960CA PQFP Package Thermal Characteristics 


MEASURE PGA CASE TEMPERATURE 
AT CENTER OF TOP SURFACE 
MEASURE PQFP TEMPERATURE AT 
CENTER OF TOP SURFACE 


o 


270727-62 
270727-61 


Figure 13. Measuring 80960CA PGA and PQFP Case Temperature 
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LSB 


ASCII 


DECIMAL 
Stepping 
Number 


Stepping 
Number 


Figure 14. Register GO 


Table 12 Die Stepping Cross Reference 


GO Least 
Die Stepping 
Significant Byte 


01 
B 


02 
C-1 


03 
C-2,C-3 


04 
D 


I 


P.O. Box 2963 
Austin, TX 
78769-2963 


2. Augat, Inc. 
Interconnection Products Group 
33 Perry Avenue 
P.O. Box 779 
Attleboro, MA 
02703 
(508) 222-2202 


3. Concept Manufacturing Inc. 


(Decoupling Sockets) 
43024 Christy Street 
Fremont, CA 
94538 
(415) 651-3804 


Heat Sinks/Fins 


1. Thermalloy, Inc. 


2021 West Valley View Lane 
Dallas, TX 
75381-0839 
(214) 243-4321 


2. E G & G Division 
60 Audubon Road 
Wakefield, MA 
01880 
(617) 245-5900 
• 
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4.0 
ELECTRICAL SPECIFICATIONS 


4.1 Absolute Maximum Ratings 


Parameter 
Maximum Rating 


Storage Temperature 
-65'C 
to + 150'C 
Case Temperature Under Bias 
-65'C 
to + 110'C 
Supply Voltage wrt. Vss 
-0.5Vto 
+6.5V 
Voltage on Other pins wrt Vss 
- 0.5V to VCC + 0.5V 


intel~ 


NOTICE: This is a production data sheet. The specifi- 
cations are subject to change without notice . 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause 
permanent 
damage. 


These are stress ratings only. Operation 
beyond 
the 


"Operating 
Conditions" 
is not recommended 
and ex- 
tended exposure 
beyond 
the 
"Operating 
Conditions" 
may affect device reliability. 


4.2. Operating Conditions 


Operating 
Conditions 
(80960CA-33, 
-25, -16) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


VCC 
Supply Voltage 
80960CA-33 
4.75 
5.25 
80960CA-25 
4.50 
5.50 
V 
80960CA-16 
4.50 
5.50 


fCLK2x 
Input Clock Frequency 
(z-x Mode) 
80960CA-33 
0 
66.66 
MHz 
80960CA-25 
0 
50 
MHz 
80960CA-16 
0 
32 
MHz 


fCLK1x 
Input Clock Frequency 
(t-x Mode) 
80960CA-33 
8 
33.33 
MHz 
80960CA-25 
8 
25 
MHz 
(1) 
80960CA-16 
8 
16 
MHz 


Tc 
Case Temperature 
Under Bias 
PGA Package 
0 
100 
·C 
80960CA-33, 
-25, -16 
196-Pin PQFP 
0 
100 


NOTE: 
(1) When in the t-x input clock mode, CLKIN is an input to an internal phase-locked loop and must maintain a minimum 
frequency of 8 MHz for proper processor operation. However, in the t-x Mode, CLKIN may still be stopped when the 
processor either is in a reset condition or is reset. If CLKIN is stopped, the specified ~ 
low time must be provided once 
CLKIN restarts and has stabilized. 


4.3 Recommended Connections 


Power 
and 
ground 
connections 
must 
be made 
to 
multiple vcc and Vss 
(GND) pins. Every 80960CA- 
based circuit board should include 
power (VCC>and 
ground 
(Vss) 
planes 
for 
power 
distribution. 
Every 
VCC pin must be connected 
to the power plane, and 
every 
Vss 
pin 
must 
be connected 
to the 
ground 
plane. 
Pins identified 
as "N.C." 
must 
not 
be con- 
nected 
in the system. 


Liberal 
decoupling 
capacitance 
should 
be 
placed 
near the 80960CA. 
The processor 
can cause tran- 
sient 
power 
surges 
when 
its numerous 
output 
buff- 
ers transition, 
particularly 
when 
connected 
to large 
capacitive 
loads. 
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Low 
inductance 
capacitors 
and 
interconnects 
are 
recommended 
for best high frequency 
electrical 
per- 
formance. 
Inductance 
can be reduced 
by shortening 
the board traces between 
the processor 
and decou- 
pling 
capacitors 
as much 
as possible. 
Capacitors 
specifically 
designed 
for PGA packages 
will offer the 
lowest 
possible 
inductance. 


For reliable 
operation, 
always 
connect 
unused 
in- 
puts to an appropriate 
si~ 
level. In particular, 
any 
unused 
interrupt 
(XINT, NMI) or DMA (DREQ) 
input 
should be connected 
to VCC through 
a pull-up resis- 
tor, as should 
BTERM 
if not used. Pull-up resistors 
should 
be in the range 
of 20 KO for each 
pin tied 
high. If READY 
or HOLD are not used, the unused 
input 
should 
be 
connected 
to 
ground. 
N.C. 
pins 
must 
always 
remain 
unconnected. 
Refer 
to the 
i960 CA Microprocessor 
Reference 
Manual for more 
information. 
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4.4. DCSpecifications 


DC Characteristics 
(80960CA-33, 
-25, -16 under the conditions 
described 
in Section 
4.2, Operating 
Conditions.) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Vll 
Input Low Voltage for all pins except RESET 
-0.3 
0.8 
V 


VIH 
Input High Voltage for all pins except RESET 
2.0 
VCC + 0.3 
V 


VOl 
Output Low Voltage 
0.45 
V 
IOl = 5mA 


VOH 
Output High Voltage 
IOH = -1mA 
2.4 
V 
IOH = -200/-LA 
vcc - 
0.5 
V 


VllR 
Input Low Voltage for RESET 
- 
0.3 
1.5 
V 


VIHR 
Input High Voltage for RESET 
3.5 
vcc + 0.3 
V 


IU1 
Input Leakage Current for each pin except 
STERM, ONCE, DREQ3:0, STEST, 
EOP3:0/TC3:0, 
NMI, XINT7:0, 
READY, HOLD, SOFF, CLKMODE 
±15 
/-LA 
OV:>:VIN:>:VCC(1) 


IU2 
Input Leakage Current for: 
STERM, ONCE, DREQ3:0, STEST, 
EOP3:0/TC3:0, 
NMI, XINT7:0, 
SOFF 
0 
-300 
/-LA 
VIN = 0.45V (2) 


IU3 
Input Leakage Current for: 
READY, HOLD, CLKMODE 
0 
500 
/-LA 
VIN = 2.4V (3) 


IlO 
Output Leakage Current 
±15 
/-LA 
0.45V:>:VOUT:>:VCC 


tee 
Supply Current (80960CA-33) 
tee Max 
900 
mA 
(4) 


IccTyp 
750 
(5) 


tee 
Supply Current (80960CA-25) 


IccMax 
750 
mA 
(4) 


ICCTyp 
600 
(5) 


lee 
Supply Current (80960CA-16) 
tee Max 
550 
mA 
(4) 
tee Typ 
400 
(5) 


IONCE 
ONCE-mode 
Supply Current 
100 
mA 


CIN 
Input Capacitance 
for: 
CLKIN, RESET, ONCE, 
READY, HOLD, DREQ3:0, 
SOFF 
XINT7:0, 
NMI, STERM, CLKMODE 
0 
12 
pF 
Fe = 1 MHz 


COUT 
Output Capacitance 
of each output pin 
12 
pF 
Fc = 1 MHz, (6) 


CIIO 
1/0 Pin Capacitance 
12 
pF 
Fc = 1 MHz 


NOTES: 
(1) No Pull-up or pull-down. 
(2) These pins have internal pullup resistors. 
(3) These pins have internal pulldown resistors. 
(4) Measured at worst case frequency. vcc and temperature, with device operating and outputs loaded to the test conditions 
described in Section 4.5.1, AC Test Conditions. 
(5) Ice Typical is not tested. 
(6) Output Capacitance is the capacitive load of a floa~~E'Put. 
(7) ClKMODE pin has a pull down resistor only when 
N 
pin is deasserted. 
I 
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4.5 AC Specifications 


AC Characteristics 
- 
80960CA-33 
(80960CA-33 
only, under the conditions 
described 
in Section 
4.2, Operating 
Conditions 
and Section 
4.5.1, 
AC Test 
Conditions) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK(10) 


TF 
CLKIN Frequency 
0 
66.66 
MHz 
(1) 


Tc 
CLKIN Period 
In t-x Mode (fCLK1xl 
30 
125 
ns 
(1.12) 
In 2-x Mode (fCLK2xl 
15 
00 
ns 
(1) 


TCS 
CLKIN Period Stability 
In 1-x Mode (fCLK1x) 
±0.1% 
A 
(1.13) 


TCH 
CLKIN High Time 
In 1-x Mode (fCLK1xl 
6 
62.5 
ns 
(1.12) 
In 2-x Mode (fCLK2xl 
6 
00 
ns 
(1) 


TCL 
CLKIN Low Time 
In t-x Mode (fCLK1x) 
6 
62.5 
ns 
(1.12) 
In 2-x Mode (fCLK2xl 
6 
00 
ns 
(1) 


TCR 
CLKIN Rise Time 
0 
6 
ns 
(1) 


TCF 
CLKIN Fall Time 
0 
6 
ns 
(1) 


OUTPUT CLOCKS(9) 


Tcp 
CLKIN to PCLK2:1 Delay 
In 1-x Mode (fCLK1x) 
-2 
2 
ns 
(1.3.13) 
In 2-x Mode (fCLK2x) 
2 
25 
ns 
(1.3) 


T 
PCLK2:1 Period 
In 1-x Mode (fCLK1x) 
TC 
ns 
(1.13) 
In 2-x Mode (fCLK2xl 
2TC 
ns 
(1.3) 


TpH 
PCLK2:1 High Time 
(T/2) - 
2 
T/2 
ns 
(1.13) 


TpL 
PCLK2:1 Low Time 
(T/2) - 
2 
T/2 
ns 
(1.13) 


TpR 
PCLK2:1 Rise Time 
1 
4 
ns 
(1.3) 


TpF 
PCLK2:1 Fall Time 
1 
4 
ns 
(1.3) 


SYNCHRONOUS OUTPUTS(10) 


Tov 
Output Valid Delay. Output Hold 
(6.11) 
TOH 
TOV1.TOH1 
A31:2 
3 
14 
ns 
TOV2.TOH2 
BE3:0 
3 
16 
ns 
TOV3.TOH3 
ADS 
6 
18 
ns 
TOV4.TOH4 
W/R 
3 
18 
ns 
TOV5.TOH5 
D/G. SDP. DMA 
4 
16 
ns 
TOV6.TOH6 
BLAST. WAIT 
5 
16 
ns 
TOV? TOH? 
DEN 
3 
16 
ns 
TOV8.TOH8 
HOLDA.BREQ 
4 
16 
ns 
TOV9.TOH9 
LOCK 
4 
16 
ns 
TOV1O.TOH10 
DACK3:0 
4 
18 
ns 
TOV11.TOH11 
D31:0 
3 
16 
ns 
TOV12.TOH12 
DTtR 
T/2 + 3 
T/2 + 14 
ns 
TOV13.TOH13 
FAIL 
2 
14 
ns 
TOV14.TOH14 
EOP3:0/TC3:0 
3 
18 
ns 
(6.11) 


TOF 
Output Float for all outputs 
3 
22 
ns 
(6) 


SYNCHRONOUS INPUTS(10) 


TIS 
Input Setup 
TIS1 
D31:0 
3 
ns 
(1.11) 
TIS2 
BOFF 
17 
ns 
(1.11) 
TIS3 
BTERM/READY 
7 
ns 
(1.11) 
TIS4 
HOLD 
7 
ns 
(1.11) 


TIH 
Input Hold 
TIH1 
D31:0 
5 
. 
ns 
(1.11) 
TIH2 
BOFF 
5 
ns 
(1.11) 
TIH3 
BTERM/READY 
2 
ns 
(1.11) 
TIH4 
HOLD 
3 
ns 
(1.11) 
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AC Characteristics 
- 
80960CA-33 
80960CA-33 
oniy, under the conditions 
described 
in Section 
4.2, Operating 
Conditions 
and Section 
4.5.1, 


AC Test 
Conditions.) 
(Continued) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


RELATIVE OUTPUT TIMINGS(9,7) 


TAVSHl 
A31:2 Valid to ADS Rising 
T-4 
T+4 
ns 


TAVSH2 
BE3:0, W/R, 
SOP, DIG, 


DMA, DAGK3:0 Valid to ADS Rising 
T-6 
T+6 
ns 


TAVEl1 
A31:2 Valid to DEN Falling 
T-4 
T+4 
ns 


TAVEl2 
BE3:0, W/R, 
SOP,INST, 
DMA, DAGK3:0 Valid to DEN Failing 
T-6 
T+6 
ns 


TNLOV 
WAIT Failing to Output Data Valid 
±4 
ns 


TDVNH 
Output Data Valid to WAIT Rising 
N°T - 
4 
N°T + 4 
ns 
(4) 


TNLNH 
WAIT Failing to WAIT Rising 
N*T±4 
ns 
(4) 


TNHOX 
Output Data Hold after WAIT Rising 
(N + 1) ° T - 
6 
(N + 1) ° T + 6 
ns 
(5) 


TEHTV 
DTI R Hold after DEN High 
T/2 - 6 
00 
ns 
(6) 


TTVEL 
DTIR Valid to DEN Falling 
T/2 - 4 
T/2 + 4 
ns 
(7) 


RELATIVE INPUT TIMINGS(7) 


TIS5 
RESET Input Setup (2-x Clock Mode) 
6 
ns 
(14) 


TIH5 
RESET Input Hold (2-x Clock Mode) 
5 
ns 
(14) 


TIS6 
DREQ3:0 Input Setup 
12 
ns 
(8) 


TIH6 
DREQ3:0 Input Hold 
7 
ns 
(8) 


TIS7 
XINT7:0, NMI Input Setup 
7 
ns 
(8) 


TIH7 
XINT7:0, NMllnput 
Hold 
3 
ns 
(8) 


Tlss 
RESET Input Setup (t-x Clock Mode) 
3 
ns 
(15) 


TIH8 
RESEf Input Hold (t-x Clock Mode) 
T/4 + 1 
ns 
(15) 


NOTES: 
(1) See Section 4.5.2, AC Timing Waveforms for waveforms and definitions. 
(2) See Figure 22 for capacitive derating information for output delays and hold times. 
(3) See Figure 23 for capacitive derating information for rise and fall times. 
(4) Where N is the number of NRAD,NRDD,NWAD,or NWDDwait states that are programmed in the Bus Controller Region 
Table. When there are no wait states in an access, WAIT never goes active. 
(5) N = Number of wait states inserted with READY. 
(6) Output Data and/or DTlA may be driven indefinitely following a cycle if there is no subsequent bus activity. 
(7) See Notes 1, 2 and 3. 
(8) Since asynchronous inputs are synchronized internally by the 80960CA, they have no required setup or hold times to be 
recognized and for proper operation. However, to guarantee recognition of the input at a particular edge of PCLK2:1, the 
setup times shown must be met. Asynchronous inputs must be active for at least two consecutive PCLK2:1 rising edges to 
be seen by the processor. 
(9) These specifications are guaranteed by the processor. 
(10) These specifications must be met by the system for proper operation of the processor. 
(11) This timing is dependent upon the loading of PCLK2:1. Use the derating curves of Section 4.5.3 to adjust the timing for 
PCLK2:1 loading. 
(12) In the t-x input clock mode, the maximum input clock period is limited to 125 ns while the processor is operating. When 
the processor is in reset, the input clock may stop even in t-x mode. 
(13) When in the t-x input clock mode, these specifications assume a stable input clock with a period variation of less than 
± 0.1% between adjacent cycles. 
(14) In 2-x clock mode, RESET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must meet setup and 
hold times to the falling edgs 
of the CLKIN. (See Figure 28a.) 
(15) In t-x clock mode, RE ET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must be deasserted 
while CLKIN is high and meet setup and hold times to the rising edge of the CLKIN. (See Figure 28b.) 
I 
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AC Characteristics 
- 
80960CA-25 
(80960CA-25 
only. under the conditions 
described 
in Section 
4.2, Operating 
Conditions 
and Section 
4.5.1, 
AC Test 
Conditions.) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK(10) 


TF 
CLKIN Frequency 
0 
50 
MHz 
(1) 


TC 
CLKIN Period 
In t-x Mode (fCLKlx) 
40 
125 
ns 
(1.12) 
In 2-x Mode (fCLK2x) 
20 
00 
ns 
(1) 


Tcs 
CLKIN Period Stability 
In t-x Mode (fCLKlxl 
±0.1% 
t.. 
(1.13) 


TCH 
CLKIN High Time 
In t-x Mode (fCLKlxl 
8 
62.5 
ns 
(1.12) 
In 2-x Mode (fCLK2xl 
8 
00 
ns 
(1) 


TCL 
CLKIN Low Time 
In t-x Mode (fCLKlx) 
8 
62.5 
ns 
(1.12) 
In 2-x Mode (fCLK2xl 
8 
00 
ns 
(1) 


TCA 
CLKIN Rise Time 
0 
6 
ns 
(1) 


TCF 
CLKIN Fall Time 
0 
6 
ns 
(1) 


OUTPUT CLOCKS(9) 


Tcp 
CLKIN to PCLK2:1 Delay 
In t-x Mode (fCLKlx) 
-2 
2 
ns 
(1.3.13) 
In 2-x Mode (fCLK2xl 
2 
25 
ns 
(1.3) 


T 
PCLK2:1 Period 
In 1-x Mode (fCLKlxl 
Tc 
ns 
(1.13) 
In 2-x Mode (fCLK2x) 
2Tc 
ns 
(1.3) 


TpH 
PCLK2:1 High Time 
(T/2) - 
3 
T/2 
ns 
(1.13) 


TpL 
PCLK2:1 Low Time 
(T/2) - 
3 
T/2 
ns 
(1.13) 


TpA 
PCLK2:1 Rise Time 
1 
4 
ns 
(1.3) 


TpF 
PCLK2:1 Fall Time 
1 
4 
ns 
(1.3) 


SYNCHRONOUS OUTPUTS(10) 


Tov 
Output Valid Delay. Output Hold 
(6.11) 
TOH 
TOV1.TOHl 
A31:2 
3 
16 
ns 
TOV2.TOH2 
BE3:0 
3 
18 
ns 
TOV3.TOH3 
ADS 
6 
20 
ns 
TOV4.TOH4 
W/R 
3 
20 
ns 
TOV5.TOH5 
D/C.SOP.DMA 
4 
18 
ns 
TOV6.TOH6 
BLAST. WAIT 
5 
18 
ns 
TOV7.TOH7 
DEN 
3 
18 
ns 
TOV8.TOH8 
HOLDA. BREQ 
4 
18 
ns 
TOV9.TOH9 
LOCK 
4 
18 
ns 
ToVlO. TOH10 
DACK3:0 
4 
20 
ns 
TOV11.TOHll 
D31:0 
3 
18 
ns 
TOV12.TOH12 
DT/R 
T/2 + 3 
T/2 + 16 
ns 
TOV13.TOH13 
FAIL 
2 
16 
ns 
TOV14.TOH14 
EOP3:0/TC3:0 
3 
20 
ns 
(6.11) 


TOF 
Output Float for all outputs 
3 
22 
ns 
(6) 


SYNCHRONOUS INPUTS(10) 


TIS 
Input Setup 
TISl 
D31:0 
5 
ns 
(1.11) 
TIS2 
BOFF 
19 
ns 
(1.11) 
TIS3 
BTERM/READY 
9 
ns 
(1.11) 
TIS4 
HOLD 
9 
ns 
(1.11) 


TIH 
Input Hold 
TIHl 
D31:0 
5 
ns 
(1.11) 
TIH2 
BOFF 
7 
ns 
(1.11) 
TIH3 
STERM/READY 
2 
ns 
(1.11) 
TIH4 
HOLD 
5 
ns 
(1.11) 
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AC Characteristics 
- 
80960CA-25 
(80960CA-25 
oniy, under the conditions 
described 
in Section 
4.2, Operating 
Conditions 
and Section 
4.5.1, 
AC Test 
Conditions.) 
(Continued) 


Symbol 
Parameter 
Min 
Max 
Units 
Notes 


RELATIVE OUTPUT TIMINGS(9,7) 


TAVSH1 
A31:2 Valid to ADS Rising 
T-4 
T+4 
ns 


TAVSH2 
BE3:0, W/R, 
SDP, DIe, 
DMA, DACK3:0 Valid to ADS Rising 
T-6 
T+6 
ns 


TAVEL1 
A31:2 Valid to DEN Falling 
T-4 
T+4 
ns 


TAVEL2 
BE3:0, W/R, 
SDP,INST, 
DMA, DACK3:0 Valid to DEN Falling 
T-6 
T+6 
ns 


TNLOV 
WAIT Falling to Output Data Valid 
±4 
ns 


TOVNH 
Output Data Valid to WAIT Rising 
N'T 
- 
4 
WT+4 
ns 
(4) 


TNLNH 
WAIT Falling to WAIT Rising 
N'T 
± 4 
ns 
(4) 


TNHOX 
Output Data Hold after WAIT Rising 
(N + 1)' 
T - 
6 
(N + 1)' 
T + 6 
ns 
(5) 


TEHTV 
DTIR Hold after DEN High 
T/2 - 6 
00 
ns 
(6) 


TTVEL 
DTIR Valid to DEN Falling 
T/2 - 4 
T/2 + 4 
ns 
(7) 


RELATIVE INPUT TIMINGS(7) 


TIS5 
RESET Input Setup (2-x Clock Mode) 
8 
ns 
(14) 


TIH5 
RESET Input Hold (2-x Clock Mode) 
7 
ns 
(14) 


TIS6 
DREQ3:0 Input Setup 
14 
ns 
(8) 


TIH6 
DREQ3:0 Input Hold 
9 
ns 
(8) 


TIS7 
XINT7:0, NMllnput 
Setup 
9 
ns 
(8) 


TIH7 
XINT7:0, NMllnput 
Hold 
5 
ns 
(8) 


TIS8 
RESET Input Setup (l-x Clock Mode) 
3 
ns 
(15) 


TIH8 
RESET Input Hold (l-x Clock Mode) 
T/4 + 1 
ns 
(15) 


NOTES: 
(1) See Section 4.5.2, AC Timing Waveforms for waveforms and definitions. 
(2) See Figure 22 for capacitive derating information for output delays and hold times. 
(3) See Figure 23 for capacitive derating information for rise and fall times. 
(4) Where N is the number of NRAD,NRDD,NWAD,or NWDDwait states that are programmed in the Bus Controller Region 
Table. When there are no wait states in an access, WAIT never goes active. 
(5) N = Number of wait states inserted with READY. 
(6) Output Data and/or DTlA may be driven indefinitely following a cycle if there is no subsequent bus activity. 
(7) See Notes 1, 2 and 3. 
(8) Since asynchronous inputs are synchronized internally by the 80960CA, they have no required setup or hold times in 
order to be recognized and for proper operation. However, in order to guarantee recognition of the input at a particular edge 
of PCLK2:1, the setup times shown must be met. Asynchronous inputs must be active for at least two consecutive PCLK2:1 
rising edges to be seen by the processor. 
(9) These specifications are guaranteed by the processor. 
(10) These specifications must be met by the system for proper operation of the processor. 
(11) This timing is dependent upon the loading of PCLK2:1. Use the derating curves of Section 4.5.3 to adjust the timing for 
PCLK2:1 loading. 
. 


(12) In the 1-x input clock mode the maximum input clock period is limited to 125 ns while the processor is operating. When 
the processor is in reset, the input clock may stop even in 1-x mode. 
(13) When in the 1-x input clock mode, these specifications assume a stable input clock with a period variation of less than 
± 0.1% between adjacent cycles. 
(14) In 2-x clock mode, RESET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must meet setup and 
hold times to the falling edgs 
of the CLKIN. (See Figure 28a.) 


(15) In t-x clock mode, RE ET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must be deasserted 
while CLKIN is high and meet setup and hold times to the rising edge of the CLKIN. (See Figure 28b.) 
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AC Characteristics 
- 
80960CA-16 
(80960CA-16 
only. under the conditions 
described 
in Section 
4_2, Operating 
Conditions 
and Section 
4_5.1, 
AC Test 
Conditions.) 
(Continued) 


Symbol 
Parameter 
Min 
Max 
Units 
Notes 


INPUT CLOCK(10) 


TF 
CLKIN Frequency 
0 
32 
MHz 
(1) 


Tc 
CLKIN Period 
In 1-x Mode (fCLK1,j 
62.5 
125 
ns 
(1.12) 
In 2-x Mode (fCLK2,j 
31.25 
00 
ns 
(1) 


TCS 
CLKIN Period Stability 
In t-x Mode (fCLK1,j 
±0.1% 
t. 
(1.13) 


TCH 
CLKIN High Time 
In t-x Mode (fCLK1,j 
10 
62.5 
ns 
(1.12) 
In 2-x Mode (fCLK2x> 
10 
00 
ns 
(1) 


TCL 
CLKIN Low Time 
In 1-x Mode (fCLK1,j 
10 
62.5 
ns 
(1.12) 
In 2-x Mode (fCLK2,j 
10 
00 
ns 
(1) 


TCR 
CLKIN Rise Time 
0 
6 
ns 
(1) 


TCF 
CLKIN Fall Time 
0 
6 
ns 
(1) 


OUTPUT CLOCKS(9) 


Tcp 
CLKIN to PCLK2:1 Delay 
In 1-x Mode (fCLK1,j 
-2 
2 
ns 
(1.3.13) 
In 2-x Mode (fCLK2x> 
2 
25 
ns 
(1.3) 


T 
PCLK2:1 Period 
In t-x Mode (fCLK1,j 
Tc 
ns 
(1.13) 


In 2-x Mode (fCLK2x> 
2Tc 
ns 
(1.3) 


TpH 
PCLK2:1 High Time 
(T/2) - 
4 
T/2 
ns 
(1.13) 


TpL 
PCLK2:1 Low Time 
(T/2) - 
4 
T/2 
ns 
(1.13) 


TpR 
PCLK2:1 Rise Time 
1 
4 
ns 
(1.3) 


TpF 
PCLK2:1 Fall Time 
1 
4 
ns 
(1.3) 


SYNCHRONOUS OUTPUTS(10) 


Tov 
Output Valid Delay. Output Hold 
(6.11) 


TOH 
TOV1.TOHl 
A31:2 
3 
18 
ns 
TOV2.TOH2 
BE3:0 
3 
20 
ns 
TOV3.TOH3 
ADS 
6 
22 
ns 
TOV4.TOH4 
W/R 
3 
22 
ns 
TOV5.TOH5 
D/C. sop. DMA 
4 
20 
ns 
TOV6.TOH6 
BLAST. WAIT 
5 
20 
ns 
TOV? TOH? 
DEN 
3 
20 
ns 
TOV8.TOH8 
HOLDA.BREQ 
4 
20 
ns 
TOV9.TOH9 
LOCK 
4 
20 
ns 
TOVlO. TOH10 
DACK3:0 
4 
22 
ns 
TOVll. TOH11 
031:0 
3 
20 
ns 
TOV12.TOH12 
DT/R 
T/2 + 3 
T/2 + 18 
ns 
TOV13.TOH13 
FAIL 
2 
18 
ns 
TOV14.TOH14 
EOP3:0/TC3:0 
3 
22 
ns 
(6.11) 


TOF 
Output Float for all outputs 
3 
22 
ns 
(6) 


SYNCHRONOUS INPUTS(10) 


TIS 
Input Setup 
TISl 
031:0 
5 
ns 
(1.11) 
TIS2 
BOFF 
21 
ns 
(1.11) 
TIS3 
BTERM/READY 
9 
ns 
(1.11) 


TIS4 
HOLD 
9 
ns 
(1.11) 


TIH 
Input Hold 
TIHl 
031:0 
5 
ns 
(1.11) 
TIH2 
BOFF 
7 
ns 
(1.11) 
TIH3 
BTERM/READY 
2 
ns 
(1.11) 


TIH4 
HOLD 
5 
ns 
(1.11) 
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Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


RELATIVE 
OUTPUT 
TIMINGS(9,7) 


TAVSH1 
A31:2 Valid to ADS Rising 
T-4 
T+4 
ns 


TAVSH2 
BE3:0, WIR, 
SUP, DIC, 
DMA, DACK3:0 Valid to ADS Rising 
T-6 
T+6 
ns 


TAVEL1 
A31:2 Valid to DEN Falling 
T-6 
T+6 
ns 


TAVEL2 
BE3:0, WIR, 
SUP, INST, 
DMA, DACK3:0 Valid to DEN Falling 
T-6 
T+6 
ns 


TNLOV 
WAIT Falling to Output Data Valid 
±4 
ns 


TDVNH 
Output Data Valid to WAIT Rising 
N*T-4 
N°T + 4 
ns 
(4) 


TNLNH 
WAIT Falling to WAIT Rising 
N°T ± 4 
ns 
(4) 


TNHOX 
Output Data Hold after WAIT Rising 
(N + 1) ° T - 
6 
(N + 1) ° T + 6 
ns 
(5) 


TEHTV 
DT IR Hold after DEN High 
T/2 
- 
6 
00 
ns 
(6) 


TTVEL 
DTIR Valid to DEN Falling 
T/2 
- 
4 
T/2 
+ 4 
ns 
(7) 


RELATIVE 
INPUT TIMINGS(7) 


TIS5 
RESET Input Setup (2-x Clock Mode) 
10 
ns 
(14) 


TIH5 
RESET Input Hold (2-x Clock Mode) 
9 
ns 
(14) 


TI56 
DREQ3:0 
Input Setup 
16 
ns 
(8) 


TIH6 
DREQ3:0 
Input Hold 
11 
ns 
(8) 


TIS7 
XINT7:0, 
NMI Input Setup 
9 
ns 
(8) 


TIH7 
XINT7:0, NMI Input Hold 
5 
ns 
(8) 


TIS8 
RESET Input Setup (1-x Clock Mode) 
3 
ns 
(15) 


TIH8 
RESET Input Hold (1-x Clock Mode) 
T/4 
+ 
1 
ns 
(15) 


NOTES: 
(1) See Section 4.5.2, AC Timing Waveforms for waveforms and definitions. 
(2) See Figure 22 for capacitive derating information for output delays and hold times. 
(3) See Figure 23 for capacitive derating information for rise and fall times. 
(4) Where N is the number of NRAD,NRDD,NWAD,or NWDDwait states that are programmed in the Bus Controller Region 
Table. When there are no wait states in an access, WAIT never goes active. 
. 


(5) N = Number of wait states inserted with READY. 
(6) Output Data and/or DT/R may be driven indefinitely following a cycle if there is no subsequent bus activity. 
(7) See Notes 1, 2 and 3. 
(8) Since asynchronous inputs are synchronized internally by the 80960CA, they have no required setup or hold times in 
order to be recognized and for proper operation. However, in order to guarantee recognition of the input at a particular edge 
of PCLK2:1, the setup times shown must be met. Asynchronous inputs must be active for at least two consecutive PCLK2:1 
rising edges to be seen by the processor. 
' 


(9) These specifications are guaranteed by the processor. 
(10) These specifications must be met by the system for proper operation of the processor. 
(11) This timing is dependent upon the loading of PCLK2:1. Use the derating curves of Figure 22 to adjust the timing for 
PCLK2:1 loading. 
(12) In the t-x input clock mode the maximum input clock period is limited to 125 ns while the processor is operating. When 
the processor is in reset, the input clock may stop even in t-x mode. 
(13) When in the t-x input clock mode, these specifications assume a stable input clock with a period variation of less than 
±0.1% between adjacent cycles. 
(14) In 2-x clock mode, RESET is an asynchronous input which has no required setup and hold time for proper operation. 
However. to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must meet setup and 
hold times to the falling edge of the CLKIN. (See Figure 28a.) 
(15) In t-x clock mode, RESET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must be deasserted 
while CLKIN is high and meet setup and hold times to the rising edge of the CLKIN. (See Figure 28b.) 
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4.5.1. AC TEST CONDITIONS 


OU~~UJ>------l 
r 
CL= 50pf for all signals 
270727-11 


Figure 15.AC Test Load 


4.5.2. AC TIMING WAVEFORMS 


The AC Specifications in Section 4.5 are tested with 
the 50 pf load shown in Figure 15. See Figure 22 to 
see how timings vary with load capacitance. 


Specifications are measured at the 1.5V crossing 
point, unless otherwise indicated. Input waveforms 
are assumed to have a rise-and-fall time of 
:5": 2 ns 
fromo.sv 
to 2.0V. See Section 4.5.2, AC Timing 
Waveforms, 
for AC spec definitions, test points, 


and illustrations. 


CLKIN 


PCLK2:1 


270727-12 


Figure 16a.lnput and Output Clocks Waveform 
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Figure 16b. CLKIN Waveform 
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Figure 17. Output 
Delay and Float Waveform 
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Figure 18a.lnput 
Setup and Hold Waveform 


- 
OUTPUT 
DELAY - 
The maximum outpu1 delay is referred to 


as the 0utpu1 Valid Delay (TOV). 
The minimum output delay is 


referred to as the Output Hold (TOH). 


- 
OUTPUT FLOAT DELAY - 
The output float cond~ion occurs 


when the maximum 
output current 
becomes 
less than I LO in magnitude. 


- 
INPUT SETUP AND HOLD - 
The input setup and hold requirements 


specify the sampling window during which synchronous 
inputs must be 


stable for correct 
processor 
operation. 
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Figure 18b. NMI, XINT7:0 
Input Setup and Hold Waveform 
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- 
OUTPUT DELAY - 
The maximum ou1pu1 delay is referred to 


as the Output Valid Delay (Tov). 
The minlmum ou1pu1 delay is 


referred to as the Output Hold (TOH)' 


- 
OUTPUT FLOAT DELAY - 
The ou1pu1 float condition occurs 


when the maximum ou1pu1 current becomes less than I LO in magnitude. 


- 
INPUT SETUP AND HOLD - 
The input setup and hold requirements 


specify the sempling window during which synchronous 
inputs must be 


stable for correct processor opera~on. 


Figure 19. Hold Acknowledge 
Timings 
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Figure 20. Bus Back-Off 
(BOFF) Timings 
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DERATING 
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Figure 21. Relative Timings Waveforms 


NOTE: 
PClK load = 50 pF 


Figure 22. Output 
Delay or Hold vs Load Capacitance 
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Figure 23. Rise and Fall Time Derating 
at Highest Operating 
Temperature 
and Minimum Vee 
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DT/R' 
Driven low (Receive) 
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Driven high (Inactive) 
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Driven high (Inactive) 
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Floating (set to input mode) 
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Floating (set to input mode) 


NOTE: 
(1) With regard to bus output pin state only, the Hold Ac- 
knowledge state takes precedence over the reset state. Al- 
though asserting the RESET pin will internally reset the 
processor, the processor's bus output pins will not enter 
the reset state if it has granted Hold Acknowledge to a pre- 
vious HOLD request (HOLDA is active). Furthermore, the 
processor will grant new HOLD requests and enter the 
Hold Acknowledge state even while in reset. 
For example, if HOLDA is not active and the processor is 
in the reset state, then HOLD is asserted, the processor's 
bus pins will enter the 
Hold Acknowledge 
state and 
HOLDA will be granted. The processor will not be able to 
perform memory accesses until the HOLD request is re- 
moved, even if the RESET pin is brought high. This opera- 
tion is provided to simplify boot-up synchronization among 
multiple processors sharing the same bus. 
I 


~ 


DEN 
Floating 


LOCK 
Floating 


BREQ 
Driven (high or low) 


D/C 
Floating 


DMA 
Floating 


SUP 
Floating 


FAIL 
Driven high (Inactive) 


DACK3 
Driven high (Inactive) 


DACK2 
Driven high (Inactive) 


DACK1 
Driven high (Inactive) 


DACKO 
Driven high (Inactive) 


EOP/TC3 
Driven if output 


EOP/TC2 
Driven if output 


EOP/TC1 
Driven if output 


EOP/TCO 
Driven if output 
• 


3-233 


w 
Nw~ 


ClKIN 


VCC'ONcE 


PCLK1.2 


--~ 
!!I 


----'=QCK WAIT 
DEN. DACKO:3 


CO 
C.., 
~I 


wiii.Qlli 
BREa. FAIL 


!I' 
0 
01 
BLAST 
c: 
:xl 
IDIn 
mr.r0J5. 


ID 
~.~ 
- 
:\lE 
DI 
_QQ1.!.. 
< 
EOPITCO.3 
ID-~I 


STEST 


REm 


~----,-11 
I 
j--'; 
I 


'___ 
table,' 
: 
: 
I 
: 
: 
: 
: 
: 
I 
~ 
I 
I 
toOutputsValid,maxmum , 
I 
I 
: 
I 
I 
I 
I 
I 
I 
, 
32 CLKIN Periods 
I 
I 
t 
I 
I 
I 
r 
~~~ 
~ 
I 
I 
I 
I 
I' 
: 
~W1i1IJ1IJJJJJjJ' 
'11" 
: 
11' 
:c' : 
[I 
It' 
I 
, 
' 
I 
, 
t 
I 
I 
I 
I 
I 
I 
I 
I 
I 


~I~IIIII" 
I 


I 
, 
I 
I 
I 
t 
I 
I 
, 
, 
I 
[: 
~~ 
: 
: 
: 
: 
: 
: 
.r=: 
, 


'I 
I 
I 
I 
, 
I 
I~ 
i------41~ 
: 
11 
I: 
I 
11 
I 
I 
: 
: 
~I::: 
'I::, 


[, 
~ 
',' 
Ill, 
I 
I 
I 
I 
I 
t 
I 
..----II~ 
, 
I" 
"', 


I 
I 
I 
I 
I 
I: 
1 
I 


[ : 
~ 
W/§IIfIfJlJ!!JJ -- -~--, ~-:--- -- t 
INV~lIO 
q 
t------4! 
I 
'I 
I 
I 
I 
t 
I 
: 
~~:::::::c 
[L----,1 
Input. 
~_----:---If-~-----:- 
---~----~--.,~-;-----~ 
I 
r 
~~: 
:VAlIO: 
: 
~ 


t------4 I 
I 
: 
I 
.etup 
Thold 1PCLK 
I 
: 
: 
: 


: 
: 
t 
: 
1 PCLKt 
I 
: 
, 
I 
t 


[I 
, 
I 
, 
I 
t 
I 
I 
I 
I 
~I 
11 
:1,':::: 
I 
,-------, 
I 
I 
I 
I 
t 
: 
I: 
I 
: 
:. 
: 
: 
I 
: 
:. 


: 
CLKIN AND VCC Stable 
to'Riiii 
high, mi'Iimum 
t 
ReS8iHlgh 
10 First Bus Activity. approximately 
: 
I 
32 ClKIN 
Periods in Two-x Mode. 10,000 CLKIN ~ 
In 00&-11Mode.: 
32 PClK Periods 
I 


: 
: 
I 
: 
I 
I 
I 
I 
I 
: 


: 
: 
: 
: 
: 
: 
: 
: 
: 
: 
, 
, 
, 
, 
, 


270727-20 


~ 
01) 
0 
C) 
CO 
Cl) 


ID 
C) 
C 
0 


Cl) 
~ 
:e 


(,0) 
~ 
l> 
IN 
< 
!JI 
m 
I 
." 
.- 
0 
Cl) 


l:J 
~ 
Cl) 


-- 


~ 
- 


@ 


intel~ 
80960CA-33, -25, -16 


--2---r--9- -------H--U--I--j------I---------- 
__-- 
-_-_------ 
--1---_- 
------l------ 
_-_--_---_-(- 
- -_---- 


± 
~ 
! 


, 
! 
---~--- 
----- . ---------- 
: 
I 
, 
2 
II 
. 


- - - - - - - - - - - :-- - - 
- -~ - - - - - - -.1 . - - - - - - - - - - 
:, 
! 
~ 
~1__+,---~,~~~_=--~~ 
-- ------- 
----- 
---:----:--- 
i"i-------- 
--------- 
, 
, 
, 
, 


- - - .1. 
.1. 
_ 
, 
, 
.•. 
.•. 


T 
T 
, 
, 
, 
' 
___ ~-----L--- 
, 
' 
, 
, 
, 
, 


------ 
--------- 


----------------------------.------------------------- 
L...J 
L...J 
L.J 
L...J 
L...J 
L.J 
L...J 
L...J 


N I~~! 
'~l~ I~ 


I.:~g 
~g I~ li 
~ 
~!l~ 
gg 
~ 
let@ 
1<c:1~ 
I@ 
;tlli 
~o 


l 


Figure 26. Warm Reset Waveform 
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Figure 29. Non-Burst, Non-Plpellned Accesses without Wait States 
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Figure 30. Non-Burst, Non-Pipelined Read with Wait States 


3·239 


80960CA-33, -25, -16 
intel~ 


Region Table Entry 
~ 
~ 
. 
8yte i 


But 
Plpe- 
Eat.,,,.t 
. 
Nwdd 
Nw.d 
Nad. 
Nrdd 
Nred 
Bur., 
: 
Order 
Width 
lining 
R.ady 
Control 
E 
0: 
bIls31-23 
blI22 
bil21 
bits 20..19 
bill 18·17 
bits 16-12 
bits 11-10 
bit$9·8 
bits 7 3 
bot2 
'''' 
bot. 
0 
X 
0 
X 
X 
3 
1 
X 
X 
Off 
Di •• blad 
Olnbfad 
00 
. 
0 
.. 
.. 
00011 
O. 
.. 
IIIOtXX 
0 
0 
0 


2 
A 
0 
3 


PCLK [ 


ADs [ 


A31:2,~ 
[ 


W-fl [ 


BLAsT [ 


DT-fl [ 


DEN [ 


SOl', OQA, [ 
Die, meR 


WAIT 
[ 


031:0 
[ 
-m _m:_< 


,, 


A 


Valid 


270727-28 


,--~ 
__ 
OUT 
---.J)--- -- 
: 
i 
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Figure 33. Burst, Non-Pipellned Read with Wait States, 32-Blt Bus 
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Figure 35. Burst, Non-Plpellned Write with Wait States, 32-Blt Bus 
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Figure 36. Burst, Non-Plpellned 
Read with Wait States, 
16-Blt Bus 
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Figure 37. Burst, Non-Plpellned Read with Wait States, 8-Blt Bus 
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Figure 39. Non-Burst, Plpellned Read with Wait States, 32-Blt Bus 
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Figure 40. Burst, Plpellned Read without Wait States, 32-Bit Bus 
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Figure 43. Burst, Plpellned Read with Wait States, 8-Blt Bus 
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Figure 46. BOFF Functional 
Timing 
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Figure 47. HOLD Functional 
Timing 
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NOTES: 
1. Case 
1: DREO must 
deassert 
before 
"C5ACR deasserts. 
Applications 
are 
Fly-by 
and 
some 
packing 
and 
unpacking 
modes 
in which 
loads are followed 
by loads, or stores 
are followed 
by stores. 
2. Case 2: DREQ must be deasserted 
by the second 
clock 
(rising edge) 
after DACK is driven 
high. Applications 
are non 
fly-by transfers 
and adjacent 
load-stores 
or store-loads. 


3. DACKx 
is asserted 
for the duration 
of a DMA bus request. 
The request 
may consist 
of multiple 
bus accesses 
(defined 
by i\[jg and BLAST. 
Refer to User's 
Manual 
for "access", 
"request" 
definition. 


Figure 48. DREQ and DACK Functional Timing 
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NOTE: 
EOP has the same AC Timin~quirements 
as DREQ to prevent 
unwanted 
DMA requests. 


EOP is NOT edge triggered. 
EOP must be held for a minimum 
of 2 clock 
cycles 
then EOP must be deasserted 
within 
15 clock 
cycles. 


Figure 49. EOP Functional Timing 
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NOTE: 
Terminal 
Count 
becomes 
active 
during the last bus request 
of a buffer 
transfer. 
If the last LOAD/STORE 
bus request 
is 
executed 
as multiple 
bus accesses, 
the TC will be active for the entire bus request 
Refer to the i960 CA Microprocessor 
Reference 
Manual 
for further 
information. 


Figure 50. Terminal 
Count Functional 
Timing 
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Figure 51. FAIL Functional 
Timing 
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Figure 52. A Summary 
of Aligned and Unallgned 
Transfers 
for Little Endian Regions 
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Figure 53. A Summary 
of Aligned and Unallgned 
Transfers 
for Little Endlan Regions 
(Continued) 
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32-BIT HIGH PERFORMANCE SUPERSCALAR 
PROCESSOR 


• Socket and Object Code Compatible with 80960CA 
• Two Instructions/Clock 
Sustained Execution 
• Four 59 Mbytes/s DMA Channels with Data Chaining 
• Demultlplexed 32-blt Burst Bus with Pipellning 


• 
32-blt Parallel Architecture 
- 
Two Instructions/clock 
Execution 
- 
Load/Store 
Architecture 
- 
Sixteen 32-blt Global Registers 
- 
Sixteen 32-blt Local Registers 
- 
Manipulate 64-blt Bit Fields 
- 
11 Addressing 
Modes 
- 
Full Parallel Fault Model 
- 
Supervisor Protection 
Model 


• 
Fast Procedure Call/Return 
Model 
- 
Full Procedure Call In 4 clocks 


• 
On-Chip Register Cache 
- 
Caches Registers on Call/Ret 
- 
Minimum of 6 Frames provided 
- 
Up to 15 Programmable 
Frames 


• 
On-Chip Instruction 
Cache 
- 
4 Kbyte Two-Way Set Associative 
-128-blt 
Path to Instruction 
Sequencer 
- 
Cache-Lock Modes 
- 
Cache-Off Mode 


• 
On-Chip Data Cache 
- 
1 Kbyte Direct-Mapped, 
Write Through 
- 
128 bits per Clock Access on 
Cache Hit 


• 
High Bandwidth On-Chip Data RAM 
- 
1 Kbytes On-Chip RAM for Data 
- 
Sustain 128 bits per clock access 


• 
Four On-Chip DMA Channels 
- 
59 Mbytes/s 
Fly-by Transfers 
- 
32 Mbytes/s Two-Cycle Transfers 
- 
Data Chaining 
- 
Data Packing/Unpacking 
- 
Programmable Priority Method 


• 
32-Blt Demultlplexed 
Burst Bus 
- 
128-bit Internal Data Paths to and 
3 


from Registers 
- 
Burst Bus for DRAM Interfacing 
- 
Address Plpellning Option 
- 
Fully Programmable Wait States 
- 
Supports 8, 16 or 32-blt Bus Widths 


- 
Supports Unallgned Accesses 
- 
Supervisor Protection 
Pin 
• 
Selectable Big or Little Endlan Byte 
Ordering 
• 
High-Speed Interrupt Controller 
- 
Up to 248 External Interrupts 
- 
32 Fully Programmable 
Priorities 
- 
Multi-mode 8-blt Interrupt 
Port 
- 
Four Internal DMA Interrupts 
- 
Separate, Non-maskable Interrupt 
Pin 
- 
Context Switch in 750 ns Typical 


Figure 
1. 80960CF 
Die Photo 
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1.0 
PURPOSE 


This document previews electrical characterizations 
of Intel's i960 CF embedded microprocessor (avail- 
able in 33, 25 and 16 MHz). For a detailed descrip- 
tion of any i960 CF processor functional topic-oth- 
er than parametric performance-refer 
to the latest 
i960 CA Microprocessor Reference Manual (Order 
No. 270710) and the ;960 CF Reference 
Manual Ad- 
dendum 
(Order No. 272188). 


2.0 
i960 CF PROCESSOR OVERVIEW 


Intel's i960 CF microprocessor is the performance 
follow-on product to the i960 CA processor, The 
i960 CF product is socket- and object code-compati- 
ble with the CA; this makes CA-to-CF design up- 
grades straightforward. The i960 CF processor's in- 
struction cache is 4 Kbytes (CA device has 1 Kbyte); 
CF data cache is 1 Kbyte (CA device has no data 
cache). This extra cache on the CF product adds a 
significant performance boost over the CA. The 
80960CF is object code compatible with the 32-bit 
80960 Core Architecture while including Special 
Function Register extensions to control on-chip pe- 
ripherals, and instruction set extensions to shift 64- 
bit operands and configure on-chip hardware. Multi- 
ple 128-bit internal busses, on-chip instruction cach- 
ing and a sophisticated instruction scheduler allow 
the processor to sustain execution of two instruc- 


intel~ 


tions every clock, and peak at execution of three 
instructions per clock. 


A 32-bit demultiplexed and pipelined burst bus pro- 
vides a 132 Mbyte/s bandwidth to a system's high- 
speed external memory sub-system. In addition, the 
80960CF's on-chip caching of instructions, proce- 
dure context and critical program data substantially 
decouples system performance from the wait states 
associated with accesses to the system's slower, 
cost sensitive, main memory sub-system. 


The 80960CF bus controller also integrates full wait 
state and bus width control for highest system per- 
formance with minimal system design complexity. 
Unaligned access and Big Endian byte order support 
reduces the cost of porting existing applications to 
the 80960CF. 


The processor also integrates four complete data- 
chaining DMA channels and a high-speed interrupt 
controller on-chip. The DMA channels perform: sin- 
gle-cycle or two-cycle transfers, data packing and 
unpacking, and data chaining. Block transfers, in ad- 
dition to source or destination synchronized trans- 
fers, are provided. 


The interrupt controller provides full programmability 
of 248 interrupt sources into 32 priority levels with a 
typical interrupt task switch ("latency") 
time of 
750 ns. 


FOUR·CHANNEL 
DMA CONTROLLER 


CONTROL 


INSTRUCTION PREFETCH QUEUE 


ADDRESS 


PROGRAMMABLE 
INTERRUPT CONTROLLER 


INSTRUCTION CACHE 


DATA 


('(K byte, Two-way set associative) 


MULTIPLY/DIVIDE 
UNIT 


128·BIT CACHE BUS 


SIX·PORT 
REGISTER FILE 


32·BITo 
BASE BUS 
I 
128·BIT 
LOAD BUS 
I 
128·BIT 
STORE BUS 
I 
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Figure 2. 80960CF 
Block Diagram 


2.1. The C-Series Core 


The C-Series core is a very high performance micro- 
architectural implementation of the 80960 Core Ar- 
chitecture. The C-Series core can sustain execution 
of two instructions per clock (66 MIPs at 33 MHz). 
To achieve this level of performance, Intel has incor- 
porated state-of-the-art silicon techn~logy an~ inno- 
vative microarchitectural constructs Into the Imple- 
mentation of the C-Series core. Factors that contrib- 
ute to the' core's performance include: 
- 
Parallel instruction decoding allows issue of up 
to three instructions per clock. 
- 
Most instructions execute in a single clock. 


- 
Parallel instruction decode 
allows sustained, 
simultaneous execution of two single-clock in- 
structions every clock cycle. 
- 
Efficient instruction pipeline minimizes pipeline 
break losses. 


- 
Register 
and 
resource 
scoreboarding 
allow 


simultaneous multi-clock instruction execution. 
- 
Branch look-ahead and prediction allows many 
branches to execute with no pipeline break. 


- 
Local Register Cache integrated on-chip caches 
Call/Return context. 


- 
Two-way set associative, 4 Kbyte integrated in- 
struction cache. 


- 
Direct 
mapped, 
1 
Kbyte data 
cache, 
write 


through, write allocate. , 


- 
1 Kbyte integrated Data RAM sustains a four- 
word (128-bit) access every clock cycle. 


2.2. 
Pipelined, Burst Bus 


A 32-bit high performance bus controller interfaces 
the 80960CF to external memory and peripherals. 
The Bus Control Unit features a maximum transfer 
rate of 132 Mbytes per second (at 33 MHz). Internal- 
ly programmable wait states and 16 separately c~n- 
figurable memory regions allow the processor to In- 
terface with a variety of memory subsystems with a 
minimum of system complexity and a maximum of 
performance. The Bus Controller's main features in- 
clude: 
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- 
Demultiplexed, Burst Bus to exploit most efficient 
DRAM access modes. 


- 
Address Pipelining to reduce memory cost while 
maintaining performance. 
- 
32-, 16- and 8-bit modes for I/O interfacing ease. 


- 
Full internal wait state generation to reduce sys- 
tem cost. 


- 
Little and Big Endian support to ease application 
development. 


- 
Unaligned access support for code portability. 


- 
Three-deep request queue to decouple the bus 
from the core. 


2.3. 
Flexible DMA Controller 


A four channel DMA controller provides high speed 
DMA control for data transfers involving peripherals 
and memory. The DMA provides advanced features 
such as data chaining, byte assembly and disassem- 
bly, and a high performance fly-by mode capable of 
transfer speed of up to 59 Mbytes per second at 
33 MHz. The DMA controller features a performance 
and flexibility which is only possible by integrating 
the DMA controller and the 80960CF core. 


2.4. 
Priority Interrupt Controller 


A programmable-priority interrupt controller man- 
ages up to 248 external sources through the 8-bit 
external interrupt port. The Interrupt Unit also han- 
dles the four internal sources from the DMA control- 
ler, and a single non-maskable interrupt input. ~he 
8-bit interrupt port can also be configured to provide 
individual interrupt sources that are level or edge 
triggered. 


Interrupts in the 80960CF are prioritized and sig- 
naled within 270 ns of the request. If the interrupt is 
of higher priority than the processor priority, the con- 
text switch to the interrupt routine typically is com- 
plete in another 480 ns. The interrupt unit provides 
the mechanism for the low latency and high through- 
put interrupt service which is essential for embedded 
applications. 
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....... 
Moony 
Shift 
Exclusive 
Nor 
Scan Byte for Equal 
Extended 
Not 
Multiply 
Nand 
Extended 
Divide 
Add with 
Carry 
Subtract 
with 
Carry 
Rotate 


Comparison 
Branch 
Call and Return 
Fault 


Compare 
Unconditional 
Call 
Conditional 
Conditional 
Branch 
Call Extended 
Fault 
Compare 
Conditional 
Call System 
Synchronize 
Compare 
and 
Branch 
Return 
Faults 


Increment 
Compare 
and 
Branch and Link 
Compare 
and 
Branch 


Decrement 


Test Condition 
Code 


Check Bit 


Debug 
Processor 
Atomic 
Management 


Modify Trace 
Modify 
Atomic Add 
Controls 
Process 
Atomic 
Modify 


Mark 
Controls 


Force Mark 
Modify 
Arithmetic 
Controls 


°System 
Control 


°DMA Control 
Flush Local 
Registers 


NOTE: 
Instructions 
marked 
by (0) are 80960CF 
extensions 
to the 80960 
instruction 
set. 
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., ne ~110uer- 
pms are uescr IU.,U III U11:;:;""UUII. 
I C1- 
ble 1 presents 
the legend for interpreting 
the pin de- 
scriptions 
in the' following 
tables. 


Pins associated 
with the 32-bit demultiplexed 
proc- 
essor bus are described 
in Table 2. Pins associated 
with 
basic 
processor 
configuration 
and control 
are 
described 
in 
Table 
3. 
Pins 
associated 
with 
the 
80960CF 
OMA Controller 
and Interrupt 
Unit are de- 
scribed 
in Table 4. 


Figure 
3 provides 
an example 
pin description 
table 
entry. "1/0" signifies 
that data pins are input-output. 


"S" 
indicates 
pins 
are 
synchronous 
to 
PCLK2:1, 
"H(Z)" 
indicates 
that these pins float while the proc- 
essor 
bus is in a Hold Acknowledge 
state. 
"R(Z)" 
indicates 
that the pins also float while RESET is low. 


All pins float 
while 
the 
processor 
is in the 
ONCE 
mode. 


-\-, ---_. -_ .._ ..--_ 
.. ...-- 


A( ... 
) 
Asynchronous. 
Inputs may be 
asynchronous 
to PCLK2:1. 
A(E) Edge sensitive 
input 
A(L) Level sensitive 
input 


H( ... 
) 
While the processor's 
bus is in the 
Hold Acknowledge 
or Bus Backoff 
state, the pin: 
H(1) 
is driven to vcc 
H(O) 
is driven to vss 
H(Z) 
floats 
H(Q) 
continues 
to be a valid output 


R( ... 
) 
While the processor's 
RESET pin is 
low, the pin 
R(1) 
is driven to vcc 
R(O) 
is driven to vss 
R(Z) 
floats 
R(Q) 
continues 
to be a valid output 


Name 
Type 
Description 


D31:0 
I/O 
DATA 
BUS carries 32-,16- 
or 8-bit data quantities 
depending 
on bus width configuration. 
S(L) 
The least significant 
bit of the data is carried on DO and the most significant 
on 031. When 


H(Z) 
the bus is configured 
for 8-bit data, the lower 8 data lines, 07:0 are used. For 16-bit bus 


R(Z) 
widths, 015:0 are used. For 32-bit bus widths the full data bus is used. 


Figure 
3. Example 
Pin Description 
Entry. 
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Table 2. 80960CF 
Pin Description-External 
Bus Signals 


Name 
Type 
Description 


A31:2 
0 
ADDRESS 
BUS carries the physical address upper 30 bits. A31 is the most 
S 
significant 
address bit and A2 is the least significant. 
During a bus access, A31:2 


H(Z) 
identify all external addresses 
to word (4-byte) boundaries. 
The byte enable 


R(Z) 
signals indicate the selected 
byte in each word. During burst accesses, 
A3 and A2 
increment 
to indicate 
successive 
data cycles. 


D31:0 
I/O 
DATA BUS carries 32-, 16- or 8-bit data quantities 
depending 
on bus width 
S(l) 
configuration. 
The least significant 
bit of the data is carried on DO and the most 


H(Z) 
significant 
on 031. When the bus is configured 
for 8-bit data, the lower 8 data 


R(Z) 
lines, 07:0 are used. For 16-bit bus widths, 015:0 are used. For 32-bit bus widths 
the full data bus is used. 


BE3 
0 
BYTE ENABLES 
select which of the four bytes addressed 
by A31:2 are active 
BE2 
S 
during an access t~emory 
region confi~d 
for a 32-bit data-bus 
width. BE3 


BE1 
H(Z) 
applies to 031 :24; BE2applies 
to 023:16; 
BE1 applies to 015:8; and BEO applies 


BEO 
R(1) 
to 07:0. 


32-bit bus: 
BE3 
-Byte 
Enable 3 
-enable 
031 :24 
BE2 
-Byte 
Enable 2 
-enable 
023:16 
BE1 
-Byte 
Enable 1 
-enable 
015:8 
BEO 
-Byte 
Enable 0 
-enable 
07:0 


For accesses 
to a memory r~n 
configured 
for a 16-bit data-bus 
width, ~ 
processor 
directly encodes 
BE3, BE1 and BEO to provided 
BHE, A 1 and BlE 
respectively. 


16-bit bus: 
BE3 
-Byte 
High Enable (BHE) 
-enable 
015:8 
BE2 
-Not 
used (is driven high or low) 
BE1 
-Address 
Bit 1 (A 1) 
BEO 
-Byte 
low 
Enable (BlE) 
-enable 
07:0 


For accesses 
to a memory r~n 
co~red 
for an 8-bit data bus width, the 
processor 
directly encodes 
BE1 and BEO to provide A 1 and AO respectively. 


I 
8-bit bus: 
BE3 
-Not 
used (is driven high or low) 
BE2 
-Not 
used (is driven high or low) 
BE1 
-Address 
Bit 1 (A1) 
BEO 
-Address 
Bit 0 (AO) 
• 
- 


W/R 
0 
WRITE/READ 
is asserted for read requests 
and deasserted 
for write requests. 


S 
The WIR signal changes 
in the same clock cycle as ADS. It remains valid for the 


H(Z) 
entire access in non-pipelined 
regions. In pipelined 
regions, W/R 
is not 


R(O) 
guaranteed 
valid in the last cycle of a read access. 


ADS 
0 
ADDRESS 
STROBE 
indicates 
valid address and the start of a new bus access. 
S 
ADS is asserted 
for the first clock of a bus access. 


H(Z) 
R(1) 


READY 
I 
READY is an input which signals the termination 
of a data transfer. 
READY is 
S(l) 
used to indicate that read data on the bus is valid, or that a write-data 
transfer 
has 


H(Z) 
completed. 
The READY signal works in conjunction 
with the internally 


R(Z) 
programmed 
wait-state 
generator. 
If READY is enabled 
in a region, the pin is 
sampled 
after the programmed 
number of wait-states 
has expired. 
If the READY 
pin is deasserted, 
wait states continue 
to be inserted until READY becomes 
asserted. 
This is true for the NRAD, NRDD, NWAD, and NWDD wait states. The 
NXDA wait states cannot 
be extended. 
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Table 2. 80960CF 
Pin Description-External 
Bus Signals (Continued) 


Name 
Type 
Description 


BTERM 
I 
BURST TERMINATE-The 
burst terminate 
signal breaks up a burst access and 
S(L) 
causes another address cycle to occur. The BTERM signal works in conjunction 
H(Z) 
with the internally 
programmed 
wait-state 
generator. 
If READY and BTERM are 
R(Z) 
enabled 
in a region, the BTERM pin is sampled 
after the proggammed 
number of 
wait states has expired. When BTERM is asserted, 
a new AD 
si~enerated 
and the access is completed. 
The READY input is ignored when BTERM is 
asserted. 
BTERM must be externally 
synchronized 
to satisfy the BTERM setup 
and hold times. 


WAIT 
0 
WAIT indicates 
internal wait state generator 
status. WAIT is asserted 
when wait 
S 
states are being caused by the internal wait state generator 
and not by the __ 
H(Z) 
READY or BTERM inputs. WAIT can be used to derive a write-data 
strobe. WAIT 
R(1) 
can also be thought 
of as a READY output that the processor 
provides 
when it is 
inserting wait states. 


BLAST 
0 
BURST LAST indicates 
the last transfer 
in a bus access. BLAST is asserted 
in the 
S 
last data transfer 
of burst and non-burst 
accesses 
after the wait state counter 
H(Z) 
reaches zero. BLAST remains asserted 
until the clock following 
the last cycle of 
R(O) 
the last data transfer of a bus access. 
If the READY or BTERM input is used to 
extend wait states, the BLAST signal remains asserted 
until READY or BTERM 
terminates 
the access. 


DT/R 
0 
DATA TRANSMIT IRECEIVE 
indicates 
direction 
for data transceivers. 
DTIR is 
S 
used in conjunction 
with DEN to provide control for data transceivers 
attached 
to 
H(Z) 
the external 
bus. When DTIR is asserted, 
the signal indicates 
that the processor 
R(O) 
receives 
data. Conversely, 
when deasserted, 
the processor 
sends data. DTIR 
changes 
only while DEN is high. 


DEN 
0 
DATA ENABLE indicates 
data cycles in a bus request. 
DEN is asserted 
at the 
S 
start of the bus request first data cycle and is deasserted 
at the end of the last 
H(Z) 
data cycle. DEN is used in conjunction 
with DTIR to provide control for data 
R(1) 
transceivers 
attached 
to the external 
bus. DEN remains asserted 
for sequential 
reads from pipelined 
memory regions. DEN is deasserted 
when DTIR changes. 


LOCK 
0 
BUS LOCK indicates 
that an atomic read-modify-write 
operation 
is in progress. 


S 
LOCK may be used to prevent external agents from accessing 
memory which is 
H(Z) 
currently 
involved 
in an atomic operation. 
LOCK is asserted 
in the first clock of an 
R(1) 
atomic operation, 
and deasserted 
in the clock cycle following 
the last bus access. 


for the atomic operation. 
To allow the most flexibility 
for a memory system 
enforcement 
of locked accesses, 
the processor 
acknowledges 
a bus hold request 


when LOCK is asserted. 
The processor 
performs 
DMA transfers 
while LOCK is 
active. 


HOLD 
I 
HOLD REQUEST 
signals that an external 
agent requests 
access to the external 
S(L) 
bus. The processor 
asserts HOLDA after completing 
the current bus request. 


H(Z) 
HOLD, HOLDA and BREQ are used together 
to arbitrate 
access to the 
R(Z) 
processor's 
external 
bus by external bus agents. 


BOFF 
I 
BUS BACK OFF -The 
backoff 
pin, when asserted, 
suspends 
the current access 
S(L) 
and causes the bus pins to float. When deasserted, 
the ADS signal is asserted 
on 
H(Z) 
the next clock cycle and the access is resumed. 
R(Z) 
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Table 2. 80960CF 
Pin Description-External 
Bus Signals (Continued) 


Name 
Type 
Description 


HOLDA 
0 
HOLD ACKNOWLEDGE 
indicates 
to a bus requestor 
that the processor 
has 
S 
relinquished 
control 
of the external 
bus. When HOLDA is asserted, 
the external 


H(1) 
address bus, data bus and bus control 
signals are floated. 
HOLD, BOFF, HOLDA 


R(Q) 
and BREQ are used together 
to arbitrate 
access to the processor's 
external 
bus 
by external 
bus agents. Since the processor 
grants HOLD requests 
and enters the 


Hold Acknowledge 
state even while RESET is asserted, 
HOLDA pin state is 
independent 
of the RESET pin. 


BREQ 
0 
BUS REQUEST 
is asserted 
when the bus controller 
has a request pending. 
BREQ 


S 
can be used by external 
bus arbitration 
logic in conjunction 
with HOLD and 


H(Q) 
HOLDA to determine 
when to return mastership 
of the external 
bus to the 


R(O) 
processor. 


DIC 
0 
DATA OR CODE is asserted 
for a data request and deasserted 
for instruction 
S 
requests. 
DIG has the same tiOling as W/R. 


H(Z) . 
R(Z) 


DMA 
0 
DMA ACCESS indicates 
whether 
the bus request was initiated by the DMA 
S 
controller. 
DMA is asserted 
for any DMA request. 
DMA is deasserted 
for all other 


H(Z) 
requests. 


R(Z) 


SUP 
0 
SUPERVISOR 
ACCESS 
indicates 
whether the bus request is issued while in 
S 
supervisor 
mode. SUP is asserted 
when the request has supervisor 
privileges, 
and 


H(Z) 
is deasserted 
otherwise. 
SUP can be used to isolate supervisor 
code and data 


R(Z) 
structures 
from non-supervlsor 
requests. 


Table 3. 80960CF 
Pin Description-Processor 
Control 
Signals 


Name 
Type 
Description 


RESET 
I 
RESET causes the chip to reset. When RESET is asserted, 
all external 
signals return 
A(L) 
to the reset state. When RESET is deasserted, 
initialization 
begins. When the 2·x clock 


H(Z) 
mode is selected, 
RESET must remain asserted 
for 16 PCLK2:1 cycles before being 


R(Z) 
deasserted 
in order to guarantee 
correct processor 
initialization. 
When the t-x clock 


N(Z) 
mode is selected, 
RESET must remain asserted 
for 10,000 PCLK2:1 cycles before 
being deasserted 
in order to guarantee 
correct 
initialization. 
The CLKMODE 
pin 
selects t-x or z-x input ciock division of the CLKIN pin. 


The processor's 
Hold Acknowledge 
bus state functions 
while the chip is reset. If the 
processor's 
bus is in the Hold Acknowledge 
state when RESET is asserted, 
the 
processor 
will internally 
reset, but maintains 
the Hold Acknowledge 
state on external 
pins until the Hold request is removed. 
If a hold request is made while the processor 
is 
in the reset state, the processor 
bus grants HOLDA and enters the Hold Acknowledge 
state. 


FAIL 
0 
FAIL indicates 
failure of the processor's 
self-test 
performed 
at initialization. 
When 
S 
RESET is deasserted 
and the processor 
begins initialization, 
the FAIL pin is asserted. 


H(Q) 
An internal self-test 
is performed 
as part of the initialization 
process. 
If this self-test 


R(O) 
passes, the FAIL pin is deasserted 
otherwise 
it remains asserted. 
The FAIL pin is 
reasserted 
while the processor 
performs 
an external bus self-confidence 
test. If this 
self-test 
passes, the processor 
deasserts 
the FAIL pin and branches 
to the user's 


initialization 
routine; otherwise 
the FAIL pin remains asserted. 
Internal self-test 
and the 


use of the FAIL pin can be disabled 
with the STEST pin. 
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Table 
3. 80960CF 
Pin Description-Processor 
Control 
Signals 
(Continued) 


Name 
Type 
Description 


STEST 
I 
SELF TEST causes the processor's 
internal self-test 
feature to be enabled 
or 
S(L) 
disabled 
at initialization. 
STEST is read on the rising edge of RESET. When asserted, 


H(Z) 
the processor's 
internal self-test 
and external 
bus confidence 
tests are performed 


R(Z) 
during processor 
initialization. 
When deasserted, 
only the external 
bus confidence 
tests are performed 
during initialization. 


ONCE 
I 
ON CIRCUIT 
EMULATION 
causes all outputs to be floated 
when asserted. 
ONCE is 
A(L) 
continuously 
sampled 
while RESET is low, and is latched on the rising edge of 


H(Z) 
RESET. To place the processor 
in the ONCE state: 


R(Z) 
(1) 
assert RESET and ONCE (order does not matter) 
(2) 
wait for at least 16 CLKIN periods in 2-x mode, or 10,000 CLKIN periods in 1-x 
mode, after vcc and CLKIN are within operating 
specifications 
(3) 
deassert 
RESET 
(4) 
wait at least 32 CLKIN periods 
(The processor 
is now latched in the ONCE state as long as RESET is high.) 


To exit the ONCE state, bring vcc and CLKIN to 0 erating conditions, 
then assert 
RESET and bring ONCE high prior to deasserting 
RE 
ET. 


CLKIN must operate within the specified 
operating 
conditions 
of the processor 
until 
step 4 above is completed. 
The CLKIN may then be changed 
to DC to achieve 
the 
lowest possible 
ONCE mode leakage current. 


ONCE can be used by emulator 
products 
or for board testers to effectively 
make an 
installed 
processor 
transparent 
in the board. 


CLKIN 
I 
CLOCK 
INPUT is an input for the external clock needed to run the processor. 
The 
A(E) 
external clock is internally 
divided as prescribed 
by the CLKMODE 
pin to produce 


H(Z) 
PCLK2:1. 


R(Z) 


CLKMODE 
I 
CLOCK 
MODE selects the division factor applied to the external clock input (CLKIN). 


A(L) 
When CLKMODE 
is high, CLKIN is divided by one to create PCLK2:1 and the 


H(Z) 
processor's 
internal clock. When CLKMODE 
is low, CLKIN is divided by two to create 


R(Z) 
PCLK2:1 and the processor's 
internal clock. CLKMODE 
should be tied high or low in 
a system, as the clock mode is not latched by the processor. 
If left unconnected, 
the 
processor 
internally 
pulls the CLKMODE 
pin low, enabling 
the 2-x clock mode. 


PCLK2 
0 
PROCESSOR 
OUTPUT 
CLOCKS 
provide a timing reference 
for all inputs and 
PCLK1 
S 
outputs 
of the processor. 
All inputs and output timings are specified 
in relation to 


H(Q) 
PCLK2 and PCLK1. PCLK2 and PCLK1 are identical 
signals. Two output pins are 


R(Q) 
provided to allow flexibility 
in the system's 
allocation 
of capacitive 
loading on the 
clock. PCLK2:1 may also be connected 
at the processor 
to form a single clock signal. 


Vss 
- 
GROUND 
connections 
consist of 24 pins which must be connected 
externally 
to a 
Vss board plane. 


Vcc 
- 
POWER connections 
consist of 24 pins which must be connected 
externally 
to a vcc 


board plane. 


VCCPLL 
- 
VCCPLL 
is a separate vcc supply pin for the phase lock loop used in 1x clock mode. 


Connecting 
a simple low pass filter to VCCPLL may help reduce clock jitter (TcP) in 
noisy environments. 
Otherwise, 
VCCPLL should be connected 
to VCC. 


NIC 
- 
NO CONNECT 
pins must not be connected 
in a system. 
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Table 4. 80960CF 
Pin Descrlptlon-DMA 
and Interrupt 
Unit Control 
Signals 


Name 
Type 
Description 


DREQ3 
I 
DMA REQUEST 
causes a DMA transfer to be requested. 
Each of the four signals 


DREQ2 
A(l) 
request a transfer on a single channel. 
DREOO requests 
channel 
0, DRE01 
requests 


DREQ1 
H(l) 
channel 
1, etc. When two or more channels 
are requested 
simultaneously, 
the 


DREQO 
R(l) 
channel with the highest priority is serviced 
first. Channel 
priority mode is 
programmable. 


DACK3 
0 
DMA ACKNOWLEDGE 
indicates 
that a DMA transfer 
is being executed. 
Each of the 
DACK2 
S 
four signals acknowledge 
a transfer for a single channel. 
DACKO acknowledges 


DACK1 
H(1) 
channel 
0, DACK1 acknowledges 
channel 
1, etc. DACK3:0 
are asserted 
when the 


DACKO 
R(1) 
requesting 
device of a DMA is accessed. 


EOP3/TC3 
110 
END OF PROCESS/TERMINAL 
COUNT can be programmed 
as either an input 
EOP2/TC2 
A(l) 
(EOP3:0) or as an output (TC3:0), but not both. Each pin is individually 


EOP1/TC1 
H(lIQ) 
programmable. 
When programmed 
as an input, EOPx causes the termination 
of a 


EOPO/TCO 
R(l) 
current DMA transfer 
for the channel 
corresponding 
to the EOPx pin. EOPO 
corresponds 
to channel 
0, EOP1 corresponds 
to channel 
1, etc. When a channel 
is 
configured 
for source and destination 
chaining, the EOP pin for that channel 
causes 
termination 
of only the current buffer transferred 
and causes the next buffer to be 
transferred. 
EOP3:0 are asynchronous 
inputs. 


When programmed 
as an output, the channel's 
TCx pin indicates 
that the channel 


byte count has reached 
0 and a DMA has terminated. 
TCx is driven with the same 
timing as DACKx during the last DMA transfer 
for a buffer. If the last bus request 
is 


executed 
as multiple bus accesses, 
TCx remains asserted 
for the entire bus request. 


XINT7 
I 
EXTERNAL 
IN~ERRUPT 
PINS cause interrupts 
to be requested. 
These pins can be 
XINT6 
A(E/l) 
configured 
in three modes. 


XINT5 
H(l) 
In Dedicated 
Mode, each pin is a dedicated 
external 
interrupt 
source. 
Dedicated 


XINT4 
R(l) 
inputs can be individually 
programmed 
to be level (Iow) or edge (falling) activated. 


XINT3 
In Expanded 
Mode, the a pins act together 
as an a-bit vectored 
interrupt 
source. The 


XINT2 
interrupt 
pins in this mode are level activated. 
Since the interrupt 
pins are active low, 


XINT1 
the vector number requested 
is the one's complement 
of the positive logic value 


XINTO 
place on the port. This eliminates 
glue logic to interface 
to combinational 
priority 
encoders 
which output negative 
logic. 
In Mixed Mode, XINT7:5 are dedicated 
sources 
and XINT4:0 act as the 5 most 
significant 
bits of an expanded 
mode vector. The least significant 
bits are set to 010 


internally. 


NMI 
I 
NON-MASKABLE 
INTERRUPT 
causes a non-maskable 
interrupt 
event to occur. 


A(E) 
NMI is the highest priority interrupt 
recognized. 
NMI is an edge (falling) activated 


H(l) 
source. 


R(l) 
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3.3. 
80960CF Pinout 
B0960CF 
pinout 
as viewed 
from the top side of the 
component 
(Le., pins facing down). 
Figure 4b shows 
the 
complete 
B0960CF 
pinout 
as viewed 
from 
the 
pin-side 
of the 
package 
(Le., pins facing 
up). See 
Section 
4.0, Electrical 
Specifications 
for specifica- 


tions and recommended 
connections. 


3.3.1 
80960CF 
PGA PINOUT 


Tables 
5 and 
6 list the 
B0960CF 
pin names 
with 
package 
location. 
Figure 
4-a depicts 
the complete 


Table 
5. PGA Pin Name with 
Package 
Location 
(Signal 
Order) 


Address 
Bus 
Data Bus 
Bus Control 
Processor 
Control 
I/O 


Name 
.. Location 
Name 
.. Location 
Name 
.. Location 
Name 
.... 
Location 
Name 
.. Location 


A31 ........ 
S15 
03L 
....... 
R03 
BE3 ........ 
S05 
RESET 
....... 
A16 
ORE03 
..... 
A07 


A30 ........ 
013 
030 ........ 
005 
BE2 ........ 
S06 
ORE02 
..... 
B06 


A29 ........ 
R14 
029 
........ 
S02 
BE1 ........ 
S07 
FAIL .......... 
A02 
ORE01 
..... 
A06 


A2B ........ 
014 
O?B ........ 
004 
BEO ........ 
R09 
OREOO ..... 
B05 


A27 ........ 
S16 
027 ........ 
R02 
STEST ........ 
B02 


A26 ........ 
R15 
026 ........ 
003 
W/R 
....... 
S10 
OACK3 
..... 
A10 


A25 ........ 
S17 
025 
........ 
S01 
ONCE 
........ 
C03 
OACK2 
..... 
A09 


A24 ........ 
015 
024 ........ 
R01 
AOS 
....... 
R06 
OACK1 
..... 
AOB 


A23 ........ 
R16 
023 ........ 
002 
CKLlN 
........ 
C13 
OACKO ..... 
BOB 


A22 ........ 
R17 
022 
........ 
P03 
REAOY ..... 
S03 
CLKMOOE 
.... 
C14 


A21 ........ 
016 
021 ........ 
001 
BTERM ..... 
R04 
PCLK1 ........ 
B14 
EOP/TCO 
... A11 


A20 
........ 
P15 
020 
........ 
P02 
PCLK2 ........ 
B13 
EOP/TC1 
... A12 


A19 
........ 
P16 
019 
........ 
P01 
WAIT ....... 
S12 
EOP/TC2 
... A13 


A1B ........ 
017 
01B ........ 
N02 
BLAST 
..... 
SOB 
Vss 
EOP/TC3 
... A14 


A17 
........ 
P17 
017 ........ 
N01 
Location 


A16 ........ 
N16 
016 ........ 
M01 
OT/R ....... 
S11 
C07, COB, C09, 
XINT7 
...... 
C17 


A15 ........ 
N17 
015 
........ 
L01 
OEN 
....... 
S09 
C10,C11,C12, 
XINT6 
...... 
C16 
F15, G03, G15, 
A14 ........ 
M17 
014 
........ 
L02 
H03, H15, J03, 
XINT5 
...... 
B17 


A13 
........ 
L16 
013 
........ 
K01 
LOCK 
...... 
S14 
J15, K03, K15, 
XINT4 
...... 
C15 
L03, L15, M03, 
A12 
........ 
L17 
012 
......... 
J01 
M15, 007, OOB, 
XINT3 
...... 
B16 


A11 ........ 
K17 
011 ........ 
H01 
HOLO ...... 
R05 
009, 010, 011 
XINT2 
...... 
A17 


A10 
........ 
J17 
010 ........ 
H02 
HOLOA ..... 
S04 
Vcc 
XINT1 
...... 
A15 


A9 ......... 
H17 
09 ......... 
G01 
BREO ...... 
R13 
Location 
XINTO ...... 
B15 


AB ......... 
G17 
OB ......... 
F01 
B07, B09, 


A7 ......... 
G16 
07 
......... 
E61 
O/C 
........ 
S13 
B11, B12, C06, 
NMI ........ 
015 
E15, F03, F16, 
A6 ......... 
F17 
06 
......... 
F02 
OMA ....... 
R12 
G02, H16, J02, 


A5 ......... 
E17 
05 ......... 
001 
SUP 
....... 
012 
J16, K02, K16, M02, 
M16, N03, N15, 
A4 ......... 
E16 
04 
......... 
E02 
006, R07, ROB, 
R10, R11 


VCCPLL ....... 
B10 


A3 ......... 
017 
03 ......... 
C01 
BOFF 
...... 
B01 
No Connect 


A2 ......... 
016 
02 ......... 
002 
Location 


01 ......... 
C02 
A01, A03, A04, A05, 
B03, B04, C04, C05, 
003 


00 
......... 
E03 
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Table 6. PGA Pin Name with Package Location (Pin Order) 


Address Bus 
Data Bus 
Bus Control 
Processor Control 
I/O 


Location 
.. Name 
Location 
.. Name 
Location 
.. Name 
Location 
.... 
Name 
Location 
.. Name 


A01 ......... 
NC 
C01 ......... 
03 
G01 ......... 
09 
M01 
......... 
016 
R01 ........ 
024 


A02 
....... 
FAIL 
C02 ......... 
01 
G02 ........ 
VCC 
M02 ......... 
vcc 
R02 ........ 
027 


A03 ......... 
NC 
C03 ...... 
ONCE 
G03 ........ 
vss 
M03 .......... 
Vss 
R03 ........ 
031 


A04 ......... 
NC 
C04 ......... 
NC 
G15 ........ 
Vss 
M15 .......... 
Vss 
R04 ..... 
BTERM 


A05 ......... 
NC 
C05 ......... 
NC 
G16 ......... 
A7 
M16 
......... 
VCC 
R05 ...... 
HOLO 


A06 ..... 
ORE01 
C06 ........ 
VCC 
G17 ......... 
A8 
M17 .......... 
A14 
R06 
....... 
AOS 


A07 ..... 
ORE03 
C07 ........ 
vss 
R07 ........ 
VCC 


A08 ..... 
OACK1 
C08 ........ 
Vss 
H01 ........ 
011 
N01 .......... 
017 
R08 ........ 
VCC 


A09 ..... 
OACK2 
C09 ........ 
Vss 
H02 ........ 
010 
N02 .......... 
018 
R09 ........ 
BEO 


A10 ..... 
OACK3 
C10 ........ 
Vss 
H03 ........ 
vss 
N03 .......... 
VCC 
R10 ........ 
VCC 


A11 ... EOP/TCO 
C11 ........ 
Vss 
H15 ........ 
Vss 
N15 .......... 
VCc 
R11 ........ 
VCC 


A12 ... EOP/TC1 
C12 ........ 
vse 
H16 ........ 
VCC 
N16 .......... 
A16 
R12 ....... 
OMA 


A13 ... EOP/TC2 
C13 ...... 
CLKIN 
H17 ......... 
A9 
N17 .......... 
A15 
R13 ..•... 
BREO 


A14 ... EOP/TC3 
C14 .. CLKMOOE 
R14 ........ 
A29 


A15 ...... 
XINT1 
C15 ...... 
XINT4 
J01 
........ 
012 
P01 .......... 
019 
R15 ........ 
A26 


A16 ..... 
RESET 
C16 ...... 
XINT6 
J02 
........ 
VCC 
P02 .......... 
020 
R16 ........ 
A23 


A17 ...... 
XINT2 
C17 ...... 
XINT7 
J03 
........ 
vss 
P03 .......... 
022 
R17 ........ 
A22 


J15 
........ 
vse 
P15 .......... 
A20 


B01 
...... 
BOFF 
001 
......... 
05 
J16 
........ 
VCC 
P16 .......... 
A19 
S01 ........ 
025 


B02 
..... 
STEST 
002 
......... 
02 
J17 
........ 
A10 
P17 .......... 
A17 
S02 ........ 
029 


B03 ......... 
NC 
003 ......... 
NC 
S03 ..... 
REAOY 


B04 ......... 
NC 
015 ........ 
NMI 
K01 ........ 
013 
001 .......... 
021 
S04 ..... 
HOLOA 


B05 ..... 
OREOO 
016 
......... 
A2 
K02 ........ 
VCC 
002 .......... 
023 
S05 ........ 
BE3 


B06 ..... 
ORE02 
017 
......... 
A3 
K03 ........ 
vss 
003 .......... 
026 
S06 ........ 
BE2 


B07 ........ 
VCC 
K15 ........ 
Vss 
004 .......... 
028 
S07 ........ 
BE1 


B08 ..... 
OACKO 
E01 ......... 
07 
K16 ........ 
VCC 
005 .......... 
030 
S08 
..... 
BLAST 


B09 ........ 
vcc 
E02 ......... 
04 
K17 ........ 
A11 
006 .......... 
vcc 
S09 
....... 
OEN 


B10 ..... 
VCCPLL 
E03 ......... 
00 
007 .......... 
Vss 
S10 
....... 
W/R 


B11 
........ 
VCC 
E15 ........ 
VCC 
L01 ........ 
015 
008 .......... 
vss 
S11 ....... 
OT/R 


B12 ........ 
VCC 
E16 ......... 
A4 
L02 ........ 
014 
009 .......... 
Vss 
S12 ....... 
WAIT 


B13 
..... 
PCLK2 
E17 ......... 
A5 
L03 ........ 
Vss 
010 .......... 
Vss 
S13 ........ 
0/C 


B14 
..... 
PCLK1 
L15 ........ 
Vss 
011 .......... 
Vss 
S14 
...... 
LOCK 


B15 ...... 
XINTO 
F01 ......... 
08 
L16 
........ 
A13 
012 
......... 
SUP 
S15 ........ 
A31 


B16 ...... 
XINT3 
F02 ......... 
06 
L17 
........ 
A12 
013 .......... 
A30 
S16 ........ 
A27 


B17 ...... 
XINT5 
F03 ........ 
VCC 
014 .......... 
A28 
S17 ........ 
A25 


F15 
........ 
VSS 
015 .......... 
A24 


F16 ........ 
VCC 
016 .......... 
A21 


F17 
...•..... 
A6 
017 .......... 
A18 
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S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
0 
C 
8 
A 
r 
c 
024 
019 
016 
08 
m 
NC 
025 
021 
017 
015 
013 
012 
011 
09 
07 
05 
03 


2 
c 
~ 
2 
029 
027 
023 
020 
018 
v;; 
014 
Vcc 
Vcc 
010 
Vcc 
06 
04 
02 
01 
STEST 
FAtL 


3 
~ 
~ 
3 
IlEAOV 
031 
026 
022 
Vee 
Vss 
Vss 
Vss 
Vss 
V~ 
V~ 
Vcc 
00 
~ 
oNcE 
~ 
Ne 


4 
~ 
4 
HOLOA ~ 
028 
Ne 
Ne 
Ne 
5 
BE3 
HaLO 
030 
Ne 
oREOO 
Ne 
5 


6 
BE2 
ADS 
v~ 
c 
~E02 
"'-- 
6 
Vcc 
OREQl 
7 
rn 
'-<-- 
7 
vcc 
Vss 
Vss 
vcc 
DRE03 


8 
BLAST 
V~e 
Vss 
Vss 
OACKO 
'iI'Aoo 
8 


9 
IIDl 
ID 
v~ 
Vss 
v~ 
~ 
9 • 


10 
wffl 
~ 
~ 


v~ 
V~PLL 
'l>'.i.oo 
10 
Vcc 
Vss 
11 
oTm 
v~e 
v~ 
v~ 
v~e 
rn.~ 
11 


12 
wi.ff 
-"'-- 
sUP 
v~ 
E'&s1fCf 
12 
OMA 
vss 
13 
of{ 
BREa 
A30 
CLKIN PCLK2 
rn.1fC2 
13 


14 
~ 
..\29 
eLKMO~DE 
PCLKl 
rn.1I'a 
14 
A28 


15 
~ 
15 
..\24 
A20 
V~ 
V~ 
v;; 
v;; 
V;s 
""-- 
--'-'--- 
---"'- 
XINT1 
A31 
A26 
Vss 
Vss 
Vss 
VCC 
NMI 
XINT4 
XINTO 


16 
Ai? 
A23 
A21 
A19 
A16 
v~ 
A13 
Vcc 
Vcc 
v~c 
A7 
vce 
A4 
A2 
XM6 
XINn 
RESET 
16 


17 
Ais 
..\22 
M8 
A14 
A12 
All 
Alo 
Air 
A5 
A3 
XINT7 
xms 
~ 
17 
A17 
A15 
A9 
A6 
~ 


S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
0 
C 
8 
A 


272187-2 


Figure 4a. 80960CF 
PGA Pin out (View from Top Side) 
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A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 


17 


000 
mPtrn2 PCLK2 CLKIN 


o 
AS 


o 
0 
Vss 
Vss 
o 
0 
Vcc 
A7 
o 
0 
A6 
AS 


METAL LID 


mP~ 
PC'rKl C~KMOO"E------------------~ 


~..sLO 
0 
0 
XINTl 
XINTO XiNi'4 
mr vcc 


~ 
x~~ 
2 
B 


o 
0 
0 
0 
0 
0 
0 
0 
0 
000 
0 
0 
0 
0 
0 
NC lIOl'F 
03 
05 
07 
OS 
09 
011 
012 
013 
015 
016 
017 
019 
021 
024 
025 


oIm! 
om 


o.. 
WAIT 


o 
0 
0 
0 
0 
0 
0 
0 
0 
000 
0 
0 
0 
0 
0 
FAJ[ STEST 01 
02 
04 
06 
VCC 
010 
Vcc 
VCC 
014 
VCC 
01S 
020 
023 
027 
029 
o 
0 
0 
0 
0 
000 
0 
0 
0 
0 
0 
0 
0 
0 
~ 


NC 
NC 
ONCE 
NC 
DO 
Vcc 
Vss 
Vss 
Vss 
Vss Vss 
VSS 
Vcc 
022 
026 
031 
"~u. 
000 
NC 
NC 
NC 


o 
0 
0 
0 
"ss 
Vss 
VSS 
"ss 
o 
0 
0 
0 
Vcc 
Vcc 
VCC 
A13 
o 
0 
0 
0 
A9 
Al0 
All 
A12 


o 
VSS 
o 
VCC 
o 
A14 


o 
VCC 
o 
A16 


000 
0 
A20 
A24 
A26 
A31 


02~ 
• 
£LOA 


03~ 
HgO 
~ 


Q 
ADS 


o 
0 
0_ 
A30 
BREa 
OIC 


A2~ 
R9 
~ 


o 
A15 


o 
0 
0 
0 
A19 
A21 
A23 
A27 
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2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


15 


16 


17 


o 


!5iIEOl 
o 
~ 


o 
0 
0 
0 
Al7 
A1S 
A22 
A25 


ABCDEFGHJKLMNPQRS 
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PGA Plnout (View from Bottom 
Side) 


A26 
........ 
143 
026 
........ 
192 
W/R ........ 
164 
OACK3 
..... 
065 


A25 
........ 
142 
025 
........ 
194 
ONCE ............ 
043 
OACK2 
..... 
064 


A24 
........ 
141 
024 
........ 
195 
ADS ........ 
178 
OACK1 
..... 
063 


A23 
........ 
139 
023 
........ 
003 
CLKIN ............ 
087 
OACKO ..... 
062 


A22 
........ 
138 
022 
........ 
004 
READy 
..... 
182 
CLKMOOE 
........ 
085 


A21 
........ 
137 
021 
........ 
005 
STERM 
..... 
184 
PCLK1 
........... 
078 
EOP ITC3 
... 069 


A20 
........ 
136 
020 
........ 
006 
PCLK2 
........... 
074 
EOP ITC2 
... 068 


A19 
........ 
134 
019 
........ 
008 
WAIT ....... 
162 
EOP/TC1 
... 067 


A18 
........ 
133 
018 
........ 
009 
BLAST ...... 
169 
VSS 
EOP ITCO ... 066 


A17 
........ 
132 
017 
........ 
010 
Location 


A16 
........ 
130 
016 
........ 
011 
OT/R ....... 
163 
2,7,16,24,30,38, 
XINT7 
...... 
107 


A15 
........ 
129 
015 
........ 
013 
DEN ........ 
167 
39, 49, 56, 70, 75, 
XINT6 
...... 
106 


A14 
........ 
128 
014 
........ 
014 
77,81,83,88,89, 
XINT5 
...... 
102 
92,98,105,109,110, 


A13 
........ 
124 
013 
........ 
015 
LOCK 
...... 
156 
121,125,131,135, 
XINT4 
...... 
101 


A12 
........ 
123 
012 
........ 
017 
147,150,161,165, 
XINT3 
...... 
100 


HOLD 
...... 
181 
173,174,185,196 
XINT2 
...... 
095 
A11 
........ 
122 
011 
........ 
018 


A10 
........ 
120 
010 
........ 
019 
HOLOA 
..... 
179 
Vcc 
XINT1 
...... 
094 


A9 
....•.... 
119 
09 
......... 
021 
SREQ 
...... 
155 
Location 
XINTO ...... 
093 


A8 
· ........ 
118 
08 
......... 
022 
1,12,20,28,32,37,44, 


A7 
......... 
117 
07 
......... 
023 
O/C 
........ 
159 
50, 61, 71, 79, 82, 96, 
NMI ........ 
108 


A6 
......... 
116 
06 
......... 
025 
OMA 
....... 
160 
99,103,115,127,140, 
148,154,168,171,180, 
A5 
......... 
114 
05 
......... 
026 
SUP ........ 
158 
190 
, 


A4 
· ........ 
113 
04 
......... 
027 
VCCPLL ............ 
72 


A3 
......... 
112 
03 
......... 
033 
SOFF ....... 
040 
No Connect 


A2 
· ........ 
111 
02 
......... 
034 
Location 


01 
......... 
035 
29,31,41,42,47, 
48,51,52,53, 
54, 55, 73, 76, 
80,84,86, 
90, 97, 


104,126,146,149,157, 
166,177,183,193 


DO ......... 
036 
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Table 8. PQFP Pin Name with Package 
Location 
(Pin Order) 


Pin 
Signal 
1 
Vcc 
2 
Vss 
3 
023 
4 
022 
5 
021 
6 
020 
7 
Vss 
8 
019 
9 
018 
10 
017 
11 
016 
12 
Vcc 
13 
015 
14 
014 
15 
013 
16 
Vss 
17 
012 
18 
011 
19 
010 
20 
Vcc 
21 
09 
22 
08 
23 
07 
24 
Vss 
25 
06 
26 
05 
27 
04 
28 
Vcc 
29 
NC 
30 
Vss 
31 
NC 
32 
Vcc 
33 
03 
34 
02 
35 
01 
36 
DO 
37 
Vcc 
38 
Vss 
39 
Vss 
40 
BOFF 
41 
NC 
42 
NC 
43 
ONCE 
44 
Vcc 
45 
FAIL 
46 
STEST 
47 
NC 
48 
NC 
49 
Vss 
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Pin 
Signal 
50 
Vcc 
51 
NC 
52 
NC 
53 
NC 
54 
NC 
55 
NC 
56 
Vss 
57 
OREQO 
58 
OREQ1 
59 
OREQ2 
60 
OREQ3 
61 
Vcc 
62 
OACKO 
63 
OACK1 
64 
OACK2 
65 
OACK3 
66 
EOPO/TCO 
67 
EOP1/TC1 
68 
EOP2/TC2 
69 
EOP3/TC3 
70 
Vss 
71 
Vcc 
72 
VCCPLL 
73 
NC 
74 
PCLK2 
75 
Vss 
76 
NC 
77 
Vss 
78 
PCLK1 
79 
Vcc 
80 
NC 
81 
Vss 
82 
Vcc 
83 
Vss 
84 
NC 
85 
CLKMOOE 
86 
NC 
87 
CLKIN 
88 
Vss 
89 
Vss 
90 
NC 
91 
RESET 
92 
Vss 
93 
XINTO 
94 
XINT1 
95 
XINT2 
96 
Vcc 
97 
NC 
98 
Vss 


Pin 
Signal 
99 
Vcc 
100 
XINT3 
101 
XINT4 
102 
XINT5 
103 
Vcc 
104 
NC 
105 
Vss 
106 
XINT6 
107 
XINT7 
108 
NMI 
109 
Vss 
110 
Vss 
111 
A2 
112 
A3 
113 
A4 
114 
A5 
115 
Vcc 
116 
A6 
117 
A7 
118 
A8 
119 
A9 
120 
A10 
121 
Vss 
122 
A11 
123 
A12 
124 
A13 
125 
Vss 
126 
NC 
127 
Vcc 
128 
A14 
129 
A15 
130 
A16 
131 
Vss 
132 
A17 
133 
A18 
134 
A19 
135 
Vss 
136 
A20 
137 
A21 
138 
A22 
139 
A23 
140 
Vcc 
141 
A24 
142 
A25 
143 
A26 
144 
A27 
145 
A28 
146 
NC 
147 
Vss 


Pin 
Signal 
148 
Vcc 
149 
NC 
150 
Vss 
151 
A29 
152 
A30 
153 
A31 
154 
Vcc 
155 
BREQ 
156 
LOCK 
157 
NC 
158 
SUP 
159 
O/C 
160 
OMA 
161 
Vss 
162 
WAIT 
163 
OT/R 
164 
W/R 
165 
Vss 
166 
NC 
167 
DEN 
168 
Vcc 
169 
BLAST 
170 
BEO 
171 
Vcc 
172 
BE1 
173 
Vss 
174 
Vss 
175 
BE2 
176 
BE3 
177 
NC 
178 
ADS 
179 
HOLOA 
180 
Vcc 
181 
HOLD 
182 
READY 
183 
NC 
184 
BTERM 
185 
Vss 
186 
031 
187 
030 
188 
029 
189 
028 
190 
Vcc 
191 
027 
192 
026 
193 
NC 
194 
025 
195 
024 
196 
Vss 
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98 
50 


99 
49 


147 
PIN 1 


148 
196 
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Figure 4c. 80960CF 
PQFP Plnout (View from Top Side) 
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3.4. 
Mechanical Data 


3.4.1 
CERAMIC 
PGA PACKAGE 


~1.65 
@ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ @ 
R~@@@@@@@@@@@@@@@@@ 
.-@@@@@@@@@@@@@@@@@ 
T 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
-- 
@@@ 
@@@ 
@@@ 
e e e 
( 
\ 
e e e 
e e e 
e e e 
D 
@@@ 
@@@ 
PINC3~ 
@ @ @ 
'-...../ 
e e e 
@@@ 
@@@ 
rn 


O@@@@ 
@ 
@ 
@@@ 
@@O@@@@@@@@@@@i'{@;@ 
@@@@@@@@@@@@@@@o@ 
~@@@@@@@@@@@@@@@o@ 
r- 
SWAGGED \ 
~:;~ 
REF. 
PIN 
450CHAIIFER 
(4PL) 
(INDEX 
CORNER) 


SEATING_ 
PLANE 
_ 
l~~~tr 


SEATING 
PLANE 
"1 


~B(ALL 
PINS) 
I 


{=~ 


SWAGGED 
PIN 
DETAIL 


272187-5 


! ---- 
~---------- - 
~- 
f-- L 
BASE 
- 
A2 
PLANE- 


Family: Ceramic 
Pin Grid Array Package 


Symbol 
Mllllmeters 
Inches 


Mln 
Mu 
Notes 
Mln 
Mu 
Notes 


A 
3.56 
4.57 
0.140 
0.180 


A1 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
23 
0.30 
SOLID LID 
0.110 
0.140 
SOLID LID 


Aa 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


0 
44.07 
44.83 
1.735 
1.765 


01 
40.51 
40.77 
1.595 
1.605 


91 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
168 
168 


S1 
1.52 
2.54 
0.060 
0.100 


ISSUE 
IWS 
REV X 
7115/88 


Figure 5. 16B-Lead Ceramic 
PGA Package 
Dimensions 
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Table 9. Ceramic 
PGA Package 
Dimension 
Symbols 


Letter or 
Description 
of Dimensions 
Symbol 


A 
Distance from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


A2 
Distance 
from base plane to highest point of body 


A3 
Distance 
from seating plane to bottom of body 


B 
Diameter 
of terminal 
lead pin 


0 
Largest overall package 
dimension 
of length 


01 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing between 
true lead position 
centerlines 


L 
Distance from seating plane to end of lead 


51 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. Controlling 
dimension: 
millimeter. 


2. Dimension 
"'e1"' ("'e"') is non-cumulative. 


3. Seating 
plane 
(standoff) 
is defined 
by P.C. board 
hole size: 0.0415-0.0430 inch. 
4. Dimensions 
"'S"', "'S1"' and "'C"' are nominal. 


5. Details 
of Pin 1 identifier 
are optional. 
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3.4.2 PLASTIC QUAD FLAT PACKAGE 


272187-6 
mm (inch) 


Figure 6. Principal Dimensions 
and Data 


riM 


~ 
1l.25 
('1l111)@ 
CAS 
-B S 
os'lA 


.1l1l2 MM/MM<IN/IN) 
A-B 


01 
~ 
1l.25 ('1l111)@ 
CAS 
-B® 
0®]A 


.1l1l2 MM/MM(IN/IN) 
A-B 
• 


E2 
El 
-Yl--+:J-' 


3.81 
(.1511) MAXTYP 
• 


rSEE 
OETAILM 


) 
,_/ 
- I- 1. 91 ('1l75) 
MAXTYP 
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Ill.25 
(.1l111)@ ICIA®-B® 
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1l.25 
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272187-7 
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Figure 7. Molded 
Details 


t 
1.32 
('1l52) 
1.22 
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2.113 (.1l811) 
1. 93 ('lllb) 


----02 
-----i 
272187-8 
mm (inch) 


Figure 8. Detail M 
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i\- SEE DETAIL L 
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Figure 
9. Terminal 
Details 
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Detail 
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Figure 
10. Typical 
Lead 


Table 
111.PQFP Package 
Dimension 
Symbols 


Symbol 
Description 
Min 
Max 
Min 
I 
Max 


N 
Leadcount 
196 
196 


A 
Package 
Height 
0.160 
0.1'70 
4.06 
4.32 


A1 
Standoff 
0.020 
0.030 
0.51 
0.76 


D,E 
Terminal 
Dimension 
1.475 
1.485 
37.47 
37.72 


01, E1 
Package Body 
1.347 
1.353 
34.21 
34.37 


D2,E2 
Bumper Distance 
1.497 
1.503 
38.02 
38.18 


D3,E3 
Lead Dimension 
1.200 REF 
30.48 REF 


D4,E4 
Foot Radius Location 
1.423 
1.437 
36.14 
36.49 


L1 
Foot Length 
0.020 
0.030 
0.51 
0.76 


Dimension 
INCH 
mm 


NOTES: 
1. All dimensions 
and tolerances 
conform 
to ANSI Y14.5M-1982. 


2. Datum 
plane -H- located 
at the mold parting 
line and coincident 
with the bottom 
of the lead where 
lead exits plastic 
body.' 


3. Datums 
A-S and -0- to be determined 
where 
center 
leads exit plastic 
body at datum 
plane -H-. 
4. Controlling 
Dimension, 
Inch. 


5. Dimensions 
01, 
02. 
E1 and 
E2 are measured 
at the 
mold 
parting 
line. 
01 
and 
E1 do not include 
an allowable 
mold 


protrusion 
of 0.18 mm (0.007 
in) per side. 02 and E2 do not include 
a total allow/ilble 
mold protrusion 
of 0.16 mm (0.007 in) 


at maximum 
package 
size. 


6. Pin 1 identifier 
is located 
within 
one of the two zones 
indicated. 


7. Measured 
at datum 
plane -H-. 
8. Measured 
at seating 
plane datum 
-Co. 
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3.5. Package Thermal Specifications 


The 
80960CF 
is specified 
for 
operation 
when 
TC 
(the case temperature) 
is within 
the range 
of O·C- 
100·C. Tc may be measured 
in any environment 
to 
determine 
whether 
the 80960CF 
is within 
specified 
operating 
range. The case temperature 
is measured 
at the center 
of the top surface, 
opposite 
the pins. 


Refer to Figure 13. 


TA 
(the 
ambient 
temperature) 
can 
be 
calculated 
from 8CA (thermal 
resistance 
from case to ambient) 
with the following 
equation: 


Table 11 shows the maximum 
TA allowable 
(without 
exceeding 
Tcl at various 
airflows 
and operating 
fre- 
quencies 
(fpClK)' 


Note that TA is greatly improved 
by attaching 
fins or 
a heat sink to the package. 
P (the maximum 
power 
consumption) 
is calculated 
by using the typical 
Icc 
as tabulated 
in Section 
4.4, DC Specifications, 
and 
vcc of 5V. 


Table 
11. Maximum 
TA at Various 
Airflows 
In ·C (PGA Package 
Only) 


Alrflow-ft/mln 
(m/sec) 


fpCLK 
0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


TA 
33 
38 
57 
74 
76 
81 
84 
with 
25 
50 
65 
79 
81 
85 
87 
Heat Sink" 
16 
63 
74 
84 
86 
89 
90 


TA 
33 
18 
33 
47 
57 
66 
67 
without 
25 
34 
46 
57 
65 
72 
74 
Heat Sink 
16 
51 
60 
68 
74 
80 
81 
. . 
'0.285" 
high Unidirectional 
heat sink (AI alloy 6061,50 
mil fin width, 150 mil center-to-center 
fin spacing) . 


PGA Thermal 
Reslstance-·C/Watt 


Alrflow-ft.lmln 
(m/sec) 


Parameter 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.07) 
(4.06) 
(5.07) 
0J. 


8 Junction-to-Case 
8Jp1n r-'\ 
O.Jc 
(Case Measured 
1.5 
1.5 
1.5 
1.5 
1.5 
1.5 
as shown in Figure 13) 
I 
8J cop 1 


8 Case-to-Ambient 
17 
14 
11 
9 
6.6 
UUlJ 
UUIJ 
(No Heatsink) 
7.1 


272187-11 
8 Case-to-Ambient 
(with Unidirectional) 
13 
9 
5.5 
5.0 
3.9 
3.4 


Hsatsink)" 


NOTES: 
1. This table 
applies 
to 80960CF PGA plugged 
into socket 
or soldered 
directly 
into board. 


2. (JJA = (JJC + (JCA· 
3. (JJ.CAP = 4'C/W 
(approx.) 
(JJ.PIN = 4'C/W 
(inner pins) (approx.) 
(JJ.PIN = 8'C/W 
(outer pins) (approx.) 
• 0.285" high unidirectional 
heat sink (AI alloy 6061,50 mil fin width, 150 mil 
center-to-center 
fin spacing). 


Figure 
11. 80960CF 
PGA Package 
Thermal 
Characteristics 
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PQFP Thermal 
Reslstance-°C/Watt 


Alrflow-ft.lmln 
(m/sec) 


Parameter 
0 
50 
100 
200 
400 
600 
800 
(0) 
(0.25) 
(0.50) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 


() Junction-to-Case 
(Case Measured) 
5 
5 
5 
5 
5 
5 
5 
as shown in Figure 13) 


() Case-to-Ambient 
19 
18 
17 
15 
12 
10 
9 
(No Heatsink) 


NOTES: 
1. This table applies 
to 80960CF 
PQFP soldered 
directly 
into board. 
2. (JJA = (JJC + (JCA· 
3. (JJL = 18"C/Watt 
(JJS = 18"C/Watt 


272187-12 


Figure 12. 80960CF 
PQFP Package 
Thermal 
Characteristics 


IoIEASURE 
PGA 
CASE 
TEIoIPERATURE 


AT 
CENTER 
OF TOP 
SURF ACE 


IoIEASURE 
PQFP 
TEIoIPERATURE 
AT 
CENTER 
OF 
TOP 
SURFACE 


272187-14 


272187-13 


Figure 13. Measuring 
80960CF 
PGA and PQFP Case Temperature 
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3.6 Stepping Register Information 


Upon Reset, Register GO contains die stepping in- 
formation. The following figure shows how GO is 
configured. The most significant byte contains an 
ASCII O.The upper middle byte contains an ASCII C. 
The lower middle byte contains an ASCII F. The 
least significant byte contains the stepping number 
in ASCII. GOretains this information until it is written 
over by the user program. 


Table 12 contains a cross reference of the number 
in the least significant byte of register GOto the die 
stepping number. 


Stepping 
Number 
ASCII 


DECIMAL 
Stepping 
Number 


MSB 
LSB 


Figure 14. Register GO 


Table 12. Die Stepping Cross Reference 


GOLeast 
Die Stepping 
Significant Byte 


01 
A 


02 
B 


03 
C 
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3.7 Suggested Sources for 80960CF 
Accessories 


The following are some suggested sources of ac- 
cessories for the 80960CF. They are neither an 
endorsement of any kind, nor a warranty of the 
performance of any of the listed products and/or 
companies. 


Sockets 


1. 3M Textool Test and Interconnection Products 
Department 
=.o. Box 2963 
Austin, TX 
78769-2963 


2. Augat, Inc. 
Interconnection Products Group 
33 Perry Avenue 
p.a. Box 779 
Attleboro, MA 
02703 
(508) 222-2202 


3. Concept Manufacturing Inc. 
(Decoupling Sockets) 
43024 Christy Street 
Fremont, CA 
94538 
(415) 651-3804 


Heat Sinks/Fins 


1. Thermalloy, Inc. 
2021 West Valley View Lane 
Dallas, TX 
75381-0839 
(214) 243-4321 


2. E G & G Division 
60 Audubon Road 
Wakefield, MA 
01880 
(617) 245-5900 


Operating 
Conditions 
(80960CF-33, 
-25, -16) 


Symbol 
Parameter 
Min 
Max 
Units 
Notes 


VCC 
Suppiy Voltage 
80960CF-33 
4.75 
5.25 
80960CF-25 
4.50 
5.50 
V 
80960CF-16 
4.50 
5.50 


fCLK2x 
Input Clock Frequency 
(2-x Mode) 
80960CF-33 
0 
66.66 
MHz 
80960CF-25 
0 
50 
MHz 
80960CF-16 
0 
32 
MHz 


fCLK1x 
Input Clock Frequency 
(t-x Mode) 
80960CF-33 
8 
33.33 
MHz 
80960CF-25 
8 
25 
MHz 
(1) 
80960CF-16 
8 
16 
MHz 


Tc 
Case Temperature 
Under Bias 
PGA Package 
0 
100 
·C 
80960CF-33, 
-25, -16 
196-Pin PQFP 
0 
100 . 


NOTE: 
(1) When in the t-x input clock mode, CLKIN is an input to an internal phase-locked loop and must maintain a minimum 
frequency of 8 MHz for proper processor operation. However, in the t-x Mode, CLKIN may still be stopped when the 
processor either is in a reset condition or is reset. If CLKIN is stopped, the specified RESET low time must be provided once 
CLKIN restarts and has stabilized. 


4.3 Recommended Connections 
Low 
inductance 
capacitors 
and 
interconnects 
are 


recommended 
for best high frequency 
electrical 
per- 


formance. 
Inductance 
can be reduced 
by shortening 


board traces between 
the processor 
and decoupling 


capacitors 
as much as possible. 
Capacitors 
specifi- 
cally designed 
for PGA packages 
will offer the low- 


est possible 
inductance. 
. 


Power 
and 
ground 
connections 
must 
be made 
to 
multiple 
VCC.and Vss 
(GND) pins. Every 80960CF- 
based circuit board should 
include 
power (Vccl 
and 
ground 
(Vss) 
planes 
for 
power 
distribution. 
Every 
VCC pin must be connected 
to the power plane, and 
every 
Vss 
pin 
must 
be connected 
to the 
ground 
plane. 
Pins identified 
as "N.C." 
must 
not 
be con- 
nected 
in the system. 
For reliable 
operation, 
always 
connect 
unused 
in- 


puts to an appropriate 
si~ 
level. In particular, 
any 


unused 
interrupt 
(XINT, NMI) or DMA (DREQ) 
input 


should be connected 
to VCC through 
a pull-up resis- 


tor, as should 
BTERM 
if not used. Pull-up resistors 
should 
be in the range 
of 20 KO for each 
pin tied 


high. If READY 
or HOLD are not used, the unused 


input 
should 
be 
connected 
to 
ground. 
N.C. 
pins 
must 
always 
remain 
unconnected. 
Refer 
to the 


i960 CA Microprocessor 
Reference 
Manual for more 


information. 


Liberal 
decoupling 
capacitance 
should 
be 
placed 
near the 80960CF. 
The processor 
can cause 
tran- 
sient power 
surges 
when 
its numerous 
output 
buff- 
ers transition, 
particularly 
when 
connected 
to large 
capacitive 
loads. 
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4.4. 
DC Specifications 


DC Characteristics 
(80960CF-33, 
-25, -16 under the conditions 
described 
in Section 4.2, Operating Conditions.) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


VIL 
Input Low Voltage for all pins except RESET 
-0.3 
0.8 
V 


VIH 
Input High Voltage for all pins except RESET 
2.0 
vcc + 0.3 
V 


VOL 
Output Low Voltage 
0.45 
V 
IOL= 5mA 


VOH 
Output High Voltage 
IOH = -1mA 
2.4 
V 


IOH = - 2OO!lA 
vcc - 
0.5 
V 


VILR 
Input Low Voltage for RESET 
- 
0.3 
1.5 
V 


VIHR 
Input High Voltage for RESET 
3.5 
vcc + 0.3 
V 


IU1 
Input Leakage Current for each pin except 
STERM, ONCE, DREQ3:0, STEST, 
EOP3:0/TC3:0, 
NMI, XINT7:0, 
READY, HOLD, SOFF, CLKMODE 
±15 
!lA 
OV:O;:VIN~VCC (1) 


IU2 
Input Leakage Current for: 
STERM, ONCE, DREQ3:0, 
STEST, 
EOP3:0/TC3:0, 
NMI, XINT7:0, 
SOFF 
0 
-300 
!lA 
VIN = 0.45V (2) 


IU3 
Input Leakage Current for: 
READY, HOLD, CLKMODE 
0 
500 
!lA 
VIN = 2.4V (3) 


ILO 
Output Leakage 
Current 
±15 
!lA 
0.45V~VOUT~VCC 


'cc 
Supply Current (80960CF-33) 


IccMax 
1150 
mA 
(4) 


Icc Typ 
960 
(5) 


Ice 
Supply Current (80960CF-25) 


IccMax 
950 
mA 
(4) 


Icc Typ 
775 
(5) 


'cc 
Supply Current (80960CF-16) 


IccMax 
750 
mA 
(4) 


Icc Typ 
575 
(5) 


IONCE 
ONCE-mode 
Supply Current 
150 
mA 


CIN 
Input Capacitance 
for: 
CLKIN, RESET, ONCE, 
READY, HOLD, DREQ3:0, 
SOFF 
XINT7:0, 
NMI, STERM, CLKMODE 
0 
12 
pF 
Fc = 1 MHz 


COUT 
Output Capacitance 
of each output pin 
12 
pF 
Fc = 1 MHz, (6) 


CI/O 
1/0 Pin Capacitance 
12 
pF 
Fc = 1 MHz 


NOTES: 
(1) No Pull-up or pull-down. 
(2) These pins have internal pullup resistors. 
(3) These pins have internal pulldown resistors. 
(4) Measured at worst case frequency, vcc and temperature, with device operating and outputs loaded to the test conditions 
described in section 
4.5.1, AC Test Conditions. 
(5) Ice Typical is not tested. 
(6) Output Capacitance is the capacitive load of a floagn~ output. 
(7) CLKMODE pin has a pulldown resistor only when 
N E pin is deasserted. 
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4.5 
AC Specifications 


AC Characteristics 
- 
80960CF-33 


(80960CF-33 
only, under the conditions 
described 
in Section 
4.2, Operating 
Conditions 
and Section 
4.5.1, 
AC Test 
Conditions.) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK(10) 


TF 
CLKIN Frequency 
0 
66.66 
MHz 
(1) 


Tc 
CLKIN Period 
In 1-x Mode (fCLK10 
30 
125 
ns 
(1,12) 


In 2-x Mode (fCLK2xl 
15 
00 
ns 
(1) 


Tcs 
CLKIN Period Stability 
In 1-x Mode (fCLK10 
'±0.1% 
A 
(1,13) 


TCH 
CLKIN High Time 
In t-x Mode (fCLK10 
6 
62.5 
ns 
(1,12) 


In 2-x Mode (fCLKW 
6 
00 
ns 
(1) 


TCL 
CLKIN Low Time 
In t-x Mode (fCLK10 
6 
62,5 
ns 
(1,12) 


In 2-x Mode (fCLKW 
6 
00 
ns 
(1) 


TCR 
CLKIN Rise Time 
0 
6 
ns 
(1) 


TCF 
CLKIN Fall Time 
0 
6 
ns 
(1) 


OUTPUT CLOCKS(9) 


Tcp 
CLKIN to PCLK2:1 Delay 
In t-x Mode (fCLK10 
-2 
2 
ns 
(1,3,13,14) 


In 2-x Mode (fCLKW 
2 
25 
ns 
(1,3) 


T 
PCLK2:1 Period 
in t-x Mode (fCLK10 
Tc 
ns 
(1,13) 


In 2-x Mode (fCLKW 
2Tc 
ns 
(1,3) 


TpH 
PCLK2:1 High Time 
(T/2) - 
2 
T/2 
ns 
(1,13) 


TpL 
PCLK2:1 Low Time 
(T/2) - 
2 
T/2 
ns 
(1,13) 


TpR 
PCLK2:1 Rise Time 
1 
4 
ns 
(1,3) 


TPF 
PCLK2:1 Fall Time 
1 
4 
ns 
(1,3) 


SYNCHRONOUS OUTPUTS(10) 


Tov 
Output Valid Delay, Output Hold 
(6,11) 


TOH 
TOV1' TOH1 
A31:2 
3 
14 
ns 
TOV2' TOH2 
BE3:0 
3 
16 
ns 
TOV3,TOH3 
~ 
6 
18 
ns 
TOV4,TOH4 
W/R 
3 
18 
ns 
TOV5,TOH5 
D/G, SOP, DMA 
4 
16 
ns 
TOV6,TOH6 
BLAST, WAIT 
5 
16 
ns 
Tovz. TOH? 
DEN 
3 
16 
ns 
TOV8,TOH8 
HOLDA,BREQ 
4 
16 
ns 
TOV9,TOH9 
LOCK 
4 
16 
ns 
ToV1O,TOH10 
DACK3:0 
4 
18 
ns 
TOV11, TOH11 
D31:0 
3 
16 
ns 
TOV12'TOH12 
DT/R 
T/2 + 3 
T/2 + 14 
ns 
TOV13,TOH13 
FAIL 
2 
14 
ns 
(6, 11) 
TOV14'TOH14 
EOP3:0/TC3:0 
3 
18 
ns 


TOF 
Output Float for all outputs 
3 
22 
ns 
(6) 


SYNCHRONOUS INPUTS(10) 


TIS 
Input Setup 
TIS1 
D31:0 
3 
ns 
(1,11) 


TIS2 
BOFF 
17 
ns 
(1,11) 


TIS3 
BTERM/READY 
7 
ns 
(1,11) 


TIS4 
HOLD 
7 
ns 
(1,11) 


TIH 
Input Hold 
TIH1 
D31:0 
5 
ns 
(1,11) 


TIH2 
BOFF 
5 
ns 
(1,11) 


TIH3 
BTERM/READY 
2 
ns 
(1,11) 


TIH4 
HOLD 
3 
ns 
(1,11) 
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AC 
Characteristics 
- 
80960CF-33 
80960CF-33 
only, 
under 
the 
conditions 
described 
in Section 
4.2, 
Operating 
Conditions 
and 
Section 
4.5.1, 


AC 
Test 
Conditions.) 
(Continued) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


RELATIVE 
OUTPUT 
TIMINGS(9,7) 


TAVSHl 
A31:2 
Valid 
to ADS Rising 
T-4 
T+4 
ns 


TAVSH2 
BE3:0, 
W/R, SOP, DIG, 


DMA, 
DAGK3:0 
Valid 
to ADS Rising 
T-6 
T+6 
ns 


TAVEL1 
A31:2 
Valid 
to DEN 
Falling 
T-4 
T+4 
ns 


TAVEl2 
BE3:0, W/R, SOP, INST, 


DMA, 
DACK3:0 
Valid 
to DEN 
Falling 
T-6 
T+6 
ns 


TNLOV 
WAIT 
Falling 
to Output 
Data Valid 
±6 
ns 


TOVNH 
Output 
Data Valid 
to WAIT 
Rising 
N"T-6 
N"T+6 
ns 
(4) 


TNLNH 
WAIT 
Falling 
to WAIT 
Rising 
N"T±4 
ns 
(4) 


TNHOX 
Output 
Data 
Hold 
after WAIT 
Rising 
(N + 
1)' 
T - 
6 
(N + 
1)' 
T + 
6 
ns 
(5) 


TEHTV 
DT IR Hold 
after 
DEN 
High 
T/2 - 6 
00 
ns 
(6) 


TTVEL 
DT IR Valid 
to DEN 
Falling 
T/2 - 4 
T/2 + 
4 
ns 
(7) 


RELATIVE 
INPUT 
TIMINGS(7) 


TIS5 
RESET 
Input Setup 
(2x Clock 
Mode) 
6 
ns 
(14) 


TIH5 
RESET 
Input 
Hold 
(2x Clock 
Mode) 
5 
ns 
(14) 


TIS6 
DREQ3:0 
Input 
Setup 
12 
ns 
(8) 


TIH6 
DREQ3:0 
Input 
Hold 
7 
ns 
(8) 


TIS7 
XINT7:0, 
NMllnput 
Setup 
7 
ns 
(8) 


TIH7 
XINT7:0, 
NMI 
Input 
Hold 
3 
ns 
(B) 


TISB 
RESET 
Input 
Setup 
(1 x Clock 
Mode) 
3 
ns 
(15) 


TIHB 
RESET 
Input 
Hold (tx Clock 
Mode) 
T/4 + 
1 
ns 
(15) 


NOTES: 
(1) See Section 
4.5.2, AC Timing 
Waveforms 
for waveforms 
and definitions. 
(2) See Figure 22 for capacitive 
derating 
information 
for output 
delays 
and hold times. 
(3) See Figure 23 for capacitive 
derating 
information 
for rise and fall times. 


(4) Where 
N is the number 
of NRAD, NRDD, NWAD, ~DD 
wait states 
that are programmed 
in the Bus Controller 
Region 
Table. 
When 
there 
are no wait states 
in an access, 
WAIT 
never goes active. 


(5) N = Number 
of wait states 
inserted 
with READY. 
(6) Output 
Data and/or 
OT/R 
may be driven 
indefinitely 
following 
a cycle 
if there 
is no subsequent 
bus activity. 
(7) See Notes 
1, 2 and 3. 


(B) Since 
asynchronous 
inputs 
are synchronized 
internally 
by the B0960CF 
they 
have 
no required 
setup 
or hold 
times 
in 
order 
to be recognized 
and 
for proper 
operation. 
However, 
to guarantee 
recognition 
of the 
input 
at a particular 
edge 
of 
PCLK2:1 
the setup 
times 
shown 
must be met. Asynchronous 
inputs 
must be active 
for at least 
two consecutive 
PCLK2:1 
rising edges 
to be seen by the processor. 
(9) These 
specifications 
are guaranteed 
by the processor. 


(10) These 
specifications 
must be met by the system 
for proper 
operation 
of the processor. 


(11) This timing 
is dependent 
upon the loading 
of PCLK2:1. 
Use the derating 
curves 
of Section 
4.5.3 to adjust 
the timing 
for 
PCLK2:1 
loading. 
(12) In the l-x 
input clock 
mode, the maximum 
input clock 
period 
is limited 
to 125 ns while the processor 
is operating. 
When 
the processor 
is in reset, the input clock 
may stop even in l-x 
mode. 


(13) When 
in the t-x 
input clock 
mode, 
these 
specifications 
assume 
a stable 
input clock 
with a period 
variation 
of less than 
± 0.1 % between 
adjacent 
cycles. 


(14) In 2x clock 
mode, 
RESET 
is an asynchronous 
input which 
has no required 
setup 
and hold time for proper 
operation. 


However, 
to guarantee 
the device 
exits reset synchronized 
to a particular 
clock 
edge, the RESET 
pin must meet 
setup 
and 
hold times 
to the falling 
edge of the CLKIN. 
(See Figure 2Ba.) 
(15) In lx 
clock 
mode, 
RESET 
is an asynchronous 
input which 
has no required 
setup 
and hold time for proper 
operation. 


However, 
to guarantee 
the device 
exits reset 
synchronized 
to a particular 
clock 
edge, 
the RESET 
pin must be deasserted 
while CLKIN 
is high and meet setup and hold times 
to the rising edge of the CLKIN. 
(See Figure 28b.) 
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AC Characteristics 
- 
80960CF-25 
(80960CF-25 
only, under the conditions 
described 
in Section 
4.2, Operating 
Conditions 
and Section 
4.5.1, 
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Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK(10) 


TF 
CLKIN Frequency 
0 
50 
MHz 
(1) 


Tc 
CLKIN Period 
In 1-x Mode (fCLK1X> 
40 
125 
ns 
(1,12) 
In 2-x Mode (fCLK2xl 
20 
00 
ns 
(1) 


Tcs 
CLKIN Period Stability 
In 1-x Mode (fCLK1X> 
±0.1% 
A 
(1,13) 


TCH 
CLKIN High Time 
In t-x Mode (fCLK1X> 
8 
62.5 
ns 
(1,12) 
In 2-x Mode (fCLK2x) 
8 
00 
ns 
(1) 


TCL 
CLKIN Low Time 
In 1-x Mode (fCLK1X> 
8 
62.5 
ns 
(1,12) 
In 2-x Mode (fCLK2xl 
8 
00 
ns 
(1) 


TCR 
CLKIN Rise Time 
0 
6 
ns 
(1) 


TCF 
CLKIN Fall Time 
0 
6 
ns 
(1) 


OUTPUT CLOCKS(9) 


Tcp 
CLKIN to PCLK2:1 Delay 
In t-x Mode (fCLK1X> 
-2 
2 
ns 
(1,3,13,14) 
In 2-x Mode (fCLK2X> 
2 
25 
ns 
(1,3) 


T 
PCLK2:1 Period 
In t-x Mode (fCLK1X> 
Tc 
ns 
(1,13) 
In 2-x Mode (fCLK2xl 
2Tc 
ns 
(1,3) 


TpH 
PCLK2:1 High Time 
(T/2) - 
3 
T/2 
ns 
(1,13) 


TpL 
PCLK2:1 Low Time 
(T/2) - 
3 
T/2 
ns 
(1,13) 


TpR 
PCLK2:1 Rise Time 
1 
4 
ns 
(1,3) 


TpF 
PCLK2:1 Fall Time 
1 
4 
ns 
(1,3) 


SYNCHRONOUS OUTPUTS(10) 


Tov 
Output Valid Delay, Output Hold 
(6,11) 
TOH 
TOV1' TOH1 
A31:2 
3 
16 
ns 
TOV2' TOH2 
SE3:0 
3 
18 
ns 
TOV3,TOH3 
.Ai5S 
6 
20 
ns 
TOV4' TOH4 
W/R 
3 
20 
ns 
Tovs. TOHS 
D/C,SOP,DMA 
4 
18 
ns 
TOV6,TOH6 
SLAST, WAIT 
5 
18 
ns 
TOV?,TOH? 
DEN 
3 
18 
ns 
TOV8,TOH8 
HOLDA, SREQ 
4 
18 
ns 
TOV9,TOH9 
LOCi< 
4 
18 
ns 
ToVlO, TOH10 
DACK3:0 
4 
20 
ns 
TOVll , TOH11 
031:0 
3 
18 
ns 
TOV12'TOH12 
DT/R 
T/2 + 3 
T/2 + 16 
ns 
TOV13,TOH13 
FAIL 
2 
16 
ns 
TOV14'TOH14 
EOP3:0/TC3:0 
3 
20 
ns 
(6,11) 


TOF 
Output Float for all outputs 
3 
22 
ns 
(6) 


SYNCHRONOUS INPUTS(10) 


TIS 
Input Setup 
TISl 
031:0 
5 
ns 
(1.11) 
TIS2 
SOFF 
19 
ns 
(1.11) 
TIS3 
STERM/READY 
9 
ns 
(1.11) 
TIS4 
HOLD 
9 
ns 
(1.11) 


TIH 
Input Hold 
TIHl 
D31:0 
5 
ns 
(1.11) 
TIH2 
SOFF 
7 
ns 
(1.11) 
TIH3 
STERM/READY 
2 
ns 
(1.11) 
TIH4 
HOLD 
5 
ns 
(1.11) 


3-291 


• 


80960CF-33, -25, -16 
intel~ 


AC Characteristics 
- 
80960CF-25 


(809S0CF-25 
only, under the conditions 
described 
in Section 
4.2, Operating 
Conditions 
and Section 
4.5.1, 
AC Test 
Conditions.) 
(Continued) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


RELATIVE OUTPUT TIMINGS(9,7) 


TAVSHl 
A31:2 Valid to ADS Rising 
T-4 
T+4 
ns 


TAVSH2 
BE3:0, WIR, SOP, DIG, 
DMA, DACK3:0 Valid to ADS Rising 
T-S 
T+S 
ns 


TAVEL1 
A31:2 Valid to DEN Falling 
T-4 
T+4 
ns 


TAVEL2 
BE3:0, W/R, 
SOP, INST, 
DMA, DACK3:0 Valid to DEN Falling 
T-S 
T+S 
ns 


TNLOV 
WAIT Falling to Output Data Valid 
±S 
ns 


TOVNH 
Output Data Valid to WAIT Rising 
N°T - 
S 
N*T+6 
ns 
(4) 


TNLNH 
WAIT Falling to WAIT Rising 
N*T±4 
ns 
(4) 


TNHOX 
Output Data Hold after WAIT Rising 
(N + 1) ° T - 
6 
(N + 1) ° T + S 
ns 
(5) 


TEHTV 
DTIR Hold after DEN High 
T/2 - 6 
00 
ns 
(S) 


TTVEL 
DTIR Valid to DEN Falling 
T/2 - 4 
T/2 + 4 
ns 
(7) 


RELATIVE INPUT TIMINGS(7) 


TIS5 
RESET Input Setup (2x Clock Mode 
8 
ns 
(14) 


TIH5 
RESET Input Hold (2x Clock Mode) 
7 
ns 
(14) 


TIS6 
DREQ3:0 Input Setup 
14 
ns 
(8) 


TIH6 
DREQ3:0 Input Hold 
g 
ns 
(8) 


TIS7 
XINT7:0, NMllnput 
Setup 
g 
ns 
(8) 


TIH7 
XINT7:0, NMllnput 
Hold 
5 
ns 
(8) 


TIS8 
RESET Input Setup (lx Clock Mode) 
3 
ns 
(15) 


TIH8 
RESET Input Hold (lx Clock Mode) 
T/4 + 1 
ns 
(15) 


NOTES: 
(1) See Section 4.5.2, AC Timing Waveforms for waveforms and definitions. 
(2) See Figure 22 for capacitive derating information for output delays and hold times. 
(3) See Figure 23 for capacitive derating information for rise and fall times. 
(4) Where N is the number of NRAD,NRDD,NWAD,or NWDDwait states that are programmed in the Bus Controller Region 
Table. When there are no wait states in an access, WAIT never goes active. 
(5) N = Number of wait states inserted with READY. 
(6) Output Data and/or DT/R may be driven indefinitely following a cycle if there is no subsequent bus activity. 
(7) See Notes 1, 2 and 3. 
(8) Since asynchronous inputs are synchronized internally by the 80960CF they have no required setup or hold times in 
order to be recognized and for proper operation. However, to guarantee recognition of the input at a particular edge of 
PCLK2:1 the setup times shown must be met. Asynchronous inputs must be active for at least two consecutive PCLK2:1 
rising edges to be seen by the processor. 
(9) These specifications are guaranteed by the processor. 
(10) These specifications must be met by the system for proper operation of the processor. 
(11) This timing is dependent upon the loading of PCLK2:1. Use the derating curves of Section 4.5.3 to adjust the timing for 
PCLK2:1 loading. 
(12) In the t-x input clock mode, the maximum input clock period is limited to 125 ns while the processor is operating. When 
the processor is in reset, the input clock may stop even in l-x mode. 
(13) When in the l-x input clock mode, these specifications assume a stable input clock with a period variation of less than 
±0.1% between adjacent cycles. 
(14) In 2x clock mode, RESET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must meet setup and 
hold times to the falling edge of the CLKIN. (See Figure 28a.) 
(15) In lx clock mode, RESET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must be deasserted 
while CLKIN is high and meet setup and hold times to the rising edge of the CLKIN. (See Figure 28b.) 
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Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


INPUT CLOCK(10) 


TF 
CLKIN Frequency 
0 
32 
MHz 
(1) 


Tc 
CLKIN Period 
In t-x Mode (fCLK1X> 
62.5 
125 
ns 
(1,12) 
In z-x Mode (fCLK2JJ 
31.25 
00 
ns 
(1) 


Tcs 
QLKIN Period Stability 
In 1-x Mode (fCLK1X> 
±0.1% 
t. 
(1,13) 


TCH 
CLKIN High Time 
In t-x Mode (fCLK1X> 
10 
62.5 
ns 
(1,12) 
In z-x Mode (fCLK2JJ 
10 
00 
ns 
(1) 


TCL 
CLKIN Low Time 
In t-x Mode (fCLK1X> 
10 
62.5 
ns 
(1,12) 
In 2-x Mode (fCLK2x) 
10 
00 
ns 
(1) 


TCR 
CLKIN Rise Time 
0 
6 
ns 
(1) 


TCF 
CLKIN Fall Time 
0 
6 
ns 
(1) 


OUTPUT CLOCKS(9) 


Tcp 
CLKIN to PCLK2:1 Delay 
In t-x Mode (fCLK1x) 
-2 
2 
ns 
(1,3,13,14) 


In z-x Mode (fCLK2X> 
2 
25 
ns 
(1,3) 


T 
PCLK2:1 Period 
In t-x Mode (fCLK1X> 
Tc 
ns 
(1,13) 
In z-x Mode (fCLK2X> 
2Tc 
ns 
(1,3) 


TpH 
PCLK2:1 High Time 
(T/2) - 
4 
T/2 
ns 
(1,13) 


TpL 
PCLK2:1 Low Time 
(T/2) - 
4 
T/2 
ns 
(1,13) 


TpR 
PCLK2:1 Rise Time 
1 
4 
ns 
(1,3) 


TpF 
PCLK2:1 Fall Time 
1 
4 
ns 
(1,3) 


SYNCHRONOUS OUTPUTS(10) 


Tov 
Output Valid Delay, Output Hold 
(6,11) 
TOH 
TOV1' TOH1 
A31:2, 
3 
18 
ns 
TOV2' TOH2 
BE3:0 
3 
20 
ns 
TOV3,TOH3 
ADS 
6 
22 
ns 
TOV4' TOH4 
W/R 
3 
22 
ns 
TOV5,TOH5 
DIG, sop, DMA 
4 
20 
ns 
TOV6,TOH6 
BLAST, WAIT 
5 
20 
ns 
TOV7,TOH7 
DEN 
3 
20 
ns 
TOV8,TOH8 
HOLDA,BREQ 
4 
20 
ns 
TOV9,TOH9 
LOCK 
4 
20 
ns 
TOV1O,TOH10 
DACK3:0 
4 
22 
ns 
TOV11, TOH11 
D31:0 
3 
20 
ns 
TOV12' TOH12 
DTlR 
T/2 + 3 
T/2 + 18 
ns 
TOV13,TOH13 
FAIL 
2 
18 
ns 
TOV14'TOH14 
EOP3:0/TC3:0 
3 
22 
ns 
(6,11) 


TOF 
Output Float for all outputs 
3 
22 
ns 
(6) 


SYNCHRONOUS INPUTS(10) 


TIS 
Input Setup 
TIS1 
D31:0 
5 
ns 
(1,11) 
TIS2 
BOFF 
21 
ns 
(1,11) 
TIS3 
BTERM/READY 
9 
ns 
(1,11) 
TIS4 
HOLD 
9 
ns 
(1,11) 


TIH 
Input Hold 
TIH1 
031:0 
5 
ns 
(1,11) 
TIH2 
BOFF 
7 
ns 
(1,11) 
TIH3 
BTERM/READY 
2 
ns 
(1,11) 
TIH4 
HOLD 
5 
ns 
(1,11) 
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Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


RELATIVE 
OUTPUT TIMINGS(9,7) 


TAVSH1 
A31:2 Valid to ADS Rising 
T-4 
T+4 
ns 


TAVSH2 
BE3:0, W/R, 
SUP, D/C, 
DMA, DACK3:0 Valid to ADS Rising 
T-6 
T+6 
ns 


TAVEL1 
A31:2 Valid to DEN Falling 
T-6 
T+6 
ns 


TAVEL2 
BE3:0, W/R, 
SUP, INST, 
DMA, DACK3:0 Valid to DEN Falling 
T-6 
T+6 
ns 


TNLOV 
WAIT Falling to Output Data Valid 
±6 
ns 


TDVNH 
Output Data Valid to WAIT Rising 
N°T - 
6 
N°T 
+ 6 
ns 
(4) 


TNLNH 
WAIT Falling to WAIT Rising 
N°T 
± 4 
ns 
(4) 


TNHOX 
Output Data Hold after WAIT Rising 
(N + 1) ° T - 
6 
(N + 1) ° T + 6 
ns 
(5) 


TEHTV 
DT IR Hold after DEN High 
T/2 
- 
6 
ex> 
ns 
(6) 


TTVEL 
DT IR Valid to DEN Falling 
T/2 
- 
4 
T/2 
+ 4 
ns 
(7) 


RELATIVE 
INPUT TIMINGS(7) 


TIS5 
RESET Input Setup (2x Clock Mode) 
10 
ns 
(14) 


TIH5 
RESET Input Hold (2x Clock Mode) 
9 
ns 
(14) 


TIS6 
DRE03:0 
Input Setup 
16 
ns 
(8) 


TIH6 
DRE03:0 
Input Hold 
11 
ns 
(8) 


TIS7 
XINT7:0, 
NMI Input Setup 
9 
ns 
(8) 


TIH7 
XINT7:0, 
NMllnput 
Hold 
5 
ns 
(8) 


TIS8 
RESET Input Setup (1x Clock Mode) 
3 
ns 
(15) 


TIH8 
RESET Input Hold (1x Clock Mode) 
T/4 
+ 1 
ns 
(15) 


NOTES: 
(1) See Section 4.5.2, AC Timing Waveforms for waveforms and definitions. 
(2) See Figure 22 for capacitive derating information for output delays and hold times. 
(3) See Figure 23 for capacitive derating information for rise and fall times. 
(4) Where N is the number of NRAD,NRDD,NWAD,or NWDDwait states that are programmed in the Bus Controller Region 
Table. When there are no wait states in an access, WAIT never goes active. 
(5) N = Number of wait state inserted with READY. 
(6) Output Data and/or DT/R may be driven indefinitely following a cycle if there is no subsequent bus activity. 
(7) See Notes 1, 2 and 3. 
(8) Since asynchronous inputs are synchronized internally by the 80960CF they have no required setup or hold times in 
order to be recognized and for proper operation. However, to guarantee recognition of the input at a particular edge of 
PCLK2:1 the setup times shown must be met. Asynchronous inputs must be active for at least two consecutive PCLK2:1 
rising edges to be seen by the processor. 
(9) These specifications are guaranteed by the processor. 
(10) These specifications must be met by the system for proper operation of the processor. 
(11) This timing is dependent upon the loading of PCLK2:1. Use the derating curves of Figure 22 to adjust the timing for 
PCLK2:1 loading. 
(12) In the t-x input clock mode, the maximum input clock period is limited to 125 ns while the processor is operating. When 
the processor is in reset, the input clock may stop even in 1-x mode. 
(13) When in the t-x input clock mode, these specifications assume a stable input clock with a period variation of less than 
± 0.1% between adjacent cycles. 
(14) In 2x clock mode, RESET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must meet setup and 
hold times to the falling ed~e of the CLKIN. (See Figure 28a.) 
(15) In 1x clock mode, RE ET is an asynchronous input which has no required setup and hold time for proper operation. 
However, to guarantee the device exits reset synchronized to a particular clock edge, the RESET pin must be deasserted 
while CLKIN is high and meet setup and hold times to the rising edge of the CLKIN. (See Figure 28b.) 
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4.5.1. 
AC TEST 
CONDITIONS 
The AC Specifications in Section 4.5 are tested with 
the 50 pf load shown in Figure 15. See Figure 22 to 
see how timings vary with load capacitance. 


Specifications are measured at the 1.5V crossing 
point, unless otherwise indicated. Input waveforms 
are assumed to have a rise-and-tall time of s 2 ns 
from 0.8V to 2.0V. See Section 
4.5.2, AC Timing 
Waveforms 
for AC spec definitions, test points and 
illustrations. 
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Figure 15. AC Test Load 
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Figure 16b. CLKIN Waveform 
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Figure 17. Output Delay and Float Waveform 
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Figure 18a.lnput 
Setup and Hold Waveform 


- 
OUTPUT 
DELAY - 
The maximum oulpul delay is referred to 


as the 0uIpuI Valid Delay (Tov). 
The minimum oulpul delay is 


referred to as the Output Hold (TOH)' 


- 
OUTPUT FLOAT DELAY - 
The output float oond~ion 
OCCUIS 


when the maximum 
output current becomes less than ILO in magnitude. 


- 
INPUT SETUP AND HOLD - 
The input setup and hold requirements 


specify the sampling 
window during which synchronous 
inputs must be 


stable for oorrect processor 
operation. 
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Figure 18b. NMI, XINT7:0 
Input Setup and Hold Waveform 
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Figure 19. Hold Acknowledge 
Timings 
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Figure 20. Bus Back-Off 
(BOFF) Timings 
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Figure 21. Relative Timings Waveforms 
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Figure 22. Output Delay or Hold vs Load Capacitance 
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at Highest Operating 
Temperature 
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Figure 24. Icc vs Frequency 
and Temperature 
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5.0 
RESET, BACKOFF AND HOLD 
ACKNOWLEDGE 


The following 
table lists the condition 
of each proc- 
essor output 
pin while RESET is asserted 
(Iow). 


Table 
13. Reset 
Conditions 


Pins 
State 
During 
Reset 
(HOLDA 
Inactlve)1 


A31:A2 
Floating 


031:00 
Floating 


BE3:0 
Driven high (Inactive) 


W/R 
Driven low (Read) 


ADS 
Driven high (Inactive) 


WAIT 
Driven high (Inactive) 


BLAST 
Driven low (Active) 


DT/R 
Driven low (Receive) 


DEN 
Driven high (Inactive) 


LOCK 
Driven high (Inactive) 


BREQ 
Driven low (Inactive) 


DIG 
Floating 


DMA 
Floating 


SUP 
Floating 


FAIL 
Driven low (Active) 


DACK3 
Driven high (Inactive) 


DACK2 
Driven high (Inactive) 


DACK1 
Driven high (Inactive) 


DACKO 
Driven high (Inactive) 


EOP/TC3 
Floating (set to input mode) 


Eop/fc2 
Floating (set to input mode) 


EOP/TC1 
Floating (set to input mode) 


EOP/TCO 
Floating (set to input mode) 


NOTE: 
(1) With 
regard 
to bus output 
pin state 
only, the 
Hold Ac- 


knowledge 
state takes 
precedence 
over the reset state. Al- 
though 
asserting 
the "RESl:f 
pin 
will 
internally 
reset 
the 
processor, 
the 
processor's 
bus output 
pins 
will 
not 
enter 
the reset state if it has granted 
Hold Acknowledge 
to a pre- 
vious 
HOLD 
request 
(HOLDA 
is active). 
Furthermore, 
the 
processor 
will 
grant 
new 
HOLD 
requests 
and 
enter 
the 
Hold Acknowledge 
state even while 
in reset. 


For example, 
if HOLDA 
is not active 
and the processor 
is 


in the reset 
state, 
then 
HOLD 
is asserted, 
the processor's 
bus 
pins 
will 
enter 
the 
Hold 
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The following 
table lists the condition 
of each proc- 
essor output 
pin while HOLDA 
is asserted 
(Iow). 


Table 
14. Hold Acknowledge 
and Backoff 
Conditions 


Pins 
State 
During 
HOLDA 


A31:A2 
Floating 


031:00 
Floating 


BE3:0 
Floating 


W/F{ 
Floating 


ADS 
Floating 


WAIT 
Floating 


BLAST 
Floating 


DT/R 
Floating 


DEN 
Floating 


LOCK 
Floating 


BREQ 
Driven (high or low) 


D/C 
Floating 


DMA 
Floating 


SUP 
Floating 


FAIL 
Driven high (Inactive) 


DACK3 
Driven high (Inactive) 


DACK2 
Driven high (Inactive) 


DACK1 
Driven high (Inactive) 


DACKO 
Driven high (Inactive) 


EOP/TC3 
Driven if output 


EOP/TC2 
Driven if output 


EOP/TC1 
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Driven if output 
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Figure 25. Cold Reset Waveform 
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Figure 28b. Clock Synchronization 
in the 1x Clock Mode 
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Figure 30. Non-Burst, Non-Plpellned Read Request with Wait States 
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Figure 32. Burst, Non-Plpellned Read Request without Wait States, 32-Blt Bus 
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Figure 33. Burst, Non-Pipelined Read Request with Wait States, 32-Bit Bus 
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Figure 35_Burst, Non-Plpellned Write Request with Wait States, 32-Blt Bus 
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Figure 36. Burst, Non-Plpellned Read Request with Wait States, 16-Blt Bus 
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Figure 37. Burst, Non-Plpelined Read Request with Wait States, 8-Blt Bus 
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Figure 38. Non-Burst, Plpellned Read Request without Wait States, 32-Blt Bus 


80960CF-33, -25, -16 


Region Table Entry 


I 
Syt. 
E 
SUI 
Pip•• 
Elt.rna' 
t 
Nwdd 
Nw.d 
Nld. 
Nrdd 
Nrld 
Burlt 
: 
Ord.r i 
Width 
lining 
R.ady 
: 
Control 


bill 31-23 
b1122 
'.121 
bi1120-li 
btta 18-17 
bit,'6·12 
bits 11·10 
•••• 9-8 
bitl7·3 
bit 2 
bII' 
bit 0 
0 
X 
0 
X 
X 
X 
X 
X 
1 
On 
X 
Dle.llled 
"..." 
x 
" 


xx 
xx 
xxxxx 
xx 
" 
00001 
, 
, 
" 
A 


PCLK [ 
m[.[ 
OIl: 


wii [ 


A32 
[ 
IDo 


031:0 
[ 


wm[ 


BLAST 
[ 


OTii 
[ 


DEN [ 


A'o 
0' 


272187-42 


I 


Figure 39. Non-Burst, Plpellned Read Request with Wait States, 32-Blt Bus 
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Figure 41. Burst, Pipelined Read Requests with Wait States, 32-Bit Bus 


3·317 


• 


80960CF-33, -25, -16 


PCLK 
[ 


ADS [ 


ASl :4, SDJ5, [ 
llUA. DIe. 
~M!R£, 
mcR.~/1!IE 


BEl/M 
[ 


031:0 [ 


WAff[ 


BLAST 
[ 


DTIA 
[ 


Region Table Entry 
~ 
l 
Plpe- 
· 
Byte 
Bu. 
Ext.rn.1 
· 
Nwdd 
Nwad 
Nld. 
Nrdd 
Nr.d 
Burat 
· 
Order 
! 
Width 
lining 
A•• dy 
· 
Control 
:: 
0: 
b1e131-23 
bit22 
"121 
bits 20-19 
bttI18-17 
bItIUH2 
*11-10 
bits" 
bits7-3 
bit2 
bill 
toto 
0 
X 
0 
16-811 
X 
X 
X 
1 
2 
On 
X 
En.bled 
0 ___ 
0 . 
0 
01 
.. 
..... 
xx 
01 
00010 
1 
. 
1 


A 
o 
o 


272187-45 


3-318 


.0 
A'o 
0' 
2 
2 


Valid 


A3:2 • 00 or 10 
A3:2 = 01 or 11 


PIpoI;nod .- 
conclude. 


Non-pipeHned 
requests 
begin 


Figure 42_Burst, Plpelined Read Requests with Wait States, 16-Blt Bus 


80960CF-33, -25, -16 


PClK 
[ 


AiiS[ 


~[ 


BEl/AI, 
[ 
BEWAIl 


031:0 
[ 


wm[ 


BLm[ 


OTIR 
[ 


BEN [ 


Region Table Entry 
1 
Iyt. i 
Iu. 
Plp.- 
1,'lrtI.' 
IUral 
i 


II.dd 
II.ad 
IIad. 
IIrdd 
Nrad 
IInlnll 
••••cI' 
Ord.r 
! 
Wldlft 
c."••.• 
t 
e 
• 
31-23 
,*22 
•••• 
bill 20-1' 
bitl11-17 
biII'I-,2 
bitl11-10 
••••• 
H 
bits 7·3 
'*• 
'*, 
""0 
0 
x 
0 
8-ln 
x 
x 
X 
I 
2 
On 
X 
en.It ••d 
"..." . 
" 
"" 


xx 
..... 
xx 
'" 
"""'" 
, 
. 
, 


A 
D 
2 
D' 


272187-46 


2 
D 
D 


V.11d 


A3:2-00,01,IO,11 


Figure 43_Burst, Plpellned Read Requests with Wait States, 8-Blt Bus 


3-319 


• 


00 
D3 
031:0 


I 
I 
I 
I 
I 
I 
I 
--~---~--~-~-(;)- 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


272187-47 


3-320 


Figure 44. Using External 
READY 


80960CF-33, 
-25, -16 


~.~ 
~J~::~::::~~::::~:::::::::~::V:~:D~~::::~:::::::::~::::~~::::~~::~ 
oc.~ 
~ 


00 
01 
A3.A2 


WAIT 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


03U) 
- 
- !---~---~ --!---~--~ --~---~--J;\- --~-- 


I 
I~ 
I 
I~ 
I 
I 
~ 
I 
tit 
I 
I 
I 
I 
I 
I 
t 


f 
I 
I 
I 
I 
I 
I 
I 
I 
I 
272187-48 


NOTE: 
READY 
adds memory 
access 
time to data transfers, 
whether 
or not the bus access 
is a burst access. 
BTERM 
interrupts 


a bus access, 
whether 
or not the bus access 
has more data transfers 
pending. 
Either the READY 
signal or the BTERM 


signal will terminate 
a bus access 
if the signal is asserted 
during the last (or only) data transfer 
of the bus access. 


Figure 45. Terminating 
a Burst with BTERM 
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Figure 46. BOFF Functional 
Timing 
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Figure 47. HOLD Functional 
Timing 
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NOTES: 
1. Case 
1: DREQ 
must 
deassert 
before 
DACK 
deasserts. 
Applications 
are 
Fly-by 
and 
some 
packing 
and 
unpacking 
modes, 
in which 
loads are followed 
by loads, or stores 
are followed 
by stores. 
2. Case 2: DREQ must be deasserted 
by the second 
clock 
(rising edge) 
after DACK is driven 
high. Applications 
are non 
fly-by transfers 
and adjacent 
load-stores 
or store-loads. 
3. DACKx 
is asserted 
for the duration 
of a DMA bus request. 
The request 
may consist 
of multiple 
bus accesses 
(defined 
by A'DS and BLAST. 
Refer to User's 
Manual 
for "access", 
"request" 
definition. 
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Figure 
48_ DREQ and DACK Functional 
Timing 
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NOTE: 
EOP has the same AC TimingBElquirements 
as DREQ to prevent 
unwanted 
DMA requests. 


EQi5 is NOT edge triggered. 
EQi5 must be held for a minimum 
of 2 clock 
cycles 
then EQi5 must be deasserted 
within 
15 clock 
cycles. 


Figure 
49_ EOP Functional 
Timing 
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NOTE: 
Terminal Count becomes active during the last bus request of a buffer transfer. If the last LOAD/STORE bus request is 
executed as multiple bus accesses, the TC will be active for the entire bus request. Refer to the User's Manual for 
further information. 


Figure 50. Terminal 
Count Functional 
Timing 
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Figure 51. FAIL Functional 
Timing 
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• 
Direct Generic 
Printer Engine Interface 
to TEC, Canon, Ricoh and Okldata 
Printer Engines 


• 
Programmable 
ROM/EPROM 
Interface 
- 
Direct Interface 
to Two Banks 
- 
Expandable 
to Eight Banks 
- 
Programmable 
ROM Size and Wait 
States 


• 
Programmable 
System 
Timer 


- 
Chip-Select, 
Access Time, Recovery 
Time 
- 
Wait State Control 
for Eight External 
Devices 


• 
Burst Interface 
Support 
for 1960 
KA/KB 
Bus 


• 
Automatic 
Data Conversion 
from 
16-blt 
Font Cartridge 
to 32-blt 1960 Embedded 
Processor 
Format 


• 
Low-Cost 
132-Lead 
Plastic Quad Flat 
Pack (PQFP) 


Intel's 82961 KA page printer controller 
is the first member of a new family of page printer companion 
peripher- 
als that address 
the need for high integration 
and cost reduction. 
It is designed 
to interface 
Intel's 
80960K- 
series of embedded 
processors 
to a variety of non-impact 
or laser printer mechanisms. 
The 82961 KA control- 
ler contains 
complete 
DRAM, 
ROM and I/O controllers, 
the associated 
logic required 
to control 
virtually 
all 
non-impact 
printer mechanisms, 
a programmable 
wait state generator 
and programmable 
chip select 
genera- 
tion logic. The 82961 KA controfler=coupled 
with an 80960Kx 
plus DRAM, 
ROM and the appropriate 
I/O- 


forms the nucleus 
for a cost-effective 
non-impact 
printer 
controller. 


Designs 
that 
employ 
the 82961 KA controller 
provide 
substantial 
system 
performance 
improvements. 
The 
82961 KA device 
decouples 
DRAM and video output 
subsystems 
from the synchronous 
80960KA/KB 
bus to 
achieve 
increased 
performance. 
The 82961 KA device's 
unique 
programmable 
video 
output 
controller 
man- 
ages 
virtually 
every 
aspect 
of memory 
address 
generation, 
timing 
and control 
once 
the 
device 
begins 
to 
produce 
the video output 
signal. 


1. The 82961 KA i960 Page 
Printer 
Controller 
is based 
on the single-chip 
controller 
architecture 
created 
by Peeriess 
Systems 
Corp. 


2. Throughout 
this data sheet, 
80960Kx 
refers 
to the 80960KA 
and KB processors. 
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1.0 
PURPOSE 


This data sheet describes 
the 82961 KA page printer 
controller. 
It provides 
the information 
required to be- 
gin designing 
with the 82961 KA printer 
controller. 
It 
contains 
functional 
and physical 
descriptions, 
elec- 
trical 
characteristics 
and 
specifications, 
absolute 
maximum 
ratings and package 
and pin definitions. 


2.0 
82961KA FUNCTIONAL 
OVERVIEW 


The 82961 KA integrates 
onto 
a single 
chip all the 
"glue" 
or support 
logic required 
in a 80960Kx 
proc- 
essor 
design 
with 
a high performance 
interface 
to 
non-impact 
printers. 
82961 KA functional 
blocks 
are: 


• 
80960K-Series 
L-Bus Interface 


• 
ROM Interface 


• 
DRAM 
Interface 


• 
110 Interface 


• 
Printer Video 
Interface 


• 
Printer Communication 
Interface 


• 
System 
Timer 


The 82961 KA is the first member 
of a new family of 
page printer companion 
peripherals. 
It is intended 
to 
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operate 
in the non-impact 
printer environment. 
How- 
ever, 
many 
of its features 
make 
it extremely 
well- 
suited for other applications. 
The 82961 KA provides 
a direct 
interface 
between 
the liltel 
80960Kx 
micro- 
processor 
and 
system 
memory, 
communication 
channels 
and printer 
engine. 


The 82961 KA is designed 
for flexibility, 
ease of use 
and optimum 
performance 
while 
employing 
a mini- 
mal number 
of external 
components. 
A page print- 
er's 
host 
communication 
environment 
can 
range 
from 
a simple 
serial or parallel 
port to a complete 
Ethernet 
implementation. 
To support this vast range, 
the 82961 KA is not limited to any specific 
communi- 
cation mechanism. 
Instead, the 82961 KA generates 
chip-selects 
and control 
signals 
that allow the user 
to easily 
connect 
standard 
communication 
devices 
to the 80960Kx. 
Additionally, 
the 82961 KA can be 
used 
in other 
operating 
environments 
to enhance 
Intel 80960Kx 
microprocessors 
by providing 
memory 
interfaces 
and peripheral 
timing controls. 


In the most 
simplistic 
implementations, 
the system El 
consists 
of clock and reset generation, 
an 80960Kx 
microprocessor, 
peripheral 
communications 
with in- 
terface 
devices, 
the 82961 KA, ROM devices, 
DRAM 
devices with series damping 
resistors, 
minimal 
print- 
er interface 
logic, 
along 
with the 
usual 
oscillators, 
connectors 
and PC board. 
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3.0 
PACKAGE INFORMATION 


Figure 1. 82961KA Block Diagram 


Table 1. Pin Description Nomenclature 


3.1 PackageIntroduction 


This section describes the 82961KA pins, pinouts, 
and thermal characteristics in the 132-pin Plastic 
Quad Flat Pack (PQFP). For complete package 
specifications and information, refer to the Intel 
Packaging Specification (Order No. 231369). 


3.2 Pin Descriptions 


82961KA pins are described in this section. Table 1 
presents the legend for interpreting pin descriptions 
contained in the following tables. Table 2 defines 
pins associated with the L-Bus. Tables 3, 4 and 5 
define pins associated with memory and 1/0 inter- 
face signals. Table 6 defines pins associated with 
the printer video interface. Table 7 defines printer 
communications interface signals. Table 8 defines 
pins associated with basic processor configuration 
and control. 
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Symbol(l) 
Description 
I 
Pin is input only 
0 
Pin is output only 


1/0 
Pincan be either an input or output 


L 
Signal is active LOW 


L(P) 
Signal is active LOW and the signal's 
sense is programmable 


H 
Signal is active HIGH 


TS 
Signal is tristate 


- OD 
Signal is open-drain output 


ST 
Signal is Schmitt trigger input 


NOTE: 
1. Bold 
indicates 
a pin; non-bold 
indicates 
a signal. 
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Table 2. L-Bus Interface 
Signals 


Name 
Type 
Description 


LAD31:0 
110 
LOCAL ADDRESS/DATA 
BUS carries 32-bit physical address and data to and from 
H 
memory, 1/0 devices or internal registers. 
During an address cycle, bits 31:2 contain 


TS 
a physical word address and bits 1:0 contain a burst size. During a data cycle, bits 
31 :0 contain 
read or write data. Burst size is an encoded 
number where 00 
represents 
one word and 11 represents 
four words. 


ALE 
I 
ADDRESS 
LATCH ENABLE indicates 
the transfer 
of a physical address. ALE is 
L 
asserted 
during an address cycle and deasserted 
before a data cycle begins. 


ADS 
I 
ADDRESSIDATA 
STATUS 
indicates 
an address state. It is asserted 
during an 
L 
address cycle and deasserted 
during a following 
data cycle. For a burst transaction, 


ADS is asserted 
again every data cycle where READY was asserted 
in the previous 
cycle. 


W/R 
I 
WRITE/READ 
specifies, 
during an address cycle, whether 
the operation 
is a write or 
a read. Valid during entire memory, register or 1/0 operation. 


DEN 
I 
OAT A ENABLE indicates 
a transfer 
on the LAD lines. Asserted 
during all data cycles. 


L 


READY 
0 
READY indicates 
data on the LAD signals can be sampled 
or removed. 
If READY is 
OD 
not asserted 
during a data transfer, 
the data cycle is extended 
to next cycle by 
inserting 
a wait state and ADS is not asserted 
in the next cycle. 


BE3:0 
I 
BYTE ENABLE: B~nals 
specify data bytes on the bus which take part in the _ 
L 
current data cycle. BEO corresponds 
to LAD7:0. BE3 corresponds 
to LAD31 :24. BE 
signals are pipelined. 
BE signals corresponding 
to the first data transfer 
of a burst are 


asserted 
during address cycle. BE signals corresponding 
to subsequent 
data 
transfers 
of a burst are asserted 
in the data cycle of the current data transfer. 


IRQ 
0 
INTERRUPT 
REQUEST 
indicates 
an interrupt to the 80960Kx 
processor. 
When the 
L 
82961 KA interrupt 
is cleared, the signal goes inactive for two bus cycles before it can 


OD 
be reasserted. 
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Name 
Type 
Description 


MD15:0 
I/O 
MEMORY 
DATA signals contain the two least significant 
data bytes for I/O and 
H 
DRAM memory operations. 
MD7:0 contain the least significant 
byte; MD15:8 
TS 
contain the second 
most significant 
byte. If DRAM is configured 
as 16 bits wide, or 


the I/O channel 
being accessed 
is packed, MD7:0 also contain the third most 
significant 
byte and MD15:8 also contain the most significant 
byte. During a DRAM 


write operation, 
only those bytes with a corresponding 
write enable signal asserted 
are driven by the 82961 KA. 


MAD31:29 
I/O 
MEMORY 
ADDRESS/DATA 
signals are multi-purpose 
depending 
on the device 
H 
that is selected: 
TS 
ROM 
MAD31 is not used. MAD30:39 
signals output an encoded 
address 
range 
select and are driven high or low. 
I/O 
Signals output an encoded 
address 
range selected. 


DRAM 
Signals contain data bits 29-31 
when DRAM is in 32-bit configuration. 


When DRAM is in 16-bit configuration, 
these signals are unused and are 


floated. 


During a DRAM write operation, 
only those bytes with a corresponding 
write enable 


signal asserted 
are driven by the 82961 KA. 


MAD28:16 
I/O 
MEMORY 
ADDRESS/DATA 
signals are multi-purpose 
depending 
on the device 
H 
selected: 


TS 
ROM 
MAD16 is not used. MAD25:17 
output address bits 02-08, 
17, 19. 


MAD28:26 
are not used and are driven high or low. 


I/O 
Signals output address bits 01-08, 
17, 19,21-23. 
DRAM 
Signals contain data bits 16-28 
with DRAM in 32-bit configuration. 
When 


DRAM is in 16-bit configuration, 
signals are unused and are floated. 


During a DRAM write operation, 
only those bytes with a corresponding 
write enable 


signal asserted 
are driven by the 82961 KA. 


MA9:0 
0 
MEMORY 
ADDRESS 
signals are multi-purpose 
depending 
on the device selected: 


H 
ROM 
Signals output address bits 09-16, 
18, 20. 
I/O 
Signals output address bits 09-16, 
18, 20. 
DRAM 
Signals output multiplexed 
address bits 00-09. The number of address 
bits actually multiplexed 
on these signals depends 
on the DRAM bank 
accessed 
and the programmed 
size for that bank. Use MA7:0 for 64 Kbit 


x 16/32 
DRAM; use MA8:0 for 256 Kbit x 16/32 
DRAM; use MA9:0 for 
1 Mbit x 16/32 
DRAM. 


RAS3:0 
0 
ROW ADDRESS 
STROBE 
signals, used for DRAM accesses, 
are asserted 
when 
L 
MA9:0 signals contain a valid row address. 
RASO corresponds 
to the first DRAM 
bank; RAS3 corresponds 
to the fourth DRAM bank. In page mode the RAS signal 
of the accessed 
bank remains asserted 
after completion 
of the memory operation, 


and is deasserted 
for a bank when a page miss occurs. 


CAS3:0 
0 
COLUMN 
ADDRESS 
STROBE 
signals, used for DRAM accesses, 
are asserted 
L 
when MA9:0 signals contain a valid column address. CASO corresponds 
to the first 


DRAM bank; CAS3 corresponds 
to the fourth DRAM bank. Only one of the four 


CAS signals is asserted 
during a memory operation. 


WE3:0 
0 
WRITE 
ENABLE signals, used for DRAM accesses, 
are asserted 
during write 
L 
operations. 


For 32-bit wide DRAM, WEO corresponds 
to the least significant 
byte MD7:0 and 
WE3 corresponds 
to the most significant 
byte MD31 :24. WEO also corresponds 
to 
the 80960Kx 
BEO signals and WE3 corresponds 
to the BE3 signal. Only those WE 
signals with a corresponding 
asserted 
BE signal are asserted. 


For 16-bit wide DRAM, WEO corresponds 
to both the least significant 
byte, and the 
third most Significant byte, which uses MD7:0. WE1 corresponds 
to both the 
second 
most significant 
byte and the most significant 
byte, which uses MD15:8. 


WEO also corresponds 
to BEO and BE2; WE1 corresponds 
to BE1 and BE3. For 
16-bit wide DRAM, DRAM accesses 
are performed 
in two 16-bit accesses; 
WE2 
and WE3 are unused and are deasserted. 


Table 3. Memory 
Interface 
Signals 
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Table 
4. ROM Signals 


Name 
Type 
Description 


ROMCS1:0 
1/0 
ROM CHIP SELECT 
signals indicate an access to one of the eight ROM banks. 
L 
ROMCSO is used for ROM banks 0-3; 
ROMGS1 for ROM banks 4-7. 
While 
1S 
ROMGSO or ROMGS1 is asserted, 
the encoded 
address range select signals 
MAD30:29 
determine 
the particular 
ROM bank to be accessed. 
MAD30:29 
signals 
are guaranteed 
to be valid during the entire time that ROMGS1:0 
is asserted. 


ROMGSO and ROMGS1 are never asserted 
at the same time. These signals are 
valid during the entire memory operation. 
, 


During reset, ROMGSO is used as an input to indicate 
DRAM bus width. If this pin is 


low during reset, DRAM is configured 
as a 16·bit bus. If this pin is high, DRAM is 
configured 
as a 32-bit bus. This pin is either pulled up (for a 32·bit bus) or pulled 
down (for a 16·bit bus) through a 10 KO resistor. 
During reset, ROMGS1 is used as an input to clear ROM bank 0 size field to zero, 
disabling 
it. This allows external 
ROM and control 
circuit at address zero. If pin is 
low during reset, ROM bank 0 is disabled. 
If pin is high, ROM bank 0 is enabled 
as 
normal. This pin is either pulled up (enabled) 
or pulled down (disabled) 
through 
a 
10 KO resistor. 


ROMOE 
0 
ROM OUTPUT 
ENABLE 
corresponds 
to the DEN signal and is asserted 
during all 
L 
data cycles to any of the eight ROM banks. 


Table 
5 I/O Inter1ace 
Signals 


Name 
Type 
Description 


10CS 
0 
I/O CHIP SELECT 
indicates 
an access to one of the eight I/O device address 
ranges in 
L 
an auto-poll 
cycle. It is asserted 
at the beginning 
of an I/O operation 
and remains 
asserted 
for a programmed 
duration of I/O device select. While lOGS is asserted, 
encoded 
address range select signals MAD31 :29 determine 
the particular 
1/0 channel 
to be accessed. 
MAD31 :29 signals are guaranteed 
to be valid during the entire time that 
lOGS is asserted. 


10WR 
0 
I/O WRITE 
indicates 
a write operation 
to one of the eight 1/0 device address ranges. It 
L 
is asserted 
a programmed 
duration 
after lOGS is asserted 
and remains asserted 
for a 
programmed 
duration. 
It is asserted 
for 1/0 write operations 
only. 


lORD 
0 
I/O READ indicates 
a read operation 
to one of the eight 1/0 device address ranges in 
L 
an auto-poll 
cycle. It is asserted 
a programmed 
duration 
after lOGS is asserted 
and 
remains asserted 
for a programmed 
duration. 
It is asserted 
for 1/0 read operations 
only. 


Table 
6. Printer 
Video 
Inter1ace 
Signals 


Name 
Type 
Description 


PRINT 
0 
PRINT REQUEST 
indicates 
printer engine should begin print operation. 
PRINT signal 
LtP) 
is a copy of an internal register bit and follows the programming 
of this bit. For non- 
printer applications, 
signal can be used as a general purpose output. 


FSYNC 
I 
FRAME SYNC indicates 
printer engine began to print and medium is positioned 
at top 
L(P) 
of page. When signal is asserted, 
the 82961 KA controller 
begins sampling 
LSYNG 


ST 
signal. 


LSYNC 
I 
LINE SYNC indicates 
printer engine began to move the medium and imaging circuitry 
L(P) 
is positioned 
at left page position. 
Each time this signal is asserted, 
the 82961 KA 


ST 
controller 
counts down the top margin size or initiates a scan line transfer 
of video data. 
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Name 
Type 
Description 


CBSY 
0 
COMMAND 
BUSY indicates 
82961 KA has command 
to transmit 
to the printer 
L 
engine. CBSY is asserted 
when 82961 KA's printer command 
register is written; 
it 


OD 
remains asserted 
until all command 
data is sent. 


SBSY 
I 
STATUS 
BUSY indicates 
the printer engine has status to transmit 
to the 82961 KA. 


L 
When signal is asserted, 
82961 KA assembles 
a printer engic 
status byte in the 


ST 
82961 KA's printer status register using eight transitions 
of C 
LK. Note that CCLK 
mCy come from the 82961 KA or the printer engine, depending 
on programmed 
C 
LK mode. 


CMD/STS 
I/O 
COMMAND/STATUS 
DATA is programmable 
to be either command 
data output to 
L 
printer engine or bidirectional 
command/status 
data to/from 
the printer. Command 


TS 
data is an 8-bit serial command 
stream to a printer engine. After CBSY is asserted 
by writing the 82961 KA's printer command 
register, each command 
bit is presented 
on this signal, accompanied 
by a transition 
of the CCLK signal. 


STS 
I 
STATUS 
DATA is an 8-bit serial status from the printer engine. After SBSY is 
L 
asserted, 
the printer engine presents 
each status bit on this signal with each CCLK 


ST 
transition 
from the 82961 KA controller. 
SBSY must be deasserted 
and reasserted 
to begin a second 8-bit status message. 


CCLK 
I/O 
COMMAND 
CLOCK is programmable 
to be either an input clock from the printer 
L 
engine or an output clock to the printer engine. 
It causes the printer engine to 


TS 
assemble 
an 8-bit command 
or transmit 
an 8-bit status one bit at a time with each 
transition 
of this signal. Each command 
bit is shifted on a falling edge of CCLK and 
each status bit is sampled 
on a rising edge of CCLK. 


• __ 
._ 
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_. 
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Name 
Type 
Description 


CLK2 
I 
SYSTEM 
CLOCK provides the fundamental 
timing for the 82961 KA. It is twice the 
frequency 
of an 80960Kx 
address or data cycle. 


RESET 
I 
RESET clears 82961 KA internal 
logic and initializes all internal registers. 
ST 


Vcc 
I 
SYSTEM 
POWER connections 
consist of eight pins; it is strongly 
recommended 
that 
these are connected 
externally 
to a vcc board plane. 


Vss 
I 
SYSTEM GROUND 
consists 
of 10 pins; it is strongly 
recommended 
that these are 
connected 
externally 
to a Vss board plane. 


Table 8. Processor 
Control 
Signals 


3-336 


3.3 
82961 KA PQFP Pinout 


82961KA 


Tables 
9 and 10 list 82961 KA pin names 
and package 
location. 
See Section 
4, Electrical 
Specifications 
for 
specifications 
and recommended 
connections. 


Table 9. PQFP Pin Name with Package 
Location 
(Signal Order) 


L-Bus 


Name 
Location 


LAD31 
128 


LAD30 
129 


LAD29 
130 


LAD28 
131 


LAD27 
132 


LAD26 
1 


LAD25 
2 


LAD24 
4 


LAD23 
5 


LAD22 
6 


LAD21 
7 


LAD20 
8 


LAD19 
9 


LAD18 
11 


LAD17 
12 


LAD16 
13 


LAD15 
15 


LAD14 
16 


LAD13 
17 


LAD12 
18 


LAD11 
19 


LAD10 
20 


LAD9 
21 


LAD8 
23 


LAD7 
24 


LAD6 
25 


LAD5 
27 


LAD4 
28 


LAD3 
29 


LAD2 
30 


LAD1 
31 


LADO 
32 


ALE 
119 


ADS 
118 


W/R 
120 


L-Bus (Continued) 


Name 
Location 


DEN 
117 


READY 
126 


BE3 
121 


BE2 
122 


BE1 
123 


BEO 
124 


IRQ 
125 


I/O 


Name 
Location 


IOCS 
96 


IOWR 
97 


lORD 
98 


Printer Video 


Name 
Location 


VIDEO 
109 


PRINT 
110 


VCLK 
111 


LSYNC 
112 


FSYNC 
113 


Printer Comm 


Name 
Location 


STS 
102 


SBSY 
104 


CCLK 
105 


CMD/STS 
107 


CBSY 
108 


Memory 


Name 
Location 


CASO 
46 


MOO 
47 


MD1 
48 


MD2 
50 


Memory 
(Continued) 


Name 
Location 


MD3 
51 


MD4 
52 


MD5 
53 


MD6 
54 


MD7 
56 


MD8 
57 


MD9 
59 


MD10 
60 


MD11 
61 


MD12 
62 


MD13 
63 


MD14 
64 


MD15 
65 


MAO 
66 


MA1 
67 


MA2 
68 


MA3 
69 


MM 
70 


MA5 
71 


MA6 
73 


MA7 
74 


MA8 
75 


MA9 
76 


MAD16 
77 


MAD17 
78 


MAD18 
80 


MAD19 
81 


MAD20 
82 


MAD21 
84 


MAD22 
85 


MAD23 
86 


MAD24 
88 


MAD25 
89 


MAD26 
90 


MAD27 
91 


Memory 
(Continued) 


Name 
Location 


MAD28 
92 


MAD29 
93 


MAD30 
94 


MAD31 
95 


DRAM 


Name 
Location 


WE3 
33 


WE2 
34 


WE1 
35 


WEO 
36 


RAS3 
37 


CAS3 
38 


RAS2 
40 


CAS2 
41 


RAS1 
43 


CAS1 
44 


RASO 
45 


ROM 


Name 
Location 


ROMOE 
99 


ROMCS1 
100 


ROMCSO 
101 


Control 


Name 
Location 


CLK2 
114 


RESET 
116 


Vcc 


14, 26, 49, 58, 72, 
103,115,127 


Vss 


3,10,22,39,42,55, 
79,83,87,106 
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Table 10. PQFP Pin Name with Package 
Location 
(Pin Order) 


Pin 
Signal 


1 
·LAD26 


2 
LAD25 


3 
Vss 


4 
LAD24 


5 
LAD23 


6 
LAD22 


7 
LAD21 


8 
LAD20 


9 
LAD19 


10 
Vss 


11 
LAD18 


12 
LAD17 


13 
LAD16 


14 
Vcc 


15 
LAD15 


16 
LAD14 


17 
LAD13 


18 
LAD12 


19 
LAD11 


20 
LAD 10 


21 
LAD9 


22 
Vss 


23 
LAD8 


24 
LAD7 


25 
LAD6 


26 
Vcc 


27 
LAD5 


28 
LAD4 


29 
LAD3 


30 
LAD2 


31 
LAD1 


32 
LADO 


33 
WE3 
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Pin 
Signal 


34 
WE2 


35 
WE1 


36 
WEO 


37 
RAS3 


38 
CAS3 


39 
Vss 


40 
RAS2 


41 
CAS2 


42 
Vss 


43 
RAS1 


44 
CAS1 


45 
RASO 


46 
CASO 


47 
MOO 


48 
MD1 


49 
Vcc 


50 
MD2 


51 
MD3 


52 
MD4 


53 
MD5 


54 
MD6 


55 
Vss 


56 
MD7 


57 
MD8 


58 
Vcc 


59 
MD9 


60 
MD10 


61 
MD11 


62 
MD12 


63 
MD13 


64 
MD14 


65 
MD15 


66 
MAO 


Pin 
Signal 


67 
MA1 


68 
MA2 


69 
MA3 


70 
MA4 


71 
MA5 


72 
Vcc 


73 
MA6 


74 
MA7 


75 
MA8 


76 
MA9 


77 
MAD16 


78 
MAD17 


79 
Vss 


80 
MAD18 


81 
MAD19 


82 
MAD20 


83 
Vss 


84 
MAD21 


85 
MAD22 


86 
MAD23 


87 
Vss 


88 
MAD24 


89 
MAD25 


90 
MAD26 


91 
MAD27 


92 
MAD28 


93 
MAD29 


94 
MAD30 


95 
MAD31 


96 
IOCS 


97 
IOWR 


98 
lORD 


99 
ROMOE 


Pin 
Signal 


100 
ROMCS1 


101 
ROMCSO 


102 
STS 


103 
Vcc 


104 
SBSY 


105 
CCLK 


106 
Vss 


107 
CMD/STS 


108 
CBSY 


109 
VIDEO 


110 
PRINT 


111 
VCLK 


112 
LSYNC 


113 
FSYNC 


114 
CLK2 


115 
Vcc 


116 
RESET 


117 
DEN 


118 
ADS 


119 
ALE 


120 
W/R 


121 
BE3 


122 
BE2 


123 
BE1 


124 
BEO 


125 
IRQ 


126 
READY 


127 
Vcc 


128 
LAD31 


129 
LAD30 


130 
LAD29 


131 
LAD28 


132 
LAD27 


3.4 Mechanical Data 


Package Dimensions and Mounting 


The 82961KA is available in a 132-lead plastic quad 
flat pack (PQFP). The plastic package uses fine- 
pitch gull wing leads arranged in a single row along 
the 
perimeter of the 
package with 
0.025 inch 


(0.64 mm) spacing. 


The PQFP is normally surface mounted to take best 
advantage of the plastic package's small footprint 
and low cost. In some applications, however, de- 
signers may prefer to use a socket, either to improve 
heat dissipation or reduce repair costs. 


Pin Assignment 


Figures 2 and 3 show the top view of the PQFP; 
notice that the pins are numbered in order from 1 to 
132 around the package's perimeter. Tables 9 and 
10 list the function of each pin in the PQFP. 
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It is strongly recommended that vcc and GND con- 
nections be made to multiple vcc and GND pins. 
Each vcc and GND pin should be connected to the 
appropriate 
voltage 
or 
ground 
and 
externally 
strapped close to the package. We recommend that 
you include separate power and ground planes in 
your circuit board for power distribution. 


3.5 Package Thermal Specification 


The 82961KA is specified for operation when case 
temperature is within the range Q'C to + 85'C 
(PQFP).The case temperature should be measured 
at the top center of the package as shown in Table 
11. 


The ambient temperature can be calculated from 0jc 
and 0ja by using the following equations: 


TJ = Tc + P*Ojc 
TA = TJ - 
P*Oja 
Tc = TA + P*[Oja - 
0jcl 


Table 11. 82961KA PQFP Package Thermal Characteristics 


PQFPThermal Reslstance-'C/Watt 


Alrflow-ft.lmln 
(m/sec) 
Parameter 
0 
50 
100 
200 
400 
600 
800 
(0) (0.25) (0.50) (1.01) (2.03) (3.04) (4.06) 
e Junction-to-Case 
9 
9 
9 
9 
9 
9 
9 
° Case-to-Ambient 
22 
19 
18 
16 
11 
9 
8 
(No Heatsink) 


NOTES: 
1. This table 
applies to 
82961KA 
PQFP soldered directly into board. 
2. 8JA = 8JC + 8CA· 


3. 8JL = 18'C/Watt 
8JB = 18'C/Watt 
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Values 
for 8ja and 8jc are given in Table 
11 for the 
PQFP for various 
airflows. 
Note that the 8ja for the 
PGA package 
can be reduced 
by adding a heatsink, 


while a heatsink 
is not generally 
used with the plas- 
tic package 
since it is intended 
to be surface 
mount- 
ed. The 
maximum 
allowable 
ambient 
temperature 


(TA) permitted 
without 
exceeding 
TC is shown by the 
charts 
in Figure 6. 


The curves 
assume 
the maximum 
permitted 
supply 
current 
(Icel at each 
speed, 
VCC of 5.0V, 
and 
a 
TCASE of + 85°C (PQFP). 


3.6 
Package Dimensions and Mounting 
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Figure 2. 82961KA 
132-Lead 
Plastic Quad Flat Pack (PQFP) Package 


PIN 34 
PIN 66 
272134-35 


PIN 132 
PIN lOO 


PIN 99 


• 
82961KA 
Ixxxxxxxx 


PIN -67 


Figure 3. 82961KA 
PQFP Package 
(Top View) 
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4.0 
ELECTRICAL 
SPECIFICATIONS 


4.1 
Power and Grounding 


Power and ground capacitors must be made to all 
82961KA power and ground pins. On the circuit 
board, all vcc pins must be strapped closely togeth- 
er. Similarly, all Vss pins should be strapped closely 
together. It is strongly recommended that vcc pins 
are connected to a common power plane and Vss 
pins are connected to a common ground plane in 
the PC board. 


4.2 
Power Decoupling 
Recommendations 


Decoupling capacitors should be placed near the 
82961KA. The 82961KA can cause transient power 
surges when multiple, loaded outputs switch simulta- 
neously. Proper power decoupling is required to 
avoid "ground lift" or "ground bounce" induced by 
these power surges. 


4.3 
Connection 
Recommendations 


Low inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. Inductance can be reduced by shortening 
the PC board traces between the processor and de- 
coupling capacitors as much as possible. 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. No unused Input 
pin should 
be left floating. 
Connect unused pins 
directly to Vss or to VCCthrough a pull-up resistor. 


82961KA 


Recommended value of the pull-up resistor is ap- 
proximately 20 KO.for each pin tied high. 


Output drivers for RAS3:0, CAS3:0, WE3:0 and 
MA9:0 are designed to directly drive the heavy ca- 
pacitive loads of DRAM arrays. To prevent outputs 
from ringing in the system, it is necessary to match 
the output driver's output impedance to that of the 
DRAM array. This is accomplished by placing a re- 
sistor in series with each signal. Place the series 
resistor near the 82961KA. Resistor value is depen- 
dent on DRAM loading and is best determined by 
experimentation at the prototype level. 


All open-drain outputs require a pull-up termination 
connected to the output pin. Signals READY, iRQ 
and CBSY are open-drain outputs on the 82961KA. 


While in most cases a simple pull-up resistor is ade- 
quate, a network consisting of pull-up and pull-down 
resistors may be necessary for the READY pin, 
since timing on this signal is critical. 


Figure 4 shows recommendations for low and high 
current drive network which assumes the circuit 
board has a characteristic impedance of 1000.. The 
resistor network should bias the output to a valid 
HIGH level (VIH ~ 2.0V). To minimize signal reflec- 
tion, termination should be placed close to the end 
of the PC board trace. Pull-up and pull-down resistor 
value should be chosen such that network imped- 
ance closely matches the characteristic impedance 
of the PC board trace. 


Figure 4 also shows a simple pull-up termination 
which can be used to terminate the open-drain out- 
puts. 


OPEN-DRAIN 
OUTPUT 


..., 


NETWORK 
TERIo4INA liON 


2100 dS 


600 


OPEN-DRAIN 
OUTPUT 


2800 


SIIo4PLE 
TERIo4INATION 
272134-2 


Figure 4. Network 
and Simple Termination 
Examples 
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Parameter 
IDescrlptlon 
Absolute 
Ratings 


Mln 
Max 
Units 
Conditions 


Storage Temperature 
-65 
150 
·C 


Case Temperature 
Under Bias 
-65 
110 
·C 


Supply Voltage with Respect to Vss 
-0.5 
6.5 
V 


Voltage on Other Pins with Respect to Vss 
-0.5 
vcc + 0.5 
V 


Table 
13. Targeted 
Operating 
Conditions 


KU82961KA-20, 
-16 (20,16 
MHz Specification) 


Symbol 
Parameter 
IDescrlptlon 
Mln 
Max 
Units 
Conditions 


Vcc 
Supply Voltage 
KU82961 KA-20 
4.75 
5.25 
V 
(Note 1) 
KU82961 KA-16 
4.5 
5.5 
(Note 2) 


fc 
Input Clock (CLK2) Frequency 
KU82961 KA-20 
1 
40 
MHz 
(Note 1) 
KU82961 KA-16 
1 
32 
(Note 2) 


Tc 
Case Temperature 
Under Bias 
KU82961 KA-20 
0 
85 
·C 
(Note 1) 
KU82961 KA-16 
0 
85 
(Note 2) 


NOTES: 
1. 82961KA-20 
is tested 
with voltage 
supplies 
set to ±5%. 


2. 82961KA-16 
is tested 
with voltage 
supplies 
set to 
± 10%. 
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6.0 
TARGETED 
DC CHARACTERISTICS 


Table 14. Targeted 
DC Characteristics 


82961KA-20, 
-16 (20,16 
MHz Specification) 


Symbol 
Parameter 
IDescriptlon 
Mln 
Max 
Units 
Conditions 


Vn, 
Input Low Voltage 
-0.3 
0.8 
V 
(Note 1) 


VIH 
Input High Voltage 
2.0 
vcc + 0.3 
V 
(Note 1) 


VOl 
Output Low Voltage 
0.45 
V 
(Note 2) 


VOH 
Output High Voltage 
2.4 
V 
(Note 3) 


'cc 
Power Supply Current 
82961KA-20 
250 
mA 
(Note 4) 
82961KA-16 
200 
mA 


III 
Input Leakage Current 
±10 
/-LA 
Os 
VIN s vcc 


ILO 
Output Leakage Current 
±10 
/-LA 
0.45 s vo s vcc 


CIN 
Input Capacitance 
8 
pF 
fc = 1 MHz(5) 


CIO 
l/O or Output Capacitance 
10 
pF 
fc = 1 MHz(5) 


NOTES: 
1. RESET, FSYNC, lSYNC. SBSY, and SfS are Schmitt trigger inputs. Hysteresis on these pins is approximately 200 mV, 
but is not tested for each device. 
2. VOLis measured under the following conditions: 


IOL= 6 mA 
LAD31:0, MAD31:16, MD15:0,TOGS,IORD,IOWR,ROMOE, ROMCS1:0, 
RAS3:0, CAS3:0, WE3:0, MA9:0 
IOL= 12 mA 
CCLK, CMD/STS, CBSY, VIDEO, PRINT 
IOL= 24 mA 
READY,i'RC:i 
3. VOHis measured under the following conditions: 


IOH= 6 mA 
LAD31:0, MAD31:16, MD15:0,lOCS,IORD,IOWR. ROMOE. ROMCS1:0, 
RAS3:0,CAS3:O,WE3:0. MA9:0 


IOH= 12 mA 
CCLK, CMD/STS, VIDEO, PRINT 
VOHis not measured for open-drain outputs, i'RC:i, READY. and CBSY 
4. Measured at worst case frequency, vcc. and temperature, with device operating and outputs loaded to the test conditions 
shown in Figure 5. 
5. Capacitance values are not tested. 
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Figure 5. Typical Supply Current (lcC>vs Frequency (fc) 


7.0 
TARGETED 
AC 
CHARACTERISTICS 
--=:::: 


20WH% 
-' 
--- 
~ 


16 11IHz 


~ 


1~@Ie= 
-SaC 


,- 
@Ie= 
+2SOC 
D@Ie- 
+10SOC 


Figure 8 shows output valid delay as a function of 
load capacitance. Output valid delays given in the 
AC Specifications must be adjusted using data in 
Figure 8 when pin loading in the system exceeds 
test load, CL. Derating information is verified at all 
operating conditions using a sampling of compo- 
nents which represent process extremes. Derating 
information is not tested for each device. 


AC Specifications relating to outputs are measured 
at the 1.5V crossing point of the output signal unless 
otherwise indicated. Input signals are driven during 
test with a rise and fall time of s 2 ns. AC Specifica- 
tions relating to inputs are measured from the 1.5V 
crossing point of the input waveform. 


AC Specifications presented in this document are 
tested with loading on output pins as shown in Fig- 
ure 7. Output test load (CO is never less than 50 pF 
unless specifically stated in the AC Specifications. 


4.5V 
5.0V 


SUPPL Y VOLTAGE 
(V) 


5.5V 


272134-4 


3-344 


Figure 6. Typical Supply Current 
(ICC)vs Supply Voltage (Vcc) 
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VREF = vcc 


Dl and D2 are matched 
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TRI-STATE 0------, 
OUTPUT 
...L 


~~ 


OPEN-DRAIN OUTPUT 1~ 
.1 
I"" 


~~ 


Figure 7. AC Test Loads 
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CD 


® 
® 


G) 
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~ 
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GROUP 
<D 
LAD31:0, 
MD15:0, 
MAD31:16, TOWR, lORD, rocg,~, 
ROMCS1:0 


GROUP 
<i> 
CBSY, OM)/g'ffi, CCLK, iJiQEO, J5RfNi 


GROUP 
a> 
MA9:0, RAS3:0, CAS3:0, WE3:0 


GROUP 
@ 
READY, iRO 


50 
150 


Figure 8. Output Valid Delay (tov) vs Load Capacitance 
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82961KA-16 
(16 MHz Specification) 


fc 
CLK2 Frequency 
1 
32 
MHz 
(Note 1) 


te 
CLK2 Period 
31.25 
1000 
ns 
(Note 1) 


teH 
CLK2 High Time 
12.5 
ns 
(Note 1) 


teL 
CLK2 Low Time 
12.5 
ns 
(Note 1) 


teR 
CLK2 Rise Time 
6 
ns 
(Note 1) 


teF 
CLK2 Fall Time 
6 
ns 
(Note 1) 


NOTE: 
1. See Figure 
7 for waveforms 
and specifications. 


- - 
2.0V 
-1.5V 


- 
O.8V 


272134-7 


Figure 9. Clock 
Input 
Waveforms 
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Table 16. Synchronous 
Input and Output Specifications 


Symbol 
Parameter IDescrlptlon 
Mln 
Max 
Unit 
Conditions 


82961KA-20 
(20 MHz Specification) 


toy 
Output Valid Delay (Maximum 
Value) 


toH 
Output Hold Delay (Minimum 
Value) 


toV1. tOH1 
LAD31:0 
5 
37 
ns 
Notes 1. 3. 7 


tov2.toH2 
LAD31:0 
(register read only) 
5 
59 
ns 
Notes 1. 2. 7 


tov3.toH3 
IRQ 
5 
37 
ns 
Notes 1. 2 


toV4.toH4 
READY 
5 
37 
ns 
Notes 1. 2 


toV5. toH5 
MD15:0 
5 
33 
ns 
Notes 1. 3.10 


tove. toH6 
MAD31:16 
(I/O cycles only) 
5 
34 
ns 
Notes 1. 3. 9 


toV7. toH7 
MAD31:16 
(ROM cycles only) 
5 
31 
ns 
Notes 1. 2. 9 


toV8. toH8 
MAD31 :29 (auto-poll 
only) 
5 
37 
ns 
Notes 1. 6. 9 


toV9. toH9 
MA9:0 
5 
32 
ns 
Notes 1. 4. 8 


toV10. toH10 
RAS3:0. CAS3:0 
5 
26 
ns 
Notes 1. 4 


toV11. toH11 
WE3:0 
5 
34 
ns 
Notes 1•.5 


toV12. toH12 
ROMCS1 :0. ROMOE 
5 
24 
ns 
Notes 1. 2 


toV13. toH13 
10CS. lORD. 10WR 
5 
27 
ns 
Notes 1. 3 


toV14. toH14 
iOCS. lORD (auto-poll 
only) 
5 
31 
ns 
Notes 1. 3 


toV15. toH15 
PRINT 
28 
ns 
Notes 1. 2 


toF 
Output Float Delay 


toF1 
MD15:0 
33 
ns 
Notes 1. 11 


toF2 
MAD31:16 
37 
ns 
Notes 1. 11 


tlS 
Input Setup Time 


tlS1 
ADS.W/R.BE3:0.DEN 
42 
ns 
Notes 1. 2 


tlS2 
LAD31:0 
10 
ns 
Notes 1. 2.12 


tlS3 
MD15:0 
8 
ns 
Notes 1. 3.13 


tlS4 
MAD31:16 
8 
ns 
Notes 1. 3. 14 


tlS5 
RESET 
0 
ns 
Notes 1. 3 


tlS6 
ROMCS1:0 
(on RESET) 
0 
ns 
Notes 1. 3 


tlH 
Input Hold Time 


tlH1 
ADS.W/R. 
BE3:0. DEN 
2 
ns 
Notes 1. 2 


tlH2 
LAD31:0 
2 
ns 
Notes 1. 2. 12 


tlH3 
MD15:0 
8 
ns 
Notes 1. 3.13 


tlH4 
MAD31:16 
10 
ns 
Notes 1. 3. 14 


tlH5 
RESET 
5 
ns 
Notes 1. 3 


tlH6 
/ 
ROMCS1:0 
(on RESET) 
8 
ns 
Notes 1. 3 


82961KA-16 
(16 MHz Specification) 


TBD 
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NOTES: 
1. See Figure B for waveforms 
and specifications. 


2. Signal 
is synchronous 
to the CLK2 A edge. 
3. Signal 
is synchronous 
to the CLK2 A or C edge. 


4. Signal 
is synchronous 
to any rising or falling 
edge of CLK2 (CLK2 A, B, C or 0 edge). 
5. Signal 
is synchronous 
to the CLK2 B edge. 


6. Signal 
is synchronous 
to the CLK2 C edge. 
7. LAD31:0 
are synchronous 
outputs 
when 
presenting 
data for an internal 
register 
read, or any I/O read cycle 
in which 
rising 


lORD 
precedes 
rising iOCS. 
LAD15:0 
are synchronous 
outputs 
for the first half of a packed 
1/0 or DRAM 
access. 
For all 
other 
accesses, 
data is driven 
on LAD31:0 
combinatorially, 
and is described 
by the tMDLD parameter. 


B. MA9:0 
signals 
are synchronous 
outputs 
when 
presenting 
the DRAM 
row address 
when 
a page miss or a video 
FIFO read 
occurs, 
or when switching 
from the DRAM 
row address 
to the DRAM column 
address. 
MA 1:0 are synchronous 
during 
DRAM 
burst 
cycles, 
and 
MA7 
is synchronous 
during 
16-bit 
cycles. 
For all other 
accesses, 
MA9:0 
are driven 
combinatorially 
as 
described 
by the tLAMA parameter. 


9. MAD31:16 
signals 
are synchronous 
outputs 
for all 1/0 and 
PROM 
accesses. 
For 
DRAM 
accesses 
these 
signals 
are 
driven 
combinatorially 
as described 
by the tLDMD parameter. 
10. MD15:0 
outputs 
are synchronous 
outputs 
during 
the second 
half of a packed 
110 or DRAM 
write 
access. 
For all other 
accesses, 
data is driven combinatorially 
as described 
by the tLDMD parameter. 


11. For DRAM 
write 
accesses, 
each 
byte in the MAD31 :16 and MD15:0 
bus is enabled 
only when 
the corresponding 
write 
enable 
(WE3:0) 
signal 
is active. 
Bytes 
in this bus which 
are not selected 
are floated 
to avoid contention 
on the DRAM 
data 
bus. At the end of a DRAM 
access 
the data bus (MAD31:16, 
MD15:0) 
stays valid until the CLK2 A edge of the Tr state, 
and 
float when 
WE3:0 
are deasserted 
at the CLK2 B edge of the Tr state. 


For IIQ and ROM accesses, 
MAD31:16 
signals 
are enabled 
in the first Tw state 
of an access 
synchronous 
to the CLK2 
A 
edge. After 
ROM accesses, 
MAD31:16 
are floated 
in the T, state, synchronous 
to the CLK2 A edge. After 1/0 accesses, 
the 
MAD31:29 
signals 
are floated 
later on the CLK2 
C edge, 
preventing 
glitches 
in external 
decoding 
logic. 
Note 
for 1/0, the 
MAD2B:16 
signals 
are also floated 
on the CLK2 C edge, but only stay valid until the CLK2 A edge of the Tr state. 


12. LAD31:0 
are synchronous 
inputs 
only for internal 
register 
writes. 
13. MD15:0 
are synchronous 
inputs 
on the first half of a packed 
1/0 or DRAM 
read access, 
any 1/0 access 
in which 
rising 
lORD precedes 
rising iOCS, auto-poll 
accesses, 
and when 
the video 
FIFO is being filled. 


14. MAD31:16 
are synchronous 
inputs only when the video 
FIFO is being filled. 
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Figure 10. Synchronous 
Input/Output 
Waveforms 
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tLDMD 
Local Data to Memory 
Data 
5 
22 
ns 
Note 1 


tMDLD 
Memory Data to Local Data 
5 
22 
ns 
Note 1 


tRHRL 
RESET High to RESET Low 
41 
2x Clocks(2) 
Note 1 


82961KA-16 
(16 MHz Specification) 


TBD 


NOTES: 
1. See Figure 
10 for waveforms 
and specifications. 


2. A 2x clock 
cycle 
is equal to a CLK2 period, le. 
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Figure 12. L-Bus Relative 
Timings 
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7.2 
L-Bus Operational 
Waveforms 
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Figure 13. L·Bus Operation 
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7.3 
Targeted 
DRAM Controller 
Timing Specifications 


Table 
18. DRAM Controller 
Relative Timings 


82961KA-20, 
·16 (20,16 
MHz Specification) 


Symbol 
Parameter IDescription 
Mln 
I 
Max 
Units 
Conditions 


tDQDQ 
Output to Output Delay 
n4x • (te/2) ± 4 
ns 
(Notes 1, 2, 3) 


tWLMQ 
WE3:0 Low to Memory: data bus enabled 
+4 
ns 
(Notes 
1, 4) 


tWHMF 
WE3:0 High to Memory: data bus float 
+4 
ns 
(Notes 
1, 4) 


NOTES: 
1. See Figure 15 for waveforms and specifications. 
2. The le parameter refers to the CLK2 period. The value for le is given in Table 15. n4x is the number of 4x clock cycles 
between DRAM outputs. The number of 4x clock cycles is selected by programming the DRAM controller. 
3. When the value for n4x is an odd number, an error term is required if the CLK2 duty cycle is asymmetric. For example, if 
CLK2 has a 40%-60% 
duty cycle, the falling edge varies ± 10% from a perfect 50% duty cycle. The error term in this case, 
which is added to the DRAM timing, is calculated as follows: 


Error = ± (0.10) (le) 
4. For non-aligned DRAM writes, each byte in the transfer is enabled and disabled onto the MAD31:16 and MD15:0 data bus 
by the corresponding WE. This guarantees no contention between the 82961KA and DRAM for bytes not selected for a 
particular write cycle. 


~A9:0, 
RAS3:0,[=t; 
ANY OUTPUT 
CAS3:0, 
WE3:0 
I 
_ 
woo_'c 


~A9:0, 
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[ 
AN~ IOUTPUT 


• 
CAS3:0, 
WE3:0 
~I~! 
_ 


WE3:0[J----d 


~ 


tWLM~ 
~ 
t 
wHMF 


, 


~AD31:16,~D15:0. 
[ 
_ 
(WRITES) 
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I 
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NOTE: 
tDODOdescribes delay between signals which are output synchronously to the rising or falling edge of CLK2x. 


Figure 15. DRAM Relative 
Timings 
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tcASrd 
CAS3:0 Pulse Width for Reads 
1 
15 
4xClocks 


tcASwr 
CAS3:0 Pulse Width for Writes 
1 
15 
4xClocks 


tcPrd 
CAS3:0 Precharge 
for Reads 
0 
15 
4xClocks 


tcPwr 
CAS3:0 Precharge 
for Writes 
2 
15 
4xClocks 


tRP 
RAS3:0 Precharge 
2 
15 
4xClocks 


tRPC 
RAS3:0 Hold to CAS3:0 Precharge 
(Refresh) 
0 
7 
2xClocks 


tcSR 
CAS3:0 Setup Time (Refresh) 
0 
7 
2xClocks 


tcHR 
CAS3:0 Hold Time (Refresh) 
0 
7 
2xClocks 


NOTE: 
A 4x clock is equal to 0.5 (tel, or half the CLK2 period. A 2x clock is equal to te, or the CLK2 period. When the value for n4x 
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Figure 20. Static Column Mode Read Cycle 


3-357 


El 


t.lA9:0l 
I 
ROW 
COLUt.lN0 
COLUt.lN 1 


I-tASR - c----... 
I 
I 


RAS3:0 [ 


'RAH 
I 
I 
I 
I 


tASC 
I 
_tASC" 


tcAS~ 
tcp 


CAS3:0 [ 


I 
(2) 


I 
. 
I 


I 
- 


WE3:0 [ 


I 
\ 
READ= 1. WRITE= 0 
/ 
\ 
I 
I 


t.lAD31:16 [ 
CI3 
C 
t.lD15:0 
(READS) 
I 


t.4AD31:16 [ 


I 
( 
) 
( 
t.lD15:0 
DATA 
DATA 
(WRITES) 
, 
272134-19 


NOTES: 
1. Address 
0 and Address 
1 are in the same DRAM 
page. 
2. Timing 
shown 
is greater 
of tAse and !cp. 


Figure 21. DRAM Page Hit 


3-358 


82961KA 


960 
[ 
re 
Tw 
T 
BUS STATE 
I 
I 
I 
W ••• 


I 
I 
I 


LA031:0 [--t-BCADDR 
0 
(READS) 
I 
'-- 
__ 
-" 
__ 
.1 
"'__ 
.n.._~I- 
_ 


I 
I 
I 
r: 
- 
I 
I 
S 


[ 


I 
(1)....-I 
__ :I~:_____ 
I 
(1) 
I 


t;~I~~~~ +B 
DATA 
~: 


I 
I ~~J-------------------------- 
: 
11 
~r---r----~:~--~:~Iill--~--------------------- 


I 
I 


Ta 
Tw 
Tw ••• 
Td 
Tr 


DATA 


READY [ 


WA9:0 
[ 
ROW 
COLUWN 0 
I 
ROW 
COLUWN 
1 


I 


-~ASR 
- 


I 
I r--- 'tip 
(2) - 
'tiAH 
I 
f-- 
I 
I 
I 
'tiAH 
I 
I 
t---- 
I 
I' 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
tASC 
teAS .L, 


I 
tASC 
I 
I 
I 
t--- 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
~ 
I 
I 
I 


RAS3:0 
[ 


CAS3:0 
[ 


WE3:0 
[ 
, 


I 


WAD31:16[ 
c:I3!) 
C 
WO15:0 
- 
••• ---- 
•••••------------- 
VALID 
•••••••• ---- 
••• ----------------------------- 
(READS) 
, 


I 


READ = 
1. WRITE = 0 
READ = 
1. WRITE = 0 


WA~~:~~~ 
[ 
__ -!-_....I_....( 
DATA 
)~-+__ +.... 
( 
(WRITES) 
\..-------_--1. 
10... 
_ 
DATA 


272134-20 


NOTES: 
1. Address 
0 and Address 
1 are not in the same 
DRAM 
page. 
2. Assumes 
nRP ;" nASR. 


Figure 
22. DRAM Page Miss 


3-359 


• 


82961KA 


3·360 


960 
[ 
BUS STATE 


LAD31:0 
[ 


READY [ 


CLK2 
A or C edge 
Ta 
Tw 
Tw ••• 
Td 
Tr 


, 
, 
, 
I 
, 
, 
, 
+Se 
IlstH~ 
, 
' 
, 


,I. 
I 
I 
I 


! 
' 
, 
; 
~ 


i 
i 
, 
, 
, 
I 
' 
, 
, 
, 
, 
i 
, 
, 
, 
I 
, 
' 
, 
i 
, 
, 
, 


ROW 
COLUIilN 
0 
I 
COLUIilN 
1 
I 


- 
tASR -1+--+ 
, 
I-- 'Rp- 
, 


'RAH 
i 
i, 
(2) 
, 


tASC 
tcPrd 
i 
tcASrd 
tcASrd 
.••.• 
H 


i 
\ 
i 
i 


( 1) 
,, 
i 
I 
listHALF! 
12nd 
HALF 
, 
I 
! 


IilA9:0 
[ 


RAS3:0 
[ 


CAS3:0 
[ 


IilD15:0 
[ 


WE3:0 = HIGH 


NOTES; 
1. ~~~g CAS3:0synchronousto CLK2A· or C·edgeon 1st half of packed read. 
2. 
3:0 remainsactive if page cache is enabled. 


Figure 23. DRAM 16-Blt Page Mode Read Cycle 


272134-21 


82961KA 


960 [ 
; 
To 
; 
Tw 
; 
Tw... 
CUCZ 
Td 
Tr: 


BUSSTA.TE[~s 
AOf," 
S: 
~, 


LAD,':0 
~ 
DATA. 
~ 


IDD'[~:~'~:~---~~~ 


: 
1 
: 


RASJ:O 
[ 


W"':O [~--'~---'l"""---""'T'..A----,",-"-- 


CASJ:Q 
[ 


wm[ 
WAD31'''[_i-+-C~~~~~~:J 
W015;0 
lit 
HAlf 
VALID 


(3) 
(3) 


Wt3:2 
• 
HIGH 
272134-22 
NOTES: 
1. RiS~g 
CAS3:0 
synchronous 
to CLK2 A- or C-edge 
on 1st half of packed 
write. 


2. RA 
3:0 remains 
active 
if page cache 
is enabled. 


3. WE1 :0 synchronous 
to CLK2 
B-edge. 


Figure 24. DRAM Aligned 16-Blt Page Mode Write Cycle 


"0 
[ 
" 
T. 
" 
T. 
" 
T.... 
• 
Td 
• 
T 
• 
Td 
T 
BUS 
STATE 
• 
I 
': 
w... 
CLI<2 
, 
r 


• 
" 
" 
S 
Aor~~. 
i 
: 


lAOJ1:0[ 
~iWOftDO/8YTE3 
C 
DATA 
WORD1/~EO.,.2 
>-t--- 


, 
:: 
:: 
: 
I 
:: 
, 
"3,o[~~mo.,.: 
,C 
: 
c::: 


"'0'[: 
: 
' 
I~ 
--ur+- 


: 
I 
: 
I 
: 


11I"':0 [ 


RASJ:O 
[ 


cm:o[ 
...[ 
(1) 


m[ 


WOls:e[ 


W07:0 
[ 


i[J';2 •• HIGH 


(3) 


I..:,;.:.... __ 
+ 
,;.--J (1) 


DATA: 
WORD O/BYTE 
3 


WORD 
1/aYT[ 
2 


272134-23 


. ' .. 
'. 
IYT( 
0,1:2.3 
0,112,3 


I,' 
I 
I 
I 
ACCESS SHOWN 
I, 
..••.• 
" 


NON-PACKEt) 
PACKED 


wRlft 
WRITE 


NOTES: 
1. WE1 :0 synchronous 
to CLK2 
B-edge. 


2. WE1 :0 synchronous 
to CLK2 
B- or D-edge 
following 
rising CAS3:0. 
3. RAS3:0 
remains 
active 
if page cache 
is enabled. 


4. CAS3:0 
rising edge synchronous 
to CLK2 A- or C-edge 
for first half of packed 
write. 


Figure 25. DRAM Non-Aligned 16-Blt Write Cycle 
I 


3-361 


11 


v 
v. 
I 
;j~ 
tx ctocks 


tWRWT 
10WR Wait High Time 
0 
31 
0 
31 
1xGlocks 


nWRACC 
IOWR Access 
Low Time 
0 
31 
1 
32 
1xGlocks 


nlOREC 
1/0 Recovery 
Time 
0 
31 
0 
31 
1xGlocks 


nlOGS 
lOGS Low Period 
1 
31 
2 
32 
1xGlocks 


NOTE: 
A lx clock 
is equal to 2'(lcl, 
or twice 
the CLK2 period. 


3-362 


intel~ 
82961KA 


LAD31:0 
[ 


[~'-~I-r---.,.-,....----,.--\~~~ 


[+--+{'-"""T"""""""--Sr-r----T'"---r...ll..--~n____r____r__I 
[ 
[ 


Io4AD30:29 


Io4A9:0 
Io4AD25:19 


Io4ADI8:17 


Ta 
Tw 
Tw 
Tw 
Td 
Tw 
Tw 
Tw 
Td 
Tr 


I 


READY 
[ 


1---'--+-'--\\-+- 
"ROIojRDY 
-L_~--.J'---+-_-'-'II-I- 


272134-24 


Figure 26. ROM Burst Read 


3·363 


82961KA 
intel~ 


LAD31:0 [ 


MAD31:29 [ 


MAD28:16 [ 
MA9:0 


lORD [ 


IOWR [ 


IOCS [ 


READY [ 


MD15:0 
[ 


Ta 
Tw 
Tw 
Tw 
Td 
Tr 
Ti 


_-'---_-+ __ 
L----t+_--+_ 
"IOREC 


READ or WRITE DATA 


3-364 


Figure 27. 1/0 Aligned 
Read or Write 


272134-25 


I 


82961KA 


Ta 
Tw 
Tw 
Tw 
Tw 
Tw 
Tw 
Tw 
Tw 
Tw 
Td 
Tr 
T; 


AAAAAAAAAAAAAAAAAAAAAAAAifV' 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


[ 
r< A~DRESS ~ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


~~"\I 
I 
2nd 


I 


LAD31:0 
1st 
16-81TS 
is-errs 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


[ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 


101015:0 
1st 16-8IT5 
2nd 
16-81T5 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


[~~ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
0 
I 
I 


IotAD31:29 
I/o RANGE SELECT 
I/o RANGE SELECT 
~~ 


I 
I 
I lJ:IIJJ::trr:rr.t 


I 
I 
I 
I 


[~~ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
0 
0 
0 


IotAD28:17 
I/o ADDRESS 
23:2 
I/o ADDRESS 
23:2 
~ 
IotA9:0 


I 
I 
I 
0 
I 
I rrrrr 


I 
I 
I 
I 


[ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
\ 
I 
I 


IotAD16 
VALID 
FOR 1st 
HALf 
Of 
ACCESS 
VALID 
fOR 
2nd 
HALf 
Of 
ACCESS 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
4---t- 
"ROWT 
"ROACC +---00 
I 
I 
--.- 
"ROWT 
"ROACC +---00 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
[ 
1\ 


I 
I 
I 
1\ 


I 
I 
I 
lORD 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
~ 
I 
I 


I 
I 
I 
"loos 
I 
I 
I 
"IOREC 
I 
I 
I 
"Iocs 
I 
I 
I "IOREC-+- 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
[ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


10CS 
:\ 
I 
I 
I 
I 


I 
I 
\ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


[ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


READY 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I lW-I 


I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


0 
0 


272134-26 • 


Figure 28. 1/0 Packed 
Read 


I 


3-365 


, 
, 
! 
I 
I 
! 


~ 


J 
1\ 
I/o ADDRESS 
23:2 
I/o ADDRESS 
23:2 
~,""",'" 


! 
! 
! 
! 
! 
I mu 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
! 
! 
I 
I 
I 
I 


~"'I 
VALID 
FOR ht 
HALF 
OF ACCESS I 
VALID 
FOR 2nd 
HALF OF ACCESS 
~ 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


--.- 
"wRWT 
nWRACC+-+ 
I 
I 
I--+- "wRWT 
nWRACC+--+ 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
\ 


I 
I 
I1 
\ 


I 
I 
1 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
"toes 
I 
I 
I 
"tOREC 
I 
I 
I 
"IOCS 
I 
I 
"tOREC+- 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
:\ 
I 
I 
I 
I1 


I 
\ 


I 
I 
I 
1 


I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 
I 
I 
I I\..L.I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 


28:17 L 
MA9:0 


MAD16 
[ 


10WR 
[ 


IOCS 
[ 


READY 
[ 


272134-27 


Figure 29. 1/0 Packed Write 


3·366 
I 


82961KA 


Tw 
Tw 
Tw 
Td 
Tw 
Tw 
Tw 
Tw 
Tw 


r\JLr\JLr\JLr\JLAAAAAAAAN 
1 
1 
1 
1 
1 
1 
1 
1 
1 


[ 


1 
1 
1 
1 
1 
1 
1 
1 
1 


IIAD31:29 
1 
1 
I/o RANGE SELECT 
1 
1 
1 
1 
1 


1 
1 
1 
1 
1 
1 
1 
1 
1 


[ 


1 
1 
1 
1 
1 
1 
1 
1 
1 


IIAD28:19 
1 
1 
I/o AODRESS 23:4 
1 
1 
1 
1 
1 


IIA9:0 


1 
1 
1 
1 
1 
1 
1 
1 
1 


[ 


1 
1 
1 
1 
1 
1 
1 
1 
1 


IIAD18:16 
BURST ADDRESS 3: 1 I 
NEXT BURST ADDRESS 
I 


1 
1 
1 
I 
1 
1 
" 
1 
1 
nX~T 


1 
1 


1 
1 
nxACC 
1 
1 
1 


[ 


1 
1 
1 
1 
1 
1 
1 
1 
1 


10RD/ 
1 
1 
1 
1 
1 
\ 


1 
1 
I 
1 
1 
1 
1 
L4 
10WR 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
"Ioes 


1 
1 
1 
"IOREC 
1 
1 
1 
1 
1 
1 
1 


[!' 


1 
1 
1 
1 
1 
1 
1 
1 


IOCS 


1 
1 
1 
I 
1\ 


I 
1 


1 
1 
1 
1 
1 
I 
1 


1 
1 
1 


1 
1 
1 
1 
1 
1 
1 
1 


[ 


1 
1 
1 
1 
1 
1 
1 
1 
1 
1 


READY 
1 
1 
1 
\ 


1 
I 
1 
1 
1 
1 
1 


1 
1 
1 
1 
1 
1 
1 
1 
1 


1 
1 
1 
1 
1 
1 
1 
1 


272134-28 


Figure 30. I/O Address 
Transition 
in Burst Mode 
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Figure 31.1/0 Auto-Poll 
Cycle 
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7.6 
Targeted Printer Video and Communications 
Timing Specifications 


Table 21. Printer 
Video 
Interface 
Timings 


82961KA-20, -16, -10 (20,16,10 
MHz Specification) 


Symbol 
Parameter 
IDescrlptlon 
Mln 
Max 
Units 
Conditions 


tVC1 
VCLK Frequency, 
1x Clock Mode 
25 
MHz 
(Note 1) 


tvca 
VCLK Frequency, 
8x Clock Mode 
66 
MHz 
(Note 1) 


tVCH 
VCLK High Time 
ns 
(Note 1) 


tVCL 
VCLK Low Time 
ns 
(Note 1) 


tVOV1 
VIDEO Output Valid, 1x Clock Mode 
27 
ns 
(Note 1) 


tVOV8 
VIDEO Output Valid, 8x Clock Mode 
30 
ns 
(Note 1) 


tVIS 
LSYNC Input Setup, 1x Clock Mode 
0 
ns 
(Note 1) 


tVIH 
LSYNC Input Hold, 1x Clock Mode 
9 
ns 
(Note 1) 


tVW1 
FSYNC Input Pulse Width, 1x Clock Mode 
2 
tVC1 
(Note 1) 


tVW8 
LSYNC, FSYNC Input Pulse Width, 8x Clock Mode 
16 
tVC8 
(Note 1) 


NOTE: 
1. See Figure 30 for waveforms 
and specifications. 
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Table 
22. Printer 
Communications 
Interface 
Timings 


Symbol 
Parameter 
IDescriptlon 
Mln 
Max 
Units 
Conditions 


82961 KA Supplies 
CCLK 


tpv 
CMD Output Valid Delay 
4 
2xClocks 
(Note 1) 


tps 
STS Input Setup 
0 
ns 
(Note 1) 


tpH 
STS Input Hold 
4 
2x Clocks 
(Note 1) 


tpsc 
SBSY Valid to CCLK Driven 
0 
6 
2xClocks 
(Note 1) 


Programmable 
Parameters 


npCK 
CCLK High and Low Time 
1 
4096 
162x 
Clocks 
(Notes 1. 2) 


CCLK Supplied 
Externally 


tpv 
CMD Output Valid Delay 
8 
2xClocks 
(Note 1) 


tps 
STS Input Setup 
0 
2xClocks 
(Note 1) 


tpH 
STS Input Hold 
8 
2xClocks 
(Note 1) 


tpc 
CCLK Period 
16 
2xClocks 
(Note 1) 


tPCH 
CCLK High Time 
8 
2x Clocks 
(Note 1) 


tpCL 
CCLK Low Time 
8 
2xClocks 
(Note 1) 


tpcs 
SBSY Setup to CCO< Valid 
0 
zx Clocks 
(Note 1) 


Programmable 
Parameters 


npF 
CMD and CBSY Float Time 
1 
4096 
162xClocks 
(Notes 1. 2) 


NOTES: 
1. See Figures 
31 and 32 for waveforms 
and specifications. 


2. npCK and npF are programmed 
through 
the CCiJ( divisor 
register; 
parameter 
is programmed 
in (n"16) 
2x clocks. 
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8x Clock Mode 
I--- \YCS --l 
[ 
YCLK 
L""1 
[ 
X 
C 
VIDEO 


fSYNC 
[ 
L::J 
LSYNC 
272134-31 


NOTE: 
1. tVDV1 is referenced 
to the edge of VCLK that is programmed 
for video 
data output. 


Figure 32. Printer Video Interface 
Timing 
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Figure 33. Printer Communications 
Interface Timing (82961KA Supplies CCLK) 
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Figure 34. Printer Communications 
Interface Timing (CCLK Supplied Externally) 
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82961KD 
PRINTER COPROCESSOR 
• High Performance Printer 
• Programmable DRAM Interface 
Coprocessor· 
- 
Direct Interface to Four Banks 
- 
Direct Interface to Intel's i960@KA 
- 
Expandable to Eight Banks 
or KB·· 32-Blt Embedded 
- 
Transparent Refresh of DRAM Banks 
Processors 
- 
Supports 256 Kbit, 1 Mblt and 4 Mbit 
• Direct Generic Printer Engine Interface 
DRAMs 


- 
TEC, Canon, Rlcoh, Okidata and Ink 
-Individually 
Programmable Timing 


Jet Printer Engines 
Parameters for All Banks of DRAM 
• Burst Interface Support for 1960KA/KB 
• Programmable 110 Control 


Bus 
- 
Chip Select, Access Time, Recovery 
Time 
• Big Endian or Little Endlan Byte 
- 
Wait State Control for Eight External 
Ordering of Font Cartridges 
Devices 
• DRAM Burst Accesses up to 256 Words 
• Automatic Data Conversion from 16-Blt 
for Graphics Operations 
Font Cartridge to 32-Blt 1960Embedded 
• Bit Map Image Compression 
Processor Format 
• Compressed Display List Processing 
• Low-Cost 164-Lead Plastic Quad Flat 
• Programmable EPROMInterface ' 
Pack (PQFP) 


-Interfaces 
to Eight Banks of 
• Provides System Timer Functions 
Interleaved EPROM 
-Individually 
Programmable Timing 
Parameters 


Intel's 
82961 KO Printer Coprocessor 
provides 
the Intel i960 KA or i960 KB microprocessors 
with a powerful 
graphics 
accelerator 
and compression 
processor 
that dramatically 
increase 
system 
performance 
and reduce 
printer coprocessor 
system cost. This single chip device provides 
all necessary 
system control 
for the i960 KA 
or i960 KB microprocessors 
and a direct interface 
to most laser printer engines. 
The 82961 KO coprocessor 
contains 
complete 
ORAM, 
1/0 and interleaved 
ROM controllers, 
font cartridge 
support 
and the associated 
logic 
required 
to control 
most 
non-impact 
printer 
mechanisms, 
a programmable 
wait 
state 
generator 
and 
programmable 
chip select generation 
logic. 


The 82961 KO Printer Coprocessor 
performs 
all graphics 
functions 
necessary 
for complex 
page description 
language 
(POL) or printer control 
language 
(PCL) controllers. 
Image compression 
is achieved 
using "Scanline 
Tables". 
Memory 
requirements 
for storage of bit mapped 
images-such 
as character 
font cache and graphics 
objects-is 
significantly 
reduced 
using these structures. 


The 82961 KO coprocessor 
processes 
a compressed 
display list to form the bit mapped 
image of the page to 
be printed. 
The 82961 KO coprocessor 
automatically 
supports 
"Band 
buffered" 
print operations. 
The chip's 
compressed 
display list, coupled with its fast graphics 
operations, 
allow band buffered 
printing of very complex 
page description 
language 
(POL) pages such as those 
PostScript 
generates. 


·The 
82961 KO i960 Printer 
Coprocessor 
is based 
on the single·chip 
controller 
architecture 
created 
by Peerless 
Systems. 
Corp. 
• "Tbrouqhout 
this data sheet, 
80960Kx 
refers 
to the 80960KA 
and KB processors. 


PostScript 
is a registered 
trademark 
of Adobe 
Systems, 
Inc. 
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1.0 
PURPOSE 


This data sheet describes the 82961KO Printer Co- 
processor. This coprocessor focuses primarily on 
the non-impact printer marketplace, although many 
of its features make it extremely applicable for other 
applications such as FAX and newer multifunction 
peripherals. The data sheet provides the information 
required to begin designing with the 82961KO Print- 
er Coprocessor; such as functional and physical de- 
scriptions, electrical characteristics and specifica- 
tions, absolute maximum ratings and package and 
pin definitions. 


2.0 
82961KD FUNCTIONAL 
OVERVIEW 


The 82961KO coprocessor integrates a powerful 
graphics accelerator and compression processor on 
a single chip. It provldes all necessary system con- 
trol for the i960 KA or i960 KB microprocessors to 
directly interface to most laser printer engines. The 
82961KO coprocessor's functional blocks include: 
• i960 K-Series .L-BusInterface 


• ROM Interface 
• DRAM Interface 


• 
1/0 Interface 
• Print Engine Video Controller (PVC) 
• Printer Communications Interface 
• Banding Coprocessor 
• Graphics Execution Unit (GEU) 


The 82961KO coprocessor contains two indepen- 
dent subsystems which simultaneously render and 
print page images: the Graphics Execution Unit 
(GEU) and the Print Engine Video Controller (PVC). 
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The GEU executes a display list of graphics orders 
and renders a page image; the PVC transfers the 
rendered page image data to the laser printer en- 
gine. The 82961KO coprocessor processes a com- 
pressed display list which forms the bit mapped im- 
age of the page to be printed. The 82961KOcoproc- 
essor's compressed display list, coupled with its fast 
graphics operations, allows band buffered printing of 
very complex PDL pages such as those PostScript 
generates. 


3.0 
PACKAGE INFORMATION 


3.1 
PQFP Package Introduction 


This section describes the pins, pinouts and thermal 
characteristics for the 82961KO coprocessor in the 
164-pin Plastic Quad Flat Pack (PQFP). The plastic 
package uses fine-pitch gull wing leads arranged in 
a single row along the perimeter of the package with 
0.025 inch (0.64 mm) spacing. (See Figure 2. Dimen- 
sions are given in Section 3.4, Mechanical 
Data.) 


The PQFP is normally surface mounted to take best 
advantage of the plastic package's small footprint 
and low cost. In some applications, however, de- 
signers may prefer to use a socket, either to improve 
heat dissipation or reduce repair costs. vcc and 
GND connections must be made to multiple vcc 
and GND pins. Each vcc and GND pin must be con- 
nected to the appropriate voltage or ground and ex- 
ternally strapped close to the package. We recom- 
mend that you include separate power and ground 
planes in your circuit board for power distribution. 


For complete package specifications and informa- 
tion, see the Intel Packaging Specification (Order 
#240800). 


"@) 
~ 
© 
©e 


~ 
l§J 
~ 
IiiiiJ-eI 


Co) 
00-..j-..j 


."Cc; 
:-" 
!! 
on~ 
Cii 
IQ; 
3 


_ 
CONTROL 


ADDRESS 


CLK2 
MICRO 


READ 
DATA 
ADDRESS/ 


ORA 10:00 


RESET 
PROCESSOR 
WRITE 
DATA 
DATA 
t.4AD31:16 


INTERFACE 
PATHING 
t.4D15:00 
~ 


~ 
IRQ 
ROt.4CS1:0 


R0t.4SEL1:0 
~ 


ROt.4CHAN7:0 
- 
ADDRESS 
I/OCHAN7:0 
ROM 
ROt.40EE 


J----.J 
COMPARATOR 
RAt.4CHAN7:0 
INTERFACE 
ROt.40EO 


BANDING 
~ 


ROt.4AOE - 


COPROCESSOR 


ROt.4AOO 


I-- 


I- ~ 
IOCS 


I/O 
lORD 


INTERFACE 
IOWR 


IP RQST 


IP GRNT 


BAND 
RQST 
ARBITER 
•....- 


RAS3:0 


BAND 
GRNT 
DRAM 
CAS3:0 
~ 


INTERFACE 
WE 
-- 
- 
PRINT 


fSYNC 
- 
PRINTER 


PRINT 
RQST 
VIDEO 
LSYNC 


I 
PRINT 
GRNT 


INTERFACE 
VIDEO 


- 


VCLK 
~ 


CBSY 


SBSY 
..• 


PRINTER 
COMMUNICA TlON 
Ct.4D/STS 


INTERFACE 
STS 


CCLK 
- 


ADDR/DATA 


11 


272221-1 


--er 


@ 


C» 
N 
IQ 
Q)..•.,..o 


82961KD 
intel~ 


3.2 
Pin Descriptions 
Table 
1. Pin Description 
Nomenclature 


The 82961 KO coprocessor 
pins are described 
in this 
section. 
Table 
1 presents 
the legend for interpreting 
pin descriptions 
contained 
in the 
following 
tables. 
Table 2 defines 
pins associated 
with the L-Bus. Ta- 
bles 
3 and 4 define 
pins 
associated 
with 
memory 
and I/O interface 
signals. Table 5 defines 
pins asso- 


ciated 
with the printer 
video 
interface. 
Table 
6 de- 
fines 
printer 
communications 
interface 
signals. 
Ta- 
ble 7 defines 
pins associated 
with basic processor 
configuration 
and control. 


Symbol(1) 
Description 
I 
Pin is Input Only 


0 
Pin is Output Only 


I/O 
Pin can be Either an 
Input or Output 


L 
Signal is Active LOW 


L(P) 
Signal is Active LOW and the 
Signal's 
Sense is Programmable 


H 
Signal is Active HIGH 


TS 
Signal is Tri-State 


OD 
Signal is Open-Drain 
Output 


ST 
Signal is Schmitt Trigger 
Input 


NOTE: 
1. Bold indicates 
a pin; non-bold 
indicates 
a signal: 


Name 
Type 
Description 


LAD31:0 
I/O 
LOCAL ADDRESS/DATA 
BUS carries 32-bit physical address and data to and from 


H 
memory or I/O devices or internal registers. 
During an address cycle, bits 31:2 


TS 
contain a physical word address and bits 1:0 contain a burst size. During a data cycle, 
bits 31:0 contain read or write data. The burst size is an encoded 
number where 00 
represents 
one word and 11 represents 
four words. 


ALE 
I 
ADDRESS 
LATCH ENABLE indicates 
the transfer 
of a physical address. ALE is 
L 
asserted 
during an address cycle and deasserted 
before a data cycle begins. 


ADS 
I 
ADDRESS/DATA 
STATUS 
indicates 
an address state. It is asserted 
during an 
L 
address cycle and deasserted 
during a following 
data cycle. For a burst transaction, 


ADS is asserted 
again in every data cycle where READY was asserted 
in the 
previous cycle. 


W/R 
I 
WRITE/READ 
specifies, 
during an address cycle, whether 
the operation 
is a write or 
a read. Valid during the entire memory register or I/O transfer. 


DEN 
I 
OAT A ENABLE indicates 
transfer 
on the LAD lines. Asserted 
during all data cycles. 


L 


READY 
0 
READY indicates 
data on the LAD signals can be sampled 
or removed. 
If READY is 
OD 
not asserted 
during a data transfer, 
the data cycle is extended 
to the next cycle by 
inserting 
a wait state and ADS is not asserted 
in the next cycle. 


BE3:0 
I 
BYTE ENABLE signals specify data bytes on the bus associated 
with the current 
L 
data cycle. BEO corresponds 
to LAD7:0. BE3 corresponds 
to LAD31 :24. BE signals 
are pipelined. 
BE signals corre~onding 
to the first data transfer 
of a burst are 
asserted 
in the address cycle. BE signals corresponding 
to subsequent 
data 
transfers 
of a burst are asserted 
in the data cycle of the current data transfer. 


IRQ 
0 
INTERRUPT 
REQUEST 
indicates 
an interrrupt 
to the i960 Kx processor. 
When 
L 
82961 KO coprocessor 
interrupt 
is cleared, the signal goes inactive for two bus cycles 


OD 
before it can be reasserted. 


Table 2. L-Bus Interface 
Signals 


3-378 


intel~ 
82961KD 


Table 
3. Memory 
Interface 
Signals 


Name 
Type 
Description 


MD15:0 
I/O 
MEMORY 
DATA signals contain the least significant 
two data bytes for I/O and 
H 
DRAM operations. 
MD7:0 contain the least significant 
byte; MD15:8 contain the 


TS 
second 
most significant 
byte. If the 1/0 channel 
being accessed 
is packed, 
MD7:0 


also contain the third most significant 
byte and MD15:8 also contain the most 


significant 
byte. 


MAD31:29 
I/O 
MEMORY/ADDRESS 
OAT A signals are multi-purpose 
depending 
on the device 


H 
that is selected: 


TS 
I/O 
The signals output an encoded 
address range selected. 


DRAM 
The signals contain data bits 29-31 
for 32-bit DRAM mode. 


MAD28:16 
I/O 
MEMORY/ADDRESS 
DATA signals are multi-purpose 
depending 
on the device 


H 
that is selected: 


TS 
I/O 
The signals MAD27:16 
output address bits 11-23. 


DRAM 
The signals contain data bits 16-28 
for 32-bit DRAM mode. 


IOA10:1 
0 
I/O ADDRESS 
signals are used by the I/O device: 


H 
1/0 
The signals output address 
bits 01-10. 


DRA10:1 
0 
DRAM ADDRESS 
signals output 11 bits of a multiplexed 
address for DRAM 
H 
accesses. 
DRA8:0 are used for 256 Kbit x 16/32 
DRAM; DRA9:0 are used for 


1 Mbit x 16/32 
DRAM; ORA 10:0 are used for 4 Mbit x 16/32 
DRAM. 


RAS3:0 
0 
ROW ADDRESS 
STROBE 
signals are used for DRAM accesses 
and are asserted 
L 
when the ORA 10:0 signals contain a valid row address. 


Non-Encoded: 
RASO corresponds 
to the first bank of DRAM; RAS3 corresponds 


to the fourth bank of DRAM. Only one of the four RAS signals is 
asserted 
during a memory operation. 


Encoded: 
RASO enables the external 
RAS decoder 
for banks 0-3. 
RAS1 


enables the external 
RAS decoded 
for banks 4-7; 
RAS2 and 


RAS3 select 1 of 4 banks. When RASO and RAS1 are both true, 
a refresh cycle is indicated 
and the external 
RAS decoded 


asserts RAS for banks 0- 7. 


CAS3:0 
0 
COLUMN 
ADDRESS 
STROBE 
signals, used for DRAM accesses, 
are asserted 
L 
when the ORA 10:0 signals contain 
a valid column address. CASO corresponds 
to 


byte 0 (bits 7:0) of the DRAM banks; CAS3 corresponds 
to byte 3 (bits 31 :24) of the 
. 
DRAM banks. One to four CAS signals are asserted 
during a memory operation . 


WE 
0 
WRITE ENABLE 
signal, used for DRAM accesses, 
is asserted 
at the beginning 
of a 
L 
write cycle in advance 
of the RAS and CAS outputs and is deasserted 
at the end of 


a write cycle. 


ROMAOE 
0 
ROM ADDRESS 
BIT 0 EVEN, ROM ADDRESS 
BIT 0 ODD are the least significant 
ROMAOO 
H 
ROM address bits. When ROM is interleaved, 
ROMAOE is used by the even half of 


the bank; ROMAOO is used by the odd half of the bank. When ROM is not 
interleaved, 
ROMAOE is address bit 2; ROMAOO is address bit 3. 
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Table 3. Memory 
Interface 
Signals (Continued) 


Name 
Type 
Description 


ROMCS1:0 
1/0 
ROM CHIP SELECT signals indicate 
an access to one of the eight ROM banks. 
L 
ROMCSO is used for ROM banks 0-3; 
ROMCS1 for ROM banks 4-7. 
While 


TS 
ROMCSO or ROMCS1 is asserted, 
the encoded 
ROM bank select signals 
ROMSEL 1:0 determine 
the particular 
ROM bank to be accessed. 
The 
ROMSEL 1:0 signals are guaranteed 
to be valid during the entire time that 
ROMCS1:0 
is asserted. 
ROMCSO and ROMCS1 are never asserted 
at the same 
time. These signals are valid during the entire memory operation. 
If the system 
contains 
only two ROM banks, ROMSEL 1:0 are not needed and ROMCS1:0 
could be used directly, thus saving external decoder 
logic. 


During reset, ROMCSO is used as an input to indicate the ROM bank 0 interleave 
configuration. 
If this pin is low during reset, ROM bank 0 configuration 
is 
interleaved. 
If this pin is high, ROM bank 0 configuration 
is non-interleaved. 


During reset, ROMCS1 
is used as an input to clear the ROM bank 0 size field to 
zero, disabling 
it. This allows for external 
ROM and control 
circuit at address zero. 


If this pin is low during reset, ROM bank 0 is disabled. 
If this pin is high, ROM 
bank 0 is enabled as normal. 


ROMSEL1:0 
0 
ROM BANK SELECT signals are used to select one of four banks of ROM. They 
H 
are used with the ROMCS 1:0 signals to select an access to one of the eight ROM 
banks. The ROMSEL 1:0 signals are guaranteed 
to be valid during the entire time 
that ROMCS1:0 
is asserted. 
The signals are valid during the entire memory 
operation. 


ROMOEE 
0 
ROM OUTPUT 
ENABLE EVEN, ROM OUTPUT 
ENABLE ODD corresponds 
to. 


ROMOEO 
L 
the i960 Kx DEN signal and is asserted 
during all data cycles to any of the eight 
ROM banks. When ROM is interleaved, 
ROMOEE 
is used to enable the even half 
of the bank and ROMOEO 
is used to enable the odd half of the bank. When ROM 
is not interleaved, 
ROMOEE 
and ROMOEO 
are identical 
and one or both could 
be used. 


Table 4. 1/0 Interface 
Signals 


Name 
Type 
Description 


10CS 
0 
I/O CHIP SELECT indicates 
an access to one of the eight 1/0 device address 
ranges. It 
L 
is asserted 
at the beginning 
of the transfer 
operation 
and remains asserted 
for the 
programmed 
duration of the 1/0 device selected. 
While roes is asserted, 
the encoded 
address range select signals MAD31 :29 determine 
the particular 
1/0 channel 
to be 
accessed. 
The MAD31 :29 signals are guaranteed 
to be valid during the entire time that 
IOCS is asserted. 


10WR 
0 
1/0 WRITE indicates 
a write operation 
to one of the eight 1/0 device address 
ranges. It 
L 
is asserted 
a programmed 
duration after iOCS is asserted 
and remains asserted 
for a 
programmed 
duration. 
It is asserted 
for 1/0 write operations 
only. 


lORD 
0 
I/O READ indicates 
a read operation 
to one of the eight 1/0 device address ranges. It is 
L 
asserted 
a programmed 
duration after TOCS is asserted 
and remains asserted 
for a 
programmed 
duration. 
It is asserted 
for 1/0 read operations 
only. 
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Table 5. Printer Video Interface 
Signals 


Name 
Type 
Description 


PRINT 
0 
PRINT REQUEST 
signals the printer engine to begin its print operation. 
The PRINT 
L(P) 
signal is a copy of an internal register bit and follows the programming 
sense of this bit. 


For non-printer 
applications, 
this signal can be used as a general purpose 
output. 


FSYNC 
I 
FRAME SYNC indicates 
the printer engine has begun to print and that the medium is 
L(P) 
positioned 
at the top of the page. When this signal is asserted, 
the 82961 KO 


ST 
coprocessor 
begins sampling the LSYNC signal. 


LSYNC 
I 
LINE SYNC indicates 
that the printer engine has begun to move the medium and the 
L(P) 
imaging circuitry is positioned 
at the left page position. 
Each time this signal is 


ST 
asserted, 
the 82961 KO coprocessor 
either counts down the top margin size or initiates 
a scanline transfer 
of video data. 


VIDEO 
0 
VIDEO is the digital serial video data stream. It is driven after each assertion 
of the 
(P) 
LSYNC signal after any top margin size is counted. 
By default, a high on this signal 
indicates 
"white 
space" 
and a low on this signal indicates 
"black 
space". 


VCLK 
I 
VIDEO CLOCK is the video shift rate clock. If the printer engine supplies a video shift 
rate clock, it is presented 
on this pin. If the printer engine does not supply a video shift 
rate clock, an 8X video shift rate clock must be presented 
on this pin and the 82961 KO 
coprocessor 
must be programmed 
for phase locked loop operation. 
In ghase locked 
loop operation, 
the internal video shift rate clock is locked to the LSYN 
signal active 
edge. By default, video shift rate clock VCLK falling edge shifts a bit of video data. 
El 


Table 6. Printer Communications 
Interface 
Signals 


Name 
Type 
Description 


CBSY 
0 
COMMAND 
BUSY indicates 
that the 82961 KO coprocessor 
has a command 
to 
L 
transmit to a printer engine. CBSY is asserted 
when the 82961 KO coprocessor's 


OD 
printer command 
register is written; it remains asserted 
until all command 
data is 
sent. 


SBSY 
I 
STATUS 
BUSY indicates 
that a printer engine has status to transmit 
to the 
L 
82961 KO coprocessor. 
When this signal is asserted, 
the 82961 KO coprocessor 


ST 
assembles 
the status byte in the 82961 KO eo processor's 
printer status register 
using eight transitions 
of CCLK. Note that the CCLK may come from the 82961 KD 
coprocessor 
or the printer engine, depending 
on the CCLK mode that is 
programmed. 


CMD/STS 
I/O 
COMMAND/STATUS 
DATA is programmable 
to be either command 
data output to 
L 
the printer engine or bidirectional 
command/status 
data to/from 
the printer. The 


TS 
command 
data is an 8-bit serial command 
stream to a printer engine. After ~ 
has been asserted 
by writing the 82961 KO coprocessor'ctinter 
command 
register, 


each command 
bit-accompanied 
by a transition 
of the 
LK signal-is 
presented 
on this signal. 


STS 
I 
STATUS 
DATA is an 8-bit serial status from a printer engine. After S8S( has been 
L 
asserted, 
the engine presents 
each status bit on this signal with each CCLK 


ST 
transition 
from the 82961 KO co processor. SfS must be deasserted 
and reasserted 
to begin a second 8-bit status message. 


CCLK 
I/O 
COMMAND 
CLOCK is programmable 
to be either an input clock from the printer 
L 
engine or an output clock to the printer engine. It causes a printer engine to 


TS 
assemble 
an 8-bit command 
or transmit 
an 8-bit status one bit at a time with each 
transition 
of this signal. Each bit is shifted on a falling edge of CCLK. 


NOTE: 
The Printer 
Communication 
Interface 
signals 
are not used on a TEC or other 
parallel 
command/status 
engines. 
Refer to the 
User's 
Manual 
for a description 
of interfacing 
to the TEC engine 
family. 
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Table 
7. Processor 
Control 
Signals 


Name 
Type 
Description 


CLK2 
I 
SYSTEM 
CLOCK 
provides 
the fundamental 
timing for the 82961 KO coprocessor. 
It is 
twice the frequency 
of an i960 Kx address cycle. 


RESET 
I 
RESET clears 82961 KO co processor 
logic and causes it to initialize all internal 
registers. 


Vcc 
I 
SYSTEM 
POWER connections 
consist of 18 pins which must be connected 
externally 
to a vcc board plane. 


Vss 
I 
SYSTEM GROUND 
consists 
of 19 pins which must be connected 
externally 
to a Vss 
board plane. 


Table 
8. 82961KD 
1/0 Interface 


Packed 
Unpacked 
I/O 
Signal 
I/O 
Address 
Address 


AO 
IOA1 
- 


A1 
IOA2 
AO 


A2 
IOA3 
A1 


A3 
IOA4 
A2 


A4 
IOAS 
A3 


AS 
IOA6 
A4 


A6 
lOA? 
AS 


A? 
IOA8 
A6 


A8 
IOA9 
A? 


A9 
IOA10 
A8 


A10 
MA016 
A9 


A11 
MA01? 
A10 


A12 
MA018 
A11 


A13 
MA019 
A12 


A14 
MA020 
A13 


A1S 
MA021 
A14 


A16 
MA022 
A1S 


A17 
MA023 
A16 


A18 
MA024 
A1? 


A19 
MA02S 
A18 


A20 
MA026 
A19 


A21 . 
MA02? 
A20 
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3.3 
82961KD PQFP Pinout 


Tables 
9 and 10 list the 82961 KO coprocessor 
pin 
names with package 
location. 
See Section 
5, Elec- 
trical 
Specifications 
for specifications 
and recom- 


mended 
connections. 
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Table 9. PQFP Pin Name with Package 
Location 
(Signal Order) 


L-Bus 


Name 
Location 


LAD31 
1 


LAD30 
2 


LAD29 
3 


LAD28 
4 


LAD27 
6 


LAD26 
7 


LAD25 
8 


LAD24 
9 


LAD23 
11 


LAD22 
12 


LAD21 
13 


LAD20 
14 


LAD19 
16 


LAD18 
17 


LAD17 
19 


LAD16 
20 


LAD15 
22 


LAD14 
23 


LAD13 
25 


LAD12 
26 


LAD11 
28 


LAD10 
29 


LAD9 
30 


LAD8 
31 


LAD7 
33 


LAD6 
34 


LAD5 
35 


LAD4 
36 


LAD3 
38 


LAD2 
39 


LAD1 
40 


LADO 
41 


ALE 
49 


ADS 
47 


W/R 
51 


DEN 
48 


L-Bus (Continued) 


Name 
Location 


READY 
56 


BE3 
42 


BE2 
43 


BE1 
44 


BEO 
45 


iRQ 
52 


I/O 


Name 
Location 
roes 
73 


IOWR 
74 


lORD 
75 


IOA10 
86 


IOA9 
85 


IOA8 
84 


IOA7 
83 


IOA6 
82 


IOA5 
81 


IOA4 
80 


IOA3 
79 


IOA2 
78 


IOA1 
77 


ROM 


Name 
Location 


ROMCS1 
57 


ROMCSO 
58 


ROMSELO 
59 


ROMSEL1 
60 


ROMAOE 
64 


ROMAOO 
65 


ROMOEE 
62 


ROMOEO 
63 


Memory 


Name 
Location 


MD15 
110 


MD14 
111 


MD13 
112 


MD12 
113 


MD11 
115 


MD10 
116 


MD9 
117 


MD8 
118 


MD7 
120 


MD6 
121 


MD5 
122 


MD4 
123 


MD3 
124 


MD2 
125 


MD1 
126 


MDO 
127 


MAD31 
88 


MAD30 
89 


MAD29 
90 


MAD28 
91 


MAD27 
93 


MAD26 
94 


MAD25 
95 


MAD24 
96 


MAD23 
98 


MAD22 
99 


MAD21 
101 


MAD20 
102 


MAD19 
104 


MAD18 
105 


MAD17 
107 


MAD16 
108 


DRAM 


Name 
Location 


DRA10 
144 


DRA9 
142 


DRA8 
141 


DRA7 
139 


DRA6 
138 


DRAS 
136 


DRA4 
135 


DRA3 
133 


DRA2 
132 


DRA1 
130 


DRAO 
129 


RAS3 
148 


RAS2 
151 


RAS1 
154 


RASO 
157 


CAS3 
149 


CAS2 
152 


CAS1 
155 


CASO 
158 


WE 
146 


Printer Video 


Name 
Location 


PRINT 
161 


FSYNC 
164 


LSYNC 
163 


VIDEO 
160 


VCLK 
162 
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Table 
10. PQFP Pin Name with 
Package 
Location 
(Pin Order) 


Pin 
Signal 
1 
LAD31 


2 
LA030 


3 
LAD29 


4 
LAD28 


5 
VSS 


6 
LAD27 


7 
LAD26 


8 
LAD25 


9 
LAD24 


10 
Vcc 


11 
LAD23 


12 
LAD22 


13 
LAD21 


14 
LAD20 


15 
Vss 


16 
LAD19 


17 
LAD18 


18 
Vcc 


19 
LAD17 


20 
LAD16 


21 
Vcc 


22 
LAD15 


23 
LAD14 


24 
Vss 


25 
LAD13 


26 
LAD12 


27 
Vss 


28 
LAD11 


29 
LAD10 


30 
LAD9 


31 
LAD8 


32 
Vcc 


33 
LAD7 


34 
LAD6 


35 
LAD5 


36 
LAD4 


37 
Vcc 


38 
LAD3 


39 
LAD2 


40 
LAD1 


41 
LADO 


Pin 
Signal 


42 
BE3 


43 
BE2 


44 
BE1 


45 
BEO 


46 
RESET 


47 
ADS 


48 
DEN 


49 
ALE 


50 
Vss 


51 
W/R 


52 
IRQ 


53 
Vss 


54 
·CLK2 


55 
Vss 


56 
READY 


57 
ROMCS1 


58 
ROMCSO 


59 
ROMSELO 


60 
ROMSEL1 


61 
Vcc 


62 
ROMOEE 


63 
ROMOEO 


64 
ROMAOE 


65 
ROMAOO 


66 
Vcc 


67 
CBSY 


68 
CCLK 


69 
CMD/STS 


70 
STS 


71 
SBSY 


72 
Vss 


73 
IOCS 


74 
IOWR 


75 
lORD 


76 
Vcc 


77 
IOA1 


78 
IOA2 


79 
IOA3 


80 
IOA4 


81 
IOA5 


82 
IOA6 


Pin 
Signal 


83 
IOA7 


84 
IOA8 


85 
IOA9 


86 
IOA10 


87 
Vss 


88 
MAD31 


89 
MAD30 


90 
MAD29 


91 
MAD28 


92 
Vss 


93 
MAD27 


94 
MAD26 


95 
MAD25 


96 
MAD24 


97 
Vcc 


98 
MAD23 


99 
MAD22 


100 
Vcc 


101 
MAD21 


102 
MAD20 


103 
. vcc 


104 
MAD19 


105 
MAD18 


106 
Vss 


107 
MAD17 


108 
MAD16 


109 
Vcc 


110 
MD15 


111 
MD14 


112 
MD13 


113 
MD12 


114 
Vss 


115 
MD11 


116 
MD10 


117 
MD9 


118 
MD8 


119 
Vcc 


120 
MD7 


121 
MD6 


122 
MD5 


123 
MD4 


Pin 
Signal 


124 
MD3 


125 
MD2 


126 
MD1 


127 
MDO 


128 
Vss 


129 
DRAO 


130 
DRA1 


131 
Vcc 


132 
DRA2 


133 
DRA3 


134 
Vss 


135 
DRA4 


136 
DRA5 


137 
Vcc 


138 
DRA6 


139 
DRA7 


140 
Vcc 


141 
DRA8 


142 
DRA9 


143 
Vss 


144 
DRA10 


145 
Vcc 


146 
WE 


147 
Vss 


148 
RAS3 


149 
CAS3 


150 
Vss 


151 
RAS2 


152 
CAS2 


153 
Vcc 


154 
RAS1 


155 
CAS1 


156 
Vss 


157 
RASO 


158 
CASO 


159 
Vcc 


160 
VIDEO 


161 
PRINT 


162 
VCLK 


163 
LSYNC 


164 
FSYNC 


• 
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3.4 
Mechanical 
Data 


Package Dimensions and Mounting 


Table 11. Package 
Dimensions: 
82961KD 
PQFP 


Inches 
mm 
Symbol 
Description 
Mln 
Max 
Mln 
Max 


N 
Lead Count 
164 Leads 
164 Leads 


A 
Package 
Height 
0.160 
0.180 
4.06 
4.57 


A1 
Standoff 
0.020 
0.040 
0.51 
1.02 


D,E 
Terminal 
Dimension 
1.270 
1.290 
32.26 
32.77 


D1, E1 
Package 
Body 
1.147 
1.153 
29.13 
29.29 


D2,E2 
Bumper Distance 
1.297 
1.303 
32.94 
33.09 


D3,E3 
Lead Dimension 
1.000 REF 
25.40 REF 


D4,E4 
Foot Radius Location 
1.223 
1.237 
31.06 
31.41 


L1 
Foot Length 
0.020 
0.030 
0.51 
0.76 


E2 
El 
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mm 
(inch) 


Figure 2. Principle 
Dimensions 
of the 164·Lead 
PQFP 


I 
BASE PLANE 
--r--.;-1 Jf- A 1 
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~------------------D2------------------- 


DETAIL 
M 


f 


w-+------,--.-----l 
t 
2.03 
(0.080) 
1.93 (0.076) 


Figure 3. Details of the 164-Lead 
PQFP Moldlng 
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0.31 (0.012) 
I I 
0.20 (O.OOS)....., r- 


• 
!! 
• 


0.20 (O.OOS) 
0.1 -4 (0.005) 


mm 
(inch) 
DETAIL J 
DETAIL L 
272221-4 


Figure 4. Terminal 
Details for the 164-Lead 
PQFP 


3.5 
Package Thermal Specifications 


Table 12. 82961KD 
PQFP Package 
Thermal 
Characteristics 


272221-5 


PQFP Thermal 
Reslstance-°C/W 


Alrflow-ft.lmln. 
(m/sec) 


Parameter 
0 
50 
100 
200 
400 
600 
800 
(0) 
(0.25) 
(0.50) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 


8 Junction-to-Case 
7 
7 
7 
7 
7 
7 
7 


8 Case-to-Ambient 
20 
17 
16 
14 
9 
8 
7 
(No Heatsink) 


NOTES: 
1. This table applies 
to 82961KA 
PQFP soldered 
directly 
into board. 
2. ()JA = ()JC + ()CA· 
3. ()JL = 18'C/W 
()JS = .18'C/W 
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4.0 
ELECTRICAL 
SPECIFICATIONS 


4.1 
Power and Grounding 


Power and ground connections must be made to all 
82961KO coprocessor power and ground pins. On 
the circuit board, all vcc pins must be strapped 
closely together. Similarly, all Vss pins should be 
strapped closely together. It is strongly recommend- 
ed that vcc pins be connected to a common power 
plane and Vss pins be connected to a common 
ground plane in the PC board. 


4.2 
Power Decoupllng 
Recommendations 


Oecoupling capacitors should be placed near the 
82961KO coprocessor. The coprocessor can cause 
transient power surges when multiple, loaded out- 
puts switch simultaneously. Proper power decou- 
pling is required to avoid "ground lift" or "ground 
bounce" induced by these power surges. 


Low inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. Inductance can be reduced by shortening 
the PC board traces between the processor and de- 
coupling capacitors as much as possible. 


82961KD 


4.3 
Signal Termination 
Recommendations 


Output 
drivers 
for 
RAS3:0, 
CAS3:0, 
WE 
and 
DRA10:0 are designed to directly drive the heavy 
capacitive loads of DRAM arrays. To prevent out- 
puts from ringing in the system, it is necessary to 
match the output driver's output impedance to that 
of the DRAM array. This is accomplished by placing 
"a resistor in series with each signal. Place the series 
resistor near the 82961KO coprocessor. Resistor 
value is dependent on DRAM loading and is best 
determined by experimentation at prototype level. 


All open-drain.outputs require a pull-Up termination 
connected to the output pin. Signals READY, iFi<:i 
and CBSY are open-drain outputs on the 82961KD 
coprocessor. 


While in most cases a simple pull-up resistor is ade- 
quate, a network consisting of pull-up and pull-down 
resistors may be necessary for the READY pin, 
since timing on this signal is critical. 


Figure 5 shows recommendations for low and high 
current drive network which assumes the circuit 
board has a characteristic impedance of 100n. The 
resistor network should bias the output to a valid 
HIGH level (VIH 2 2.0V). To minimize signal reflec- 
tion, termination should be placed close to the end 
of the PC board trace. Pull-up and pull-down resistor 
value should be chosen such that network imped- 
ance closely matches the characteristic impedance 
of the PC board trace. 


Figure 5 also shows a simple pull-up termination 
which can be used to terminate open-drain outputs. 


• 
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Network 
Termination 


1300 


Simple Terminationd3 


600 


OPEN-DRAIN 
OUTPUT 
OPEN-DRAIN 
OUTPUT 


2800 
272221-7 


272221-6 
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Figure 5. Recommended 
Termination 
of Open Drain Outputs 
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5.0 
ABSOLUTE MAXIMUM RATINGS 


Table 
13. Absolute 
Ratings 
82961KD-20, 
-16 
(20 MHz, 16 MHz Specification) 


Storage Temperature 
- 65°C to + 150°C 


Case Temperature 
Under Bias 
- 65°C to + 110°C 


Supply Voltage 
with Respect 
to VSS 
-0.5V 
to + 6.5V 


Voitage 
on Other 
Pins 
with Respect 
to VSS 
-0.5V 
to VCC + 0.5V 


Table 
14. Operating 
Conditions 
82961KD-20, 
-16 (20 MHz, 16 MHz Specification) 


Supply Voltage 
(Vcd 
82961 KD-20 
4.75V to 5.25V 
82961 KD-16 
.4.5V to 5.5V 


Input Clock (CLK2) 
Frequency 
(fd 
82961 KD-20 
1 MHz to 32 MHz 
82961 KD-16 
1 MHz to 20 MHz 


Case Temperature 
Under Bias (Td 
82961 KD-20 
O°C to 85°C 
82961 KD-16 
O°C to 85°C 


NOTICE: 
This 
document 
contains 
information 
on 
products in the design phase of development. 
Do not 
finalize a design with this information. 
Revised infor- 


mation will be published when the product is avail- 
able. Verify with your local Intel Sales office that you 
have the latest data sheet before finalizing a design. 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause 
permanent 
damage. 
These are stress ratings 
only. Operation 
beyond 
the 


"Operating 
Conditions" 
is not recommended 
and ex- 


tended exposure 
beyond 
the 
"Operating 
Conditions" 


may affect device reliability. 


6.0 
TARGETED DC SPECIFICATIONS 


Table 
15. DC Characteristics 


82961KD-20, 
-16 (20 MHz, 16 MHz Specification) 


Symbol 
Parameter 
Min 
Max 
Units 
Test Conditions 


Vll 
Input Low Voltage 
-0.3 
0.8 
V 
(Note 1) 


VIH 
Input High Voltage 
2.0 
VCC + 0.3 
V 
(Note 1) 


VOl 
Output Low Voltage 
0.45 
V 
(Note 2) 


VOH 
Output High Voltage 
2.4 
V 
(Note 3) 


tee 
Power Supply Current 
82961KD-20 
TBD 
mA 
82961 KD-16 
TBD 
mA 


III 
Input Leakage Current 
±10 
p.A 
o 5: VIN 5: vcc 


IOl 
Output Leakage Current 
±10 
p.A 
0.45V 5: Vo 5: VCC 


CIN 
Input Capacitance 
8 
pF 
tc = 1 MHz(5) 


CIO 
1/0 or Output Capacitance 
10 
pF 
fc = 1 MHz(5) 


NOTES: 
1. RESET, FSYNC,[SY"NC, SBSY and STS are Schmitt trigger inputs. Hysteresis on these pins is approximately 200 mV, but 
is not tested for each device. 


2. VOl is measured under the following conditions: 
IOl = 6mA 
LAD31:0, MAD31:16, MD15:0, IOA10:1,lOCS, iORD, IOWA, ROMAOE,ROMADO, ROMOEE, ROMOEO, 
ROMSEl1:0, ROMCS1:0, RAS3:0, CAS3:0, WE, DRA10:0, iRQ 
IOl = 12 mA 
CCO<,CMD/STS, CBSi', ViDEO, PRINT 
IOl = 20 mA 
READY 
3. VOHis measured under the following conditions: 


IOH= 6 mA 
LAD31:0, MAD31:16, MD15:0, IOA10:1, iRQ, 
iOCS, lORD, IOWR, ROMAOE, ROMAOO, ROMOEE, 


ROMOEO, ROMSEL1:0, ROMCS1:0, RAS3:0, CAS3:0, WE, DRA10:0 
IOH= 12 mA 
CCO<,CMD/STS, VIDEO, PRINT 
VOHis not measured for open-drain outputs, IRQ, READY and CBSi' 
4. Measured at worst case frequency, vcc and temperature, with device operating and outputs loaded to the test conditions 


shown in Figure 8. 


5. Capacitance values are not tested. 
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Figure 6. Typical Supply Current (Ice) vs Frequency (fe) 
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Figure 7. Typical Supply Current (Ice) vs Supply Voltage (Vee) 
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7.0 
TARGETED AC CHARACTERISTICS 


7.1 
AC Specification 
Tables 


27222'-'0 


TRI-STATE 0-----, 
OUTPUT 
..L 


~Ct 


OPEN-DRAIN 
OUTPUT 
~ 
l 
I~ 


~Ct 


Figure 
8. AC Test 
Loads 
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Signal Group 
e 


@ 
o 
o 


50 
100 


Capacitive Load (pr) 


150 


Group 
1 
LAD31:0, 
MD15:0, 
MAD31:16, 
IOWR, 
lORD, ~, 
RC:5MOE, ROMCS1:0 
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9. Output 
Valid Delay (tov) 
vs Load Capacitance 


Table 
16. Input 
Clock 
Specifications 


82961KD-20 
(20 MHz Specification) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


fc 
CLK2 Frequency 
0 
40 
MHz 
(1) 


te 
CLK2 Period 
25 
00 
ns 
(1) 


teH 
CLK2 High Time 
10 
ns 
(1) 


teL 
CLK2 Low Time 
10 
ns 
(1) 


teR 
CLK2 Rise Time 
0 
6 
ns 
(1) 


teF 
CLK2 Fall Time 
0 
6 
ns 
(1) 


82961KD·16 
(16 MHz Specification) 


fc 
CLK2 Frequency 
0 
32 
MHz 
(1) 


te 
CLK2 Period 
31.25 
00 
ns 
(1) 


teH 
CLK2 High Time 
12.5 
ns 
(1) 


teL 
CLK2 Low Time 
12.5 
ns 
(1) 


teR 
CLK2 Rise Time 
0 
6 
ns 
(1) 


teF 
CLK2 Fall Time 
0 
6 
ns 
(1) 


NOTE: 
1. See Figure 
10 for waveforms 
and specifications. 
I~[Rl©[Q)(!JJ©tr~[Rl~~O~ 
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Figure 10. Clock Input Waveform 


Table 17. Synchronous 
Input and Output Specifications 


82961KD-20 
(20 MHz Specification) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


toy 
Output Valid Delay (Max. Value) 


toH 
Output Hold Delay (Min. Value) 


tovr. 
toHl 
LAD31:0 
5 
37 
ns 
(1.3.5) 


tovz. toH2 
LAD31:0 
(Reg. Read only) 
5 
59 
ns 
(1.5) 


toV3. toH3 
IRQ 
5 
37 
ns 
(1.2) 


toV4.toH4 
READY 
5 
37 
ns 
(1.2) 


toV5. toH5 
MD15:0 
5 
33 
ns 
<1.3.8) 


tove. tOH6 
MAD31:16 
5 
34 
ns 
(1.3.7) 


toV7. toH7 
MAD31 :29 (Auto-Poll 
only) 
5 
37 
ns 
(1.4.7) 


toV8. toH8 
DRA10:0 
5 
32 
ns 
(1.3.6) 


toV9. toH9 
RAS3:0. CAS3:0 
5 
26 
ns 
(1.3) 


tovlO. toHl0 
WE 
5 
27 
ns 
(1.2) 


toVll. 
toHll 
ROMSC1 :0. ROMSEL 1:0 
5 
24 
ns 
(1.2.13) 


toVlO. to-no 
ROMAOE. ROMAOO 
5 
24 
ns 
(1.2.14) 


toVlO. toHl0 
ROMOEE. 
ROMOEO 
5 
24 
ns 
(1.2) 


toVlO. toHl0 
IOA10:1 
5 
24 
ns 
(1.2) 


toV12. toH12 
IOCS. lORD. IOWR 
5 
27 
ns 
(1.3) 


toV13. toH13 
IOCS. lORD (Auto-Poll 
only) 
5 
31 
ns 
(1.3) 


tOV14. toH14 
PRINT 
28 
ns 
(1.2) 


toF 
Outut Float Delay 


toFl 
MD15:0 
33 
ns 
(1.9) 


toF2 
MAD31:16 
37 
ns 
(1.9) 
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Table 17. Synchronous 
Input and Output Specifications 
(Continued) 


82961KD-20 
(20 MHz Specification) 
(Continued) 
, 


Symbol 
Parameter 
Min 
Max 
Units 
Notes 


tlS 
Input Setup Time 


tlSl 
ADS, W/R, 
BE3:0, DEN 
28 
ns 
(1,2) 


tlS2 
LAD31:0 
10 
ns 
(1,10) 


tlS3 
MD15:0 
8 
ns 
(1, 11) 


tlS4 
MAD31:16 
8 
ns 
(1,12) 


tlS5 
RESET 
7 
ns 
(1,2) 


tlS6 
ROMCS1:0 
(on RESET) 
10 
ns 
(1,2) 


tlH 
input Hold Time 


tlHl 
ADS, W/R, 
BE3:0, DEN 
2 
ns 
(1,2) 


tlH2 
LAD31:0 
2 
ns 
(1, 10) 


tlH3 
MD15:0 
8 
ns 
(1, 11) 


tlH4 
MAD31:16 
8 
ns 
(1,12) 


tlH5 
RESET 
5 
ns 
(1,2) 


tlH6 
ROMCS1:0 
(on RESET) 
8 
ns 
(1,2) 


82961KD-16 
(16 MHz Specification) 


TBD 


NOTES: 
1. See Figure 11 for waveforms and specifications. 
2. Signal is synchronous to the edge of the CLK2 A edge. 
3. Signal is synchronous to the CLK2 A or C edge. 
4. Signal is synchronous to the CLK2 C edge. 
5. LAD31:0 are synchronous outputs when presenting data for an internal register read or any 110 read cycle in which rising 
lORD precedes rising lOGS. LAD15:0 are synchronous outputs for the first half of a packed 1/0 or DRAM access. For all 
other accesses, data is driven on LAD31:0 combinatorially and is described by the tLAMAparameter. 
6. The DRA10:0 signals are synchronous outputs when presenting the row address when the Print Engine Video Controller 
is being filled and when the Graphic Execution Unit is accessing DRAM. The DRA10:0 present a synchronous column 
address whenever switching from a row to a column address. DRA1:0 are synchronous during DRAM burst cycles and 
DRA8 is synchronous during 16-bit cycles. For all other accesses, DRA10:0 are driven combinatorially as described by the 
tLAMAparameter. 
7. The MAD31:16 signals are synchronous outputs for all 1/0 accesses or when the Graphic Execution Unit is writing DRAM. 
For all other accesses these signals are driven combinatorially as described in the tLAMAparameter. 
8. MD15:0 outputs are synchronous outputs during the second half of a packed 110 or DRAM write access or when the 
Graphic Execution Unit is writing DRAM. For all other accesses, data is driven combinatorially as described by the tLAMA 
parameter. 
9. For DRAM write accesses, MAD31:16 and MD15:0 are enabled and disabled synchronous to CLK2 A edge. 
Note for 1/0 accesses, the MAD31:29 signals are floated later on the CLK2 C edge to prevent glitches in external decoding 
logic. 
10. LAD31:0 are synchronous inputs only for internal register writes. 
11. MD15:0 are synchronous inputs on the first half of a packed 1/0 or DRAM read access, when the PVC is being filled or 
when the GEU is reading DRAM. 
12. MAD31:16 are synchronous inputs when the PVC is being filled and when the GEU is reading DRAM. 
13. Only trailing edges of ROMCS1:0 and ROMSEL1:0 are synchronous at the end of a cycle. At the beginning of a cycle, 
ROMCS1:0 are governed by tLHCL(ALE high to ROMCS1:0 low) and ROMSEL1:0 is purely combinatorial, based on the 
address decode of LAD31:17. 
14. ROMAOE and ROMAOO are synchronous during burst cycles and at the end of burst and non-burst cycles. At the 
beginning of a cycle, ROMAOEand ROMAOOare governed by tLHMV(ALE high to ROMAOEand ROMAOOvalid). 
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Figure 11. Synchronous 
Input/Output 
Waveform 
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Figure 12. Reset Timing 
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7.2 Targeted L-Bus Timing Specifications 


Table 
18. L·Bus 
Relative 
Timing 
SpeCifications 


82961KD-20 
(20 MHz Specification) 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


tAvLH 
LAD Address 
Valid to ALE High 
5 
ns 
(1) 


tLHAX 
LAD Address 
Hold from ALE High 
1 
ns 
(1) 


tLLLH 
ALE Low to ALE High 
10 
ns 
(1) 


tELOD 
DEN Low to LAD Data Bus Driven 
28 
ns 
(1) 


tEHQF 
DEN High to LAD Data Floating 
28 
ns 
(1) 


tLAMA 
Local Address 
to Memory Data 
5 
22 
ns 
(1) 


tMDLD 
Memory 
Data to Local Data 
5 
22 
ns 
(1) 


tRHRL 
RESET High to RESET Low 
41 
CLK2 
(1) 


82961KD·16 
(16 MHz Specification) 


TBD 


NOTE: 
1. See Figure 
13 for waveforms 
and specifications. 
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Figure 
13. L·Bus 
Relative 
Timings 
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Table 19. DRAM Controller 
Relative 
Timings 


82961KD-20, 
-16 (20 MHz, 16 MHz Specification) 


Symbol 
I 
Parameter 
I 
Mln/Max 
I 
Units 
I 
Notes 


tDQDQ 
I 
Output to Output Delay 
I 
n2x' 
(te) ±4 
I 
ns 
I 
(1,2) 


NOTES: 
1. See Figure 16 for waveforms and specifications. 
2. The le parameter refers to the CLK2 period. The value for le is given elsewhere in the specitication. n2x is the number of 
2x clock cycles between DRAM outputs. The number of 2x clock cycles is selected by programming the DRAM controller. 
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Figure 16. DRAM Relative 
Timings 


Table 20. DRAM Controller 
Programmable 
Timings 


Symbol 
Parameter 
Min 
Max 
Units 
Value 
Value 


nASR 
Row Address 
Setup to RAS3:0 Active 
1 
7 
2xClocks 


nRAH 
Row Address 
Hold after RAS3:0 Active 
1 
7 
2x Clocks 


nASC 
Column Address 
Setup to CAS3:0 Active 
1 
7 
2x Clocks 


nCASrd 
CAS3:0 Pulse Width for Reads 
1 
7 
2x Clocks 


nCASwr 
CAS3:0 Pulse Width for Writes 
1 
7 
2x Clocks 


nCPrd 
CAS3:0 Precharge 
for Reads 
0 
7 
2xClocks 


nCPwr 
CAS3:0 Precharge 
for Writes 
1 
7 
2xClocks 


nRP 
RAS3:0 Precharge 
1 
7 
2x Clocks 


nRPC 
RAS3:0 Hold to CAS3:0 Precharge 
(Refresh) 
0 
7 
2x Clocks 


nCSR 
CAS3:0 Setup Time (Refresh) 
0 
7 
2x Clocks 


nCHR 
CAS3:0 Hold Time (Refresh) 
0 
7 
2x Clocks 


3-399 


• 


"~~-'UL 
~ 
n 
---- 
: 
"'<: ,....---------- 
CAS5:0[ 
,_ 
_ 


, 


MA~~:~~~[ 
0 
VAL:D 
)---------------- 


WE5:0 = HIGH 


Figure 17. DRAM Non-Page 
Read Cycle 


960 
[ 
BUS STATE 
Ta, 
Tw 
: 
Tw... 
Td 
Tr 
Ta: 
Tw 


I 
I 
I 
I 


[~: 
L.:........r.:: 
DATA 
:~ 
LAD51:0 
~ 
DATA 
~ 
., 
, 
, 
' 
, "-----~~--- 


[ 


',', 
CC 
I 
I 
I 
I 
-----;.-.....;..-H,,--", L.iJ' 
I 
5\ 
READY' 
'" 
, 
, 
, 


I, 
I 
I 
I 
I 
I 
I 
I 
, 
" 
, 
" 


Tw •.. 


DRA10:0[ 
ROW 
COLUMN 
I 
I 
ROW 
, 
"ASR 
, 
, 
I--- 
, 
"RP 
, 
"RAH 
,, 
, 
,, 
- 
"ASC 
, 


"cASwr~ ,,, 


RAS5:0 
[ 


WE[ 
\'------o!-" 
\'------ 


MAD51:16[_~_-!_~( 
~ATA 
)1-__ ~--1( 
DATA 
MD15:0 
•••• 
•..••• 
•••• 
_ 


Figure 18. DRAM Non-Page 
Write Cycle 


3-400 


272221-19 


272221-20 


intel~ 
82961KD 


Td 
I 
Tr 
Ti 
Ta 
Tw 
I 
Tw ... 


I 
I 


I 
" 
[ \\:B 
BC 


LAD31:0 
~~--- 
VALID 
_ 
•••••_ 
••••••_ 
ADDR 


(READS) 
• 


I 
I 


LAD31:0 
[:: 
)i-I----,,-r----r-- 


(WRITES) 
--- 
•••,\\.------~. 
L-..A...- 
,, 
l.il 


I 
I 


WE [ 
R_EA_D_=_l'_W 
__RI_TE_=_O 
__ 
•••1 


I 
I 


t.tAD31:16 
[ 
~ 
C 
t.tD15:0 
------... 
VALID f-------!------------------- 


(READS) 
I 
I 


960 
[ 


BUS STATE 


READY [ 


DRA10:0 
[ 


RAS3:0 
[ 


CAS3:0 
[ 


Tw ... 


1\ 


COLUt.tN 
X 
X 
ROW 


, 
I 
, 
, 
"RP 
_nRP 


I 
, 
"RAH 
, 
I 
I 
I 
(NOTE 1) 


I 
I 
I 
I,nASC, 
:f4- nRPC-f4- 
nCSR ...J..o, 
, 
I- 
nCHR_ 
, 
I 


I 
:, 
L 
, 


I 
I 


\ 


t.tAD31 : 16 [ 
) 
( 


t.tD15:0 
DATA 
.f-------;-.... 


(WRITES) 
----------... 
,,----------------- 


DATA 


272221-21 


NOTE: 
1. Assumes 
nAP " 
nASA. 


Figure 19. DRAM CAS before 
RAS Refresh 
Cycle 


3-401 


82961KD 


960 
[ 
Ta 
Tw 
T 
Td 
T 
BUS STATE 
, 
, 
,w... 
: 
w ... 


I 
I 
I 
I 


lAD31:0[~: 
~ 


(READS) 
~joo. 
-- 
__ ~I----"---.1 
' 


I 
I 
I 
I 


lAD31:0[--+EC' 
': 
' c: 
)--f-EC:" 
! 
(WRITES) 
ADDRESS 
'-_D_AT_A_O_--_- 
,-_D_A 
••T_A_1___ 
,ADDRESS 
, 


, 
, 
r 
, 
,r 
'" 
1-,--- 


READY 
[ 
:: 
\~ 
;;; 
! 


I 
I 
I 
I 
I 
, 
, 


Tr 
Ta 
Tw 
Tw ... 
Td 


ORA 10:0 
[ 
__ 
--_""'_RO_W_'I'- 
__ 
C_O_l_U_M_N_O 
__ 
..I,••••--C-O-l-U-M-N-_-.n.-- 
•••••,'- 
RO_W 
_ 


t-----"ilP 
r.... 
----"""11 
"RAH 


RAS3:0 
~~---------~----------------r~ 
"ASC 


CAS3:0 
[ 


\ 
READ = 
1, WRITE = 0 
X 
READ = 
1, WRITE = 0 y 


, 
, 
, 
, 


CE3 
~ 
,, 


( 
DATA 0 
X 
DATA 1 
) 


MAD31:16 
[ 
MD15:0 
- 
•••• -- 
•••••------- 


(READS) 


MAD31:16 
[ 
MD15:0 
-+---+-- 


(WRITES) 
---~('---- 


272221-22 


Figure 20. DRAM Page Mode Read/Write 
Burst Cycle 
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Figure 21. DRAM Static Column Mode Read Burst Cycle 
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NOTE: 
1. Rising ~ 
synchronous 
10 CLK2 A or C edge on 1st half of packed 
read. 


Figure 22. DRAM 16·Blt Page Mode Read Cycle 
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NOTE: 
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Figure 23. DRAM 16-Blt Page Mode Aligned Write Cycle 
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.Table 
21. ROM and 1/0 Controller 
Programmable 
Timings 


Symbol 
Parameter 
Mln 
Max 
Mln 
Max 
Units 
Value 
Value 
Timing 
Timing 


nROMRDY 
READY Timing for ROM Accesses 
0 
15 
1 
16 
1x Clocks 


nRDWT 
lORD Wait High Time 
0 
31 
0 
31 
1x Clocks 


nRDACC 
lORD Access 
Low Time 
0 
31 
1 
32 
1xClocks 


nWRWT 
IOWR Wait High Time 
0 
31 
0 
31 
1xClocks 


nWRACC 
IOWR Access 
Low Time 
0 
31 
1 
32 
1x Clocks 


nlOREC 
110 Recovery 
Time 
0 
31 
0 
31 
1x Clocks 


nlOGS 
roes Low Period 
0 
31 
1 
32 
1x Clocks 
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Table 
22. Printer 
Video 
Interface 
Timings 


82961KD-20, 
-16 (20 MHz. 16 MHz Specification) 


Symbol 
Parameter" 
Mln 
Max 
Units 
Notes 


tVC1 
VCLK Frequency. 
1x Clock Mode 
25 
MHz 


tVC8 
VCLK Frequency. 
8x Clock Mode 
80 
MHz 


tVCH 
VCLK High Time 
ns 
(1) 


tVCL 
VCLK Low Time 
ns 
(1) 


tVOV1 
VIDEO Output Valid. 1x Clock Mode 
27 
ns 
(1) 


tvov8 
VIDEO Output Valid. 8x Clock Mode 
30 
ns 
(1) 


tVIS 
LSYNC Input Setup. tx Clock Mode 
0 
ns 
(1) 


tVIH 
LSYNC Input Hold. 1x Clock Mode 
9 
(1) 


tVW1 
FSYNC Input Pulse Width. 1x Clock Mode 
2 
tVC1 
(1) 


tVW8 
LSYNC. FSYNC Input Pulse Width. 
16 
tVC8 
(1) 
8x Clock Mode 


NOTE: 
1. See Figure 
32 for waveforms 
and specifications. 


Table 
23. Printer 
Communications 
Interface 
Timings 


82961KD 
SUPPLIES 
CCLK 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


tpv 
CMD Output Valid Delay 
4 
2xClocks 
(1) 


tps 
STS Input Setup 
0 
ns 
(1) 


tpH 
STS Input Hold 
4 
2x Clocks 
(1) 


tpsc 
SBSY Valid to CCLK Driven 
0 
6 
2xClocks 
(1) 


CCLK SUPPLIED 
EXTERNALLY 


tpv 
CMD Output Valid Delay 
8 
2xClocks 
(1) 


tps 
STS Input Setup 
0 
2x Clocks 
(1) 


tpH 
STS Input Hold 
8 
2x Clocks 
(1) 


tpc 
CCLK Period 
1000 
ns 
(1) 


tpCH 
CCLK High Time 
400 
ns 
(1) 


tpCL 
CCLK Low Time 
400 
ns 
(1) 


tpcs 
CCLK Driven from SBSY Valid 
0 
2x Clocks 
(1) 


NOTE: 
1. See Figures 
32 and 33 for waveforms 
and specifications. 
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NOTES: 
1. VIDEO can be referenced to either the rising or falling ed~e of VCLK. 
2. LSVNC must meet setup/hold requirements relative to V LK edge that is programmed. 


Figure 32. Printer Video Interface 
Timing 
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Figure 34. Printer Communications 
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INTRODUCTION 


The 82596CA is an intelligent, high-performance 
32-bit 
Local 
Area 
Network 
coprocessor. 
The 
82596CA implements the CSMAlCD access method 
and can be configured to support all existing IEEE 
802.3 
standards-TYPEs 
10BASE-T, 
10BASE5, 
10BASE2, 1BASE5, and 10BROAD36. It can also be 
used to implement the proposed standard TYPE 
10BASE-F. The 82596CA performs high-level com- 
mands, command chaining, and interprocessor com- 
munications via shared memory, thus relieving the 
host CPU of many tasks associated with network 
control. All time-critical functions are performed in- 
dependently of the CPU, this increases network per- 
formance and efficiency. The 82596CA bus interface 
is 
optimized 
for 
Intel's 
i486™SX, 
i486™DX, 
i487™SX, 80960CA, and 80960KB processors. 


The 82596CA implements all IEEE 802.3 Medium 
Access Control and channel interface functions, 
these include framing, preamble generation and 
stripping, source address generation, destination ad- 
dress checking, short-frame detection, and automat- 
ic length-field handling. Data rates up to 20 Mb/s are 
supported. 


The 82596CA provides a powerful host system inter- 
face. It manages memory structures automatically, 
with command chaining and bidirectional data chain- 


. ing. An on-chip DMA controller manages four chan- 
nels, this allows autonomous transfer of data blocks 
(buffers and frames) and relieves the CPU of byte 
transfer overhead. Buffers containing errored or col- 
lided frames can be automatically recovered without 
CPU intervention. The 82596CA provides an up- 
grade path for existing 82586 software drivers by 
providing an 82586-software-compatible mode that 
supports the current 82586 memory structure. The 
82586CA also has a Flexible memory structure and 
a Simplified memory structure. The 82596CA can 
address up to 4 gigabytes of memory. The 82596CA 
supports Little Endian and Big Endian byte ordering. 


The 82596CA bus interface can achieve a burst 
transfer rate of 106 MB/s at 33 MHz. The bus inter- 
face 
employs 
bus 
throttle 
timers 
to 
regulate 
82596CA bus use. Two large, independent FIFOs- 
128 bytes for Receive and 64 bytes for Transmit- 
tolerate long bus latencies and provide programma- 
ble thresholds that allow the user to optimize bus 
overhead for any worst-case bus latency. The high- 
performance bus is capable of back-to-back trans- 
mission and reception during the IEEE 802.3 9.6-,..,s 
Interframe Spacing (IFS) period. 


The 82596CA provides a wide range of diagnostics 
and network management functions, these include 
internal and external loopback, exception condition 
I 


82596CA 


tallies, channel activity indicators, optional capture 
of all frames regardless of destination address 
(promiscuous mode), optional capture of errored or 
collided frames, and time domain reflectometry for 
locating fault points on the network cable. The sta- 
tistical counters, in 32-bit segmented and linear 
modes, are 32-bits each and include CRC errors, 
alignment errors, overrun errors, resource errors, 
short frames, and received collisions. The 82596CA 
also features a monitor mode for network analysis. 
In this mode the 82596CA can capture status bytes, 
and update statistical counters, of frames monitored 
on the link without transferring the contents of the 
frames to memory. This can be.done concurrently 
while transmitting and receiving frames destined for 
that station. 


The 82596CA can be used in both baseband and 
broadband networks. It can be configured for maxi- 
mum network efficiency (minimum contention over- 
head) with networks of any length. Its highly flexible 
CSMAlCD unit supports address field lengths of 
zero through six bytes-configurable 
to either IEEE 
802.3/Ethernet or HDLC frame delimitation. It also 
supports 16- or 32-bit cyclic redundancy checks. 
The CRC can be transferred directly to memory for 
receive operations, or dynamically inserted for trans- • 
mit operations. The CSMAlCD unit can also be con- 
• 
figured for full duplex operation for high throughput 
in point-to-point connections. 


The 82596 C-step incorporates several new features 
not found in previous steppings. The following is a 
summary of the 82596 C-step's new features. 
• The 82596 C-step fixes Errata found in the A1 
and B steppings. 


• The 82596 C-step has improved AC timings over 
both the A and B steppings. 


• The 82596 C-step has a New Enhanced Big Endi- 
an Mode where in Linear Addressing Mode, true 
32-bit Big Endian functionality is achieved. New 
Enhanced Big Endian Mode is enabled by setting 
bit 7 of the SYSBUS byte. This mode is software 
compatible with the big endian mode of the 
B-step with one exception-no 
32-bit addresses 
need to be swapped by software in the C-step. In 
this new mode, the 82596 C-step treats 32-bit ad- 
dress pointers as true 32-bit entities and the SCB 
absolute address and statistical counters are still 
treated as two 16-bit big endian entities. Not set- 
ting this mode will configure the 82596 C-step to 
be 100% compatible to the A1-step big endian 
mode. 


• The 82596 C-step is hardware and software com- 
patible to both the A1 and B steppings allowing 
for easy "drop-in" to current designs. Pinout and 
control structures remain unchanged. 
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The 82596CA 
is fabricated 
with Intel's reliable, 5-V, CHMOS 
IV (process 
648.8) technology. 
It is available 
in a 
132-pin 
PQFP or PGA package. 
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82596CA PGA Cross Reference by Pin Name 


Address 
Data 
Control 
Serial 
Vcc 
Vss 
Interface 


Signal 
Pin No. 
Signal 
Pin No. 
Signal 
Pin No. 
Signal 
Pin No. 
Pin No. 
Pin No. 


A2 
N9 
DO 
J2 
ADS 
M5 
COT 
A13 
B6 
A6 
A3 
M9 
01 
H3 
AHOLO 
N5 
CAS 
A14 
B7 
A7 
A4 
M10 
02 
G2 
BEO 
M7 
CfS 
C11 
B10 
A8 
A5 
P11 
03 
G3 
BE1 
P5 
LPBK 
A12 
C13 
A10 
A6 
N11 
04 
G1 
BE2 
M8 
RTS 
C10 
E2 
E1 
A7 
P12 
05 
01 
BE3 
P9 
RXC 
B11 
E13 
E14 
A8 
M11 
06 
C1 
BLAST 
N2 
RxO 
B12 
F2 
F1 
A9 
N12 
07 
F3 
BOFF 
N6 
TxC 
C12 
G13 
G14 
A10 
M12 
08 
02 
BROY 
M1 
TxO 
A11 
H2 
H1 
A11 
P13 
09 
C2 
BREQ 
P4 
H13 
H14 
A12 
L12 
010 
E3 
BS16 
N1 
J13 
J1 
A13 
N13 
011 
03 
CA 
P3 
K2 
J14 
A14 
M13 
012 
B2 
CLK 
J3 
L13 
K1 
A15 
P14 
013 
B1 
OPO 
L2 
N7 
L14 
A16 
K12 
014 
C3 
OP1 
L3 
N8 
P6 
A17 
N14 
015 
A1 
OP2 
L1 
N10 
P7 
A18 
J12 
016 
B3 
OP3 
K3 
P8 
A19 
K13 
017 
C4 
HLOA 
M6 
P10 
A20 
M14 
018 
A2 
HOLD 
P2 
A21 
H12 
019 
C5 
INTIINT 
N3 
A22 
K14 
020 
A3 
LE/8E 
814 
A23 
G12 
021 
B4 
LOCK 
M4 
A24 
F14 
022 
A4 
'j5'C"ffi( 
P1 
A25 
F12 
023 
C6 
PORT 
M2 
A26 
F13 
024 
85 
READY 
M3 
A27 
014 
025 
C7 
RESET 
B13 
A28 
E12 
026 
A5 
W/R 
N4 
A29 
013 
027 
B8 
A30 
012 
028 
C8 
A31 
C14 
029 
A9 
030 
C9 
031 
89 
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PIN DESCRIPTIONS 


Symbol 
PQFP 
Type 
Name and Function 
Pin No. 


ClK 
9 
I 
CLOCK. 
The system clock input provides the fundamental 
timing for 
the 82596. It is a 1X ClK 
input used to generate 
the 82596 clock and 
requires TTl 
levels. All external timing parameters 
are specified 
in 
reference 
to the rising edge of ClK. 


00-031 
14-53 
1/0 
DATA BUS. The 32 Oata Bus lines are bidirectional, 
tri-state 
lines that 
provide the general purpose data path between 
the 82596 and 
memory. With the 82596 the bus can be either 16 or 32 bits wide; this 
is determined 
by the B516 signal. The 82596 always drives all 32 data 
lines during Write operations, 
even with a 16-bit bus. 031- 
00 are 
floated 
after a Reset or when the bus is not acquired. 
These lines are inputs during a CPU Port access; in this mode the CPU 
writes the next address to the 82596 through the data lines. Ouring 
PORT commands 
(Relocatable 
SCP, Self-Test, 
Reset and Oump) the 
address 
must be aligned to a 16-byte boundary. This frees the 03-00 
lines so they can be used to distinguish 
the commands. 
The following 
is a summary of the decoding 
data. 


DO 
D1 
D2 
D3 
D31-D4 
Function 


0 
0 
0 
0 
0000 
Reset 
0 
1 
0 
0 
AOOR 
Relocatable 
SCP 
1 
0 
0 
0 
AOOR 
Self-Test 
1 
1 
0 
0 
AOOR 
Oump Command 


OPO-OP3 
4-7 
110 
DATA PARITY. 
These are tri-stated 
data parity pins. There is one 
parity line for each byte of the data bus. The 82596 drives them with 
even-parity 
information 
during write operations 
having the same timing 
as data writes. Likewise, even-parity 
information, 
with the same timing 
as read information, 
must be driven back to the 82596 over these pins 
to ensure that the correct 
parity check status is indicated 
by the 
82596. 


PCHK 
127 
0 
PARITY 
CHECK. This pin is driven high one clock after ROY to inform 


Read operations 
of the parity status of data sampled 
at the end of the 
previous clock cycle. When driven low it indicates 
that incorrect 
parity 
data has been sampled. 
It only checks the parity status of enabled 
bytes, which are indicated 
by the Byte Enable and Bus Size signals. 


PCHK is only valid for one clock time after data read is returned 
to the 
82596; i.e., it is inactive 
(high) at all other times. 


A31-A2 
70-108 
0 
ADDRESS 
LINES. These 30 tri-stated 
Address 
lines output the 
address bits required for memory operation. 
These lines are floated 


after a Reset or when the bus is not acquired. 


BE3-BEO 
109-114 
0 
BYTE ENABLE. 
These tri-stated 
signals are used to indicate which 


bytes are involved with the current memory access. The number of 
Byte Enable signals asserted 
indicates 
the physical size of. the data 


being transferred 
(1, 2, 3, or 4 bytes). 


• BEO indicates 
07-00 


, 
• BE1 indicates 
015-08 
• BE2 indicates 
023-016 
• BE3 indicates 
031-024 
These lines are floated 
after a Reset or when the bus is not acquired. 


WIR 
120 
0 
WRITE/READ. 
This dual function 
pin is used to distinguish 
Write and 
Read cycles. This line is floated 
after a Reset or when the bus is not 


acquired. 


I 
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PIN DESCRIPTIONS 
(Continued) 


Symbol 
PQFP 
Type 
Name and Function 
Pin No. 


ADS 
124 
0 
.ADDRESS STATUS. 
The 82596 uses this tri-state 
pin to indicate ~ 
indicate that a valid bus cycle has begun and that A31-A2, 
BE3-BEO, 
and WlR are being driven. It is asserted 
during t1 bus states. This line 
is floated 
after a Reset or when the bus is not acquired. 


RDY 
130 
I 
READY. Active low. This signal is the acknowledgment 
from 
addressed 
memory that the transfer 
cycle can be completed. 
When 
high, it causes wait states to be inserted. 
It is ignored at the end of the 
first clock of the bus cycle's 
data cycle. This active-Iow 
signal does not 
have an internal pull-up resistor. This signal must meet the setup and 
hold times to operate correctly. 


BRDY 
2 
I 
BURST READY. Active low. Burst Ready, like RDY, indicates 
that the 
external 
system has presented 
valid data on the data pins in response 
to a Read, or that the external system has accepted 
the 82596 data in 
response 
to a Write request. Also, like RDY, this signal is ignored at 
the end of the first clock in a bus cycle. If the 82596 can still receive 
data from the previous cycle, ADS will not be asserted 
in the next 
clock cycle; however, 
Address 
and Byte Enable will change to reflect 
the next data item expected 
by the 82596. BRDY will be sampled 
during each succeeding 
clock and if active, the data on the pins will be 
strobed to the 82596 or to external 
memory (read/write). 
BRDY 
operates 
exactly like READY during the last data cycle of a burst 
sequence 
and during nonburstable 
cycles. 


BLAST 
128 
0 
BURST LAST. A signal (active low) on this tri-state 
pin indicates 
that 
the burst cycle is finished 
and when BRDY is next returned 
it will be 
treated as a normal ready; i.e., another 
set of addresses 
will be driven 
with ADS or the bus will go idle. BLAST is not asserted 
if the bus is not 
acquired. 
' 


AHOLD 
117 
I 
ADDRESS 
HOLD. This hold signal is active high, it allows another 
bus 
master to access the 82596 address bus. In a system where an 82596 
and an i486 processor 
share the local bus, AHOLD allows the cache 
controller 
to make a cache invalidation 
cycle while the 82596 holds the 
address lines. In response 
to a signal on this pin, the 82596 
immediately 
(i.e. during the next clock) stops driving the entire address 
bus (A31-A2); 
the rest of the bus can remain active. For example, 


data can be returned for a previously 
specified 
bus cycle during 
Address 
Hold. The 82596 will not begin another 
bus cycle while 
AHOLD is active. 


BOFF 
116 
I 
BACKOFF. 
This signal is active low, it informs the 82596 that another 
bus master requires access to the bus before the 82596 bus cycle 
completes. 
The 82596 immediately 
(i.e. during the next clock) floats its 
bus. Any data returned 
to the 82596 while BOFF is asserted 
is ignored. 
BOFF has higher priority than RDY or BRDY; if two such signals are 
returned 
in the same clock period, BOFF is given preference. 
The 
82596 remains in Hold until BOFF goes high, then the 82596 resumes 
its bus cycle by driving out the address and status, and asserting 
ADS. 


LOCK 
126 
0 
LOCK. This tri-state 
pin is used to distinguish 
locked and unlocked 
bus 
cycles. LOCK generates 
a semaphore 
handshake 
to the CPU. LOCK 
can be active for several memory cycles, it goes active during the first 
locked memory cycle (t1) and goes inactive at the last locked cycle 
(t2). This line is floated 
after a Reset or when the bus is not acquired. 


LOCK can be disabled 
via the sysbus byte in software. 
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PIN DESCRIPTIONS 
(Continued) 


Symbol 
PQFP 
Type 
Name and Function 
Pin No. 


BS16 
129 
I 
BUS SIZE. This signal allows the 82596CA 
to work with either 16- or 
32-bit bytes. Inserting BS16 low causes the 82596 to perform two 16- 
bit memory accesses 
when transferring 
32-bit data. In little endian 
mode the D15-DO 
lines are driven when B516 is inserted, 
in Big 
Endian mode the D31- D16 lines are driven. 


HOLD 
123 
0 
HOLD. The HOLD signal is active high, the 82596 uses it to request 
local bus mastership. 
In normal operation 
HOLD goes inactive before 
HLDA. The 82596 can be forced off the bus by deasserting 
HLDA or if 


the bus throttle 
timers expire. 


HLDA 
118 
I 
HOLD ACKNOWLEDGE. 
The HLDA signal is active high, it indicates 
that bus mastership 
has been given to the 82596. HLDA is internally 
synchronized; 
after HOLD is detected 
low, the CPU drives HLDA low. 


NOTE: 
Do not connect 
HLDA to Vcc-it 
will cause a deadlock. 
A user wanting 


to give the 82596 permanent 
access to the bus should connect 
HLDA 


to HOLD. If HLDA goes inactive before HOLD, the 82596 will release 
the bus (by deasserting 
HOLD) within a maximum of within a specified 


number of bus cycles as specified 
in the 82596 User's Manual. 


BREQ 
115 
I 
BUS REQUEST. 
This signal, when configured 
to an externally 
activated 
mode, is used to trigger the bus throttle 
timers . 


PORT 
3 
I 
PORT. When this signal is received, 
the 82596 latches the data on the 


data bus into an internal 32-bit register. When the CPU is asserting 
this 


signal it can write into the 82596 (via the data bus). This pin must be 
activated 
twice during all CPU Port access commands. 


RESET 
69 
I 
RESET. This active high, internally 
synchronized 
signal causes the 
82596 to terminate 
current activity. The signal must be high for at least 


five system clock cycles. After five system clock cycles and four TxC 
clock cycles the 82596 will execute a Reset when it receives 
a high 
RESET signal. When RESET returns to low the 82596 waits for the 
first CA signal and then begins the initialization 
sequence. 


LE/BE 
65 
I 
LITTLE ENDIAN/BIG 
ENDIAN. This dual-function 
pin is used to 
select byte ordering. When LE/BE 
is high, little endian byte ordering 
is 


used; when low, big endian byte ordering 
is used for data in frames 
(bytes) and for control 
(SCB, RFD, CBL, etc). 


CA 
119 
I 
CHANNEL 
ATTENTION. 
The CPU uses this pin to force the 82596 to 


begin executing 
memory resident Command 
blocks. The CA signal is 
internally 
synchronized. 
The signal must be high for at least one 
system clock. It is latched internally 
on the high to low edge and then 
detected 
by the 82596. 


The first CA after a Reset forces the 82596 into the initialization 
sequence 
beginning 
at location 
00FFFFF6h 
or an SCP address written 


to the 82596 using CPU Port access. All subsequent 
CA signals cause 


the 82596 to begin executing 
new command 
sequences 
from the SCB. 


INT/INT 
125 
0 
INTERRUPT. 
A high signal on this pin notifies the CPU that the 82596 


is requesting 
an interrupt. This signal is an edge triggered 
interrupt 
signal, and can be configured 
to be active high or low. 


I 


• 
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PIN DESCRIPTIONS 
(Continued) 


Symbol 
PQFP 
Type 
Name and Function 
Pin No. 


Vcc 
17 Pins 
POWER. 
+5V 
±10%. 


Vss 
17 Pins 
GROUND.OV. 


TxD 
54 
0 
TRANSMIT 
DATA. This pin transmits 
data to the serial link. It is high 
when not transmitting. 


TxC 
64 
I 
TRANSMIT 
CLOCK. This signal provides the fundamental 
timing for 
the serial subsystem. 
The clock is also used to transmit 
data 
synchronously 
on the TxD pin. For NAZ encoding, 
data is transferred 
to the TxD pin on the high to low clock transition. 
For Manchester 
encoding, 
the transmitted 
bit center is aligned with the low to high 
transition. 
Transmit 
clock must always be running for proper device 
operation. 


LPBK 
58 
0 
LOOPBACK. 
This TTL-Ievel 
control 
signal enables the loopback 
mode. In this mode serial data on the TxD input is routed through the 
82C501 internal circuits and back to the AxD output without driving the 
transceiver 
cable. To enable this signal, both internal and external 
loopback 
need to be set with the Configure 
command. 


AxD 
60 
I 
RECEIVE 
DATA. This pin receives 
NAZ serial data only. It must be 
high when not receiving. 


AxC 
59 
I 
RECEIVE 
CLOCK. This signal provides timing information 
to the 
internal shifting logic. For NAZ data the state of the AxD pin is 
sampled 
on the high to low transition 
of the clock. 


ATS 
57 
0 
REQUEST TO SEND. When this signal is low the 82596 informs the 
external interface 
that it has data to transmit. 
It is forced 
high after a 
Aeset or when transmission 
is stopped. 
' 


CTS 
62 
I 
CLEAR TO SEND. An active-Iow 
signal that enables the 82596 to 
send data. It is normally 
used as an interface 
handshake 
to ATS. 
Asserting 
CTS high stops transmission. 
CTS is internally 
synchronized. 


If CTS goes inactive, meeting the setup time to the TxC negative 
edge, 


the transmission 
will stop and ATS will go inactive within, at most, two 
TxCcycles. 


CAS 
63 
I 
CARRIER 
SENSE. This signal is active low, it is used to notify the 
82596 that traffic is on the serial link. It is only used if the 82596 is 
configured 
for external Carrier Sense. In this configuration 
external 
circuitry is required for detecting 
traffic on the serial link. CAS is 
internally 
synchronized. 
To be accepted, 
the signal must remain active 
for at least two serial clock cycles (for CASF = 0). 


CDT 
61 
I 
COLLISION 
DETECT. This active-Iow 
signal informs the 82596 that a 
collision 
has occurred. 
It is only used if the 82596 is configured 
for 
external Collision 
Detect. External circuitry 
is required for collision 
detection. 
CDT is internally 
synchronized. 
To be accepted, 
the signal 


must remain active for at least two serial clock cycles (for CDTF = 0). 
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82596 AND HOST CPU INTERACTION 


The 82596CA and the 
host CPU communicate 
through shared memory. Because of its on-chip 
DMA capability, the 82596 can make data block 
transfers (buffers and frames) independently of the 
CPU; this greatly reduces the CPU byte transfer 
overhead. 


The 82596 is a multitasking coprocessor that com- 
prises two independent logical units-the 
Command 
Unit (CU) and the Receive Unit (RU). The CU exe- 
cutes commands from shared memory. The RU han- 
dles all activities related to frame reception. The in- 
dependence of the CU and RU enables the 82596 to 
engage in both activities simultaneously-the 
CU 
can fetch and execute commands from memory 
while the RU is storing received frames in memory. 
The CPU is only involved with this process after the 
CU has executed a sequence of commands or the 
RU has finished storing a sequence of frames. 


The CPU and the 82596 use the hardware Signals 
Interrupt (1Nl) and Channel Attention (CA) to initiate 
communication 
with 
the 
System 
Control 
Block 
(SCB),see Figure 4. The 82596 uses INT to alert the 
CPU of a change in the contents of the SCB, the 
CPU uses CA to alert the 82596. 


The 82596 has a CPU Port Access state that allows 
the CPU to execute certain functions without ac- 
cessing memory. The 82596 PORT pin and data bus 
pins are used to enable this feature. The CPU can 
directly activate four operations when the 82596 is in 
this state. 
• Write an alternative System Configuration Pointer 
(SCP).This can be used when the 82596 cannot 
use the default SCP address space. 
• Write a different Dump Command Pointer and ex- 
ecute Dump. This can be used for troubleshoot- 
ing No Response problems. 
• The CPU can reset the 82596 via software with- 
out disturbing the rest of the system. 


• A self-test can be used for board testing; the 
82596 will execute a self-test and write the re- 
sults to memory. 


1 


82596CA 


82596 BUS INTERFACE 


The 82596CA has bus interface timings and pin defi- 
nitions 
that 
are 
compatible 
with 
Intel's 
32-bit 
i486TMSX and 
i486TMDX microprocessors. 
This 
eliminates the need for additional bus interface logic. 
Operating at 33 MHz, the 82596's bus bandwidth 
can be as high as 106 MB/s. Since Ethernet only 
requires 1.25 MB/s, 
this leaves a considerable 
amount of bandwidth for the CPU. The 82596 also 
has a bus throttle to regulate its use of the bus. Two 
timers can be programmed through the SCB: one 
controls the maximum time the 82596 can remain on 
the bus, the other controls the time the 82596 must 
stay off the bus (see Figure 5). The bus throttle can 
be programmed to trigger internally with HLDA or 
ex1ernallywith BREQ. These timers can restrict the 
82596 HOLD activation time and improve bus utiliza- 
tion. 


82596 MEMORY 
ADDRESSING 


The 82596 has a 32-bit memory address range, 
which allows addressing up to four gigabytes of 
memory. The 82596 has three memory addressing 
modes (see Table 1). 
• 82586 Mode. The 82596 has a 24-bit memory 
address range. The System Control Block, Com- 
mand List, Receive Descriptor List, and Buffer 
Descriptors must reside in one 64-KB memory 
segment. Transmit and Receive buffers can re- 
side in a 24-bit address space. 
• 32-Blt Segmented Mode. The 82596 has a 32- 
bit memory address range. The System Control 
BlOCk,Command List, Receive Descriptor List, 
and Buffer Descriptors must reside in one 64-KB 
memory segment. Transmit and Receive buffers 
can reside in a 32-bit address space. 


• Linear Mode. The 82596 has a 32-bit memory 
address range. Any memory structure can reside 
anywhere within the 
32-bit memory address 
range. 
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Figure 4. 82596 and Host CPU Intervention 
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Figure 5. Bus Throttle 
Timers 


Operation 
Mode 


Pointer 
or Offset 
82586 
32-Blt 
Linear 
Segmented 


15CP Address 
24-Bit Linear 
32-Bit Linear 
32-Bit Linear 


5CB Address 
Base (24) + Offset (16) 
Base (32) + Offset (16) 
32-Bit Linear 


Command 
Block Pointers 
Base (24) + Offset (16) 
Base (32) + Offset (16) 
32-Bit Linear 


Rx Frame Descriptors 
Base (24) + Offset (16) 
Base (32) + Offset (16) 
32-Bit Linear 


Tx Frame Descriptors 
Base (24) + Offset (16) 
Base (32) + Offset (16) 
32-Bit Linear 


Rx Buffer Descriptors 
Base (24) + Offset (16) 
Base (32) + Offset (16) 
32-Bit Linear 


Tx Buffer Descriptors 
Base (24) + Offset (16) 
Base (32) + Offset(16) 
32-Bit Linear 


Rx Buffers 
24-Bit Linear 
32-Bit Linear 
32-Bit Linear 


Tx Buffers 
24-Bit Linear 
32-Bit Linear 
32-Bit Linear 


Table 
1.82596 
Memory 
Addressing 
Formats 
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Figure 6. 82596 Shared Memory Structure 


The Shared Memory structure consists of four parts: 
the Initialization Root, the System Control Block, the 
Command List, and the Receive Frame Area (see 
Figure 6). 


The Initialization Root is in an established location 
known to the host CPUand the 82596 (OOFFFFF6h). 
However, the CPU can establish the Initialization 
Root in another location by using the CPU Port ac- 
cess. This root is accessed during initialization, and 
points to the System Control Block. 
l 


The System Control Block serves as a bidirectional 
mail drop for the host CPU and the 82596 CU and 
RU. It is the central point through which the CPU and 
the 82596 exchange control and status information. 
The SCB has two areas. The first contains instruc- 
tions from the CPU to the 82596. These include: 
control of the CU and RU (Start, Abort, Suspend, 
and Resume), a pointer to the list of CU commands, 
a pointer to the Receive Frame Area, a set of Inter- 
rupt Acknowledge bits, and the T-ON and T-OFF 
timers for the bus throttle. The second area contains 
status information the 82596 is sending to the CPU. 
Such as, the CU and RU states 
(Idle, Active 


• 
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Ready, Suspended, No Receive Resources, etc.), in- 
terrupt bits (Command Completed, Frame Received, 
CU Not Ready, and RU Not Ready), and statistical 
counters. 


The Command List functions as a program for the 
CU; individual commands are placed in memory 
units called Command Blocks (CBs). These CBs 
contain the parameters and status of specific high- 
level commands called Action Commands; e.g., 
Transmit or Configure. 


Transmit causes the 82596 to transmit a frame. The 
Transmit CB contains the destination address, the 
length field, and a pointer to a list of linked buffers 
holding the frame that is to be constructed from sev- 
eral buffers 
scattered 
throughout 
memory. The 
Command Unit operates without CPU intervention; 
the DMA for each buffer, and the prefetching of ref- 
erences to new buffers, is performed in parallel. The 
CPU is notified only after a transmission is complete. 


The Receive Frame Area is a list of Free Frame De- 
scriptors (descriptors not yet used) and a list of user- 
prepared buffers. Frames arrive at the 82596 unso- 
licited; 'the 82596 must always be ready to receive 
and store them in the Free Frame Area. The Re- 
ceive Unit fills the buffers when it receives frames, 
and reformats the Free Buffer List into received- 
frame structures. The frame structure is, for all prac- 
tical purposes, identical to the format of the frame to 
be transmitted. The first Frame descrtptor 
is refer- 
enced by the SCB. Unless the 82596 is configured 
to Save Bad Frames, the frame descriptor, and the 
associated buffer descriptor, which is wasted when 
a bad frame is received, are automatically reclaimed 
and returned to the Free Buffer List. 


Receive buffer chaining (storing incoming frames in 
a linked buffer list) significantly improves memory 
utilization. Without buffer chaining, the user must al- 
locate consecutive blocks of memory, each capable 
of containing a maximum frame (for Ethernet, 1518 
bytes). Since an average frame is about 200 bytes, 
this is very inefficient. With buffer chaining, the user 
can allocate small buffers and the 82596 will only 
use those that are needed. 


Figure 7 A-D 
illustrates how the 82596 uses the 
Receive Frame Area. Figure 7A shows an unused 
Receive Frame Area composed of Free Frame De- 
scriptors and Free Receive Buffers prepared by the 
user. The SCB points to the first Frame Descriptor of 
the Frame Descriptor List. Figure 7B shows the 
same Receive Frame Area after receiving one 
frame. This first frame occupies two Receive Buffers 
and one Frame Descriptor-a 
valid received frame 
will only occupy one Frame Descriptor. After receiv- 


4-14 


ing this frame the 82596 sets the next Free Frame 
Descriptor RBD pointer to the next Free RBD. Figure 
7C shows the RFA after receiving a second frame. 
In this example the second frame occupies only one 
Receive Buffer and one RFD. The 82596 again sets 
the RBD pointer. This process is repeated again in 
Figure 70, showing the reception of another frame 
using one Receive Buffer; in this example there is an 
extra Frame Descriptor. 


TRANSMIT 
AND RECEIVE 
MEMORY 
STRUCTURES 


There are three memory structures for reception and 
transmission. The 82586 memory structure, the 
Flexible memory structure, and the Simplifiedmemo- 
ry structure. The 82586 mode is selected by config- 
uring the 82596 during initialization. In this mode all 
the 82596 memory structures are compatible with 
the 82586 memory structures. 


When the 82596 is not configured to the 82586 
mode, the other two memory structures, Simplified 
and Flexible, are available for transmitting and re- 
ceiving. These structures are selected by setting the 
S/F bit in the Transmit Command and/or the Re- 
ceive Frame Descriptor (see Figures 29, 30, 41, and 
42). It is recommended that any linked list of buffers 
be relegated to a single type--either 
simplified or 
flexible. The Simplified memory structure offers a 
simple structure for ease of programming (see Fig- 
ure 8). All information about a frame is contained in 
one structure; for example, during reception the RFD 
and data field are contained in one structure. 


The Flexible memory structure (see Figure 9) has a 
control field that allows the programmer to specify 
the amount of receive data the RFD will contain for 
receive operations and the amount of transmit data 
the Transmit Command Block will contain for trans- 
mit operations. For example, when the control field 
in the RFD is set to 20 bytes during a reception, the 
first 20 bytes of the data field are stored in the RFD 
(6 bytes of destination address, 6 bytes of source 
address, 2 bytes of length field, and 6 bytes of data) 
and the remainder of the data field is stored in the 
Receive Data Buffers. This is useful for capturing 
frame headers when header information is con- 
tained in the data field. The header information can 
then be automatically stored in the RFD partitioned 
from the Receive Data Buffer. 


The control field can also be used for the Transmit 
Command when the- Flexible memory structure is 
used. The quantity of data field bytes to be transmit- 
ted from the Transmit Command Block is specified 
by the variable control field. 
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Figure 8. Simplified 
Memory 
Structure 
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Figure 9. Flexible 
Memory 
Structure 
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TRANSMITTING 
FRAMES 


The 
82596 
executes 
high-level 
Action 
Commands 
from 
the Command 
List in system 
memory. 
Action 
Commands 
are fetched 
and executed 
in parallel with 
the host CPU operation, 
thereby 
significantly 
improv- 
ing system 
performance. 
The format 
of the Action 
Commands 
is shown 
in Figure 
10. Figure 28 shows 
the 82586 
mode, 
and Figures 
29 and 30 show the 
command 
formats 
of the Linear and 32-bit Segment- 
ed modes. 


A single Transmit 
command 
contains, 
as part of the 
command-specific 
parameters, 
the 
destination 
ad- 
dress and length field of the transmitted 
frame and a 
pointer to buffer area in memory containing 
the data 
portion 
of the frame. The data field is contained 
in a 
memory 
data 
structure 
consisting 
of a buffer 
de- 
scriptor 
(BD) and a data buffer-or 
a linked 
list of 
buffer 
descriptors 
and buffers-as 
shown 
in Figure 
11. 


Multiple 
data buffers 
can be chained 
together 
using 
the BOs. Thus, a frame with a long data field can be 
transmitted 
using 
several 
(shorter) 
data 
buffers 
chained 
together. 
This chaining technique 
allows the 
system designer 
to develop 
efficient 
buffer manage- 
ment. 


The 
82596 
automatically 
generates 
the 
preamble 
(alternating 
1s and 
Os) and 
start 
frame 
delimiter, 
fetches 
the destination 
address and length field from 
the Transmit 
command, 
inserts 
its unique 
address 
as the source 
address, 
fetches 
the data field speci- 
fied by the Transmit 
command, 
and cpmputes 
and 
appends 
the CRC to the end of the frame 
(see Fig- 
ure 12). In the Linear and 32-bit 
Segmented 
mode 
the CRC can be optionally 
inserted 
on a frame-by- 
frame 
basis 
by setting 
the 
NC bit in the Transmit 
Command 
Block (see Figures 29 and 30). 


The 82596 can be configured 
to generate 
two types 
of start 
and 
end 
frame 
delimiters-End 
of Carrier 
(EOC) or HOLC. In EOC mode the start frame delimi- 
ter is 10101011 
and the end frame 
delimiter 
is indi- 
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cated by the lack of a signal after the last bit of the 
frame check sequence 
field has been transmitted. 
In 
EOC mode the 82596 
can be configured 
to extend 
short frames by adding pad bytes (7Eh) during trans- 
mission, according 
to the length field. In HOLC mode 
the 82596 
will generate 
the 01111110 
flag for the 
start and end frame 
delimiters, 
and do standard 
bit 
stuffing 
and stripping. 
Furthermore, 
the 82596 
can 
be configured 
to pad frames 
shorter 
than the speci- 
fied minimum 
frame 
length by appending 
the appro- 
priate number of flags to the end of the frame. 


When 
a collision 
occurs, 
the 
82596 
manages 
the 
jam, random 
wait, and retry processes, 
reinitializing 
OMA 
pointers 
without 
CPU 
intervention. 
Multiple 
frames 
can be sent by linking the appropriate 
num- 
ber of Transmit 
commands 
together. 
This is particu- 
larly useful when transmitting 
a message 
larger than 
the maximum 
frame size (1518 bytes for Ethernet). 
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Figure 10. Action Command 
Format 
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Figure 11. Data Buffer Descrlptor 
and 
Data Buffer Structure 
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RECEIVING 
FRAMES 


To reduce CPU overhead, the 82596 is designed to 
receive frames without CPU supervision. The host 
CPU first sets aside an adequate receive buffer 
space and then enables the 82596 Receive Unit. 
Once enabled, the RU watches for arriving frames 
and automatically stores them in the Receive Frame 
Area (RFA). The RFA contains Receive Frame De- 
scriptors, Receive Buffer Descriptors, and Data Buff- 
ers (see Figure 13). The individual Receive Frame 
Descriptors make up a Receive Descriptor List 
(RDL) used by the 82596 to store the destination 
and source addresses, the length field, and the 
status of each frame received (see Figure 14). 


Once enabled, the 82596 checks each passing 
frame for an address match. The 82596 will recog- 
nize its own unique address, one or more multicast 
addresses, or the broadcast address. If a match is 
found the 82596 stores the destination and source 
addresses and the length field in the next available 
RFD. It then begins filling the next available Data 
Buffer on the FBL, which is pointed to by the current 
RFD, with the data portion of the incoming frame. As 
one Data Buffer is filled, the 82596 automatically 
fetches the next DB on the FBL until the entire frame 
is received. This buffer chaining technique is particu- 
larly memory efficient because it allows the system 
designer to set aside buffers to fit frames much 
shorter than the maximum allowable frame length. If 
AL-LOC = 1, or if the flexible memory structure is 
used, the addresses and length field can be placed 
in the Receive Buffer. 


Once the entire frame is received without error, the 
82596 does the following housekeeping tasks. 
• The actual count field of the last Buffer Descrip- 
tor used to hold the frame just received is updat- 
ed with the number of bytes stored in the associ- 
ated Data Buffer. 


• The next available Receive Frame Descriptor is 
fetched. 
• The address of the next available Buffer Descrip- 
tor is written to the next available Receive Frame 
Descriptor. 
• A frame received interrupt status bit is posted in 
the SCB. 


• An interrupt is sent to the CPU. 


If a frame error occurs, for example a CRC error, the 
82596 automatically reinitializes its DMA pointers 
and reclaims any data buffers containing the bad 


4-18 


frame. The 82596 will continue to receive frames 
without CPU help as long as Receive Frame De- 
scriptors and Data Buffers are available. 


82596 NETWORK 
MANAGEMENT 
AND DIAGNOSTICS 


The behavior of data communication networks is 
normally very complex because of their distributed 
and asynchronous nature. It is particularly difficult to 
pinpoint a failure when it occurs. The 82596 has ex- 
tensive diagnostic and network management func- 
tions that help improve reliability and testability. The 
82596 reports on the following events after each 
frame is transmitted. 
• Transmission successful. 
• Transmission unsuccessful. Lost Carrier Sense. 
• Transmission unsuccessful. Lost Clear to Send. 
• Transmission unsuccessful. A DMA underrun oc- 
curred because the system bus did not keep up 
with the transmission. 


• Transmission unsuccessful. The number of colli- 
sions exceeded the maximum allowed. 
• Number of Collisions. The number of collisions 
experienced during transmission of the frame. 


• Heartbeat Indicator. This indicates the presence 
of a heartbeat during the last Interframe Spacing 
(IFS) after transmission. 


When configured to Save Bad Frames the 82596 
checks each.incoming frame and reports the follow- 
ing errors. 
• CRC error. Incorrect CRC in a properly aligned 
frame. 
• Alignment error. Incorrect CRC in a misaligned 
frame. 
• Frame too short. The frame is shorter than the 
value configured for minimum frame length. 


• Overrun. Part of the frame was not placed in 
memory because the system bus did not keep up 
with incoming data. 


• Out of buffer. Part of the frame was discarded 
because of insufficient memory storage space. 
• Receive collision. A collision was detected during 
reception and the destination address of the in- 
coming frame matches the 82596 individual ad- 
dress. Collisions in the preamble are not counted. 
• Length error. A frame not matching the frame 
length parameter was detected. 
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Figure 13. Receive 
Frame Area Diagram 
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Figure 14. Receive 
Frame Descrlptor 
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NETWORK 
PLANNING 
AND 
MAINTENANCE 


To properly plan, operate, and maintain a communi- 
cation network, the network management entity 
must accumulate information on network behavior. 
The 82596 provides a rich set of network-wide diag- 
nostics that can serve as the basis for a network 
management entity. 


Information on network activity is provided in the 
status of each frame transmitted. The 82596 reports 
the following activity indicators after each frame. 
• Number of collisions. The number of collisions 
the 82596 experienced while attempting to trans- 
mit the frame. 
• Deferred transmission. During the first transmis- 
sion attempt the 82596 had to defer to traffic on 
the link. 


The 82596 updates its 32-bit statistical counters af- 
ter each received frame that both passes address 
filtering and is longer than the Minimum Frame 
Length configuration parameter. The 82596 reports 
the following statistics. 
• CRC errors. The number of well-aligned frames 
that experienced a CRC error. 


• Alignment 
errors. The 
number of 
misaligned 
frames that experienced a CRC error. 


• No resources. The number of frames that were 
discarded because of insufficient resources for 
reception. 


• Overrun errors. The number of frames that were 
not completely stored in memory because the 
system bus did not keep up with incoming data. 


• Receive Collision counter. The number of colli- 
sions detected during receive. Collisions occur- 
ring before the minimum frame length will be 
counted as short frames. Collisions in the pream- 
ble will not be counted at all. 


• Short Frame counter. The number of frames that 
were discarded because they were shorter than 
the configured minimum frame length. 


Once again, these counters are not updated until the 
82596 decodes a destination address match. 


The 82596 can be configured to Promiscuous mode. 
In this mode it captures all frames transmitted on the 
network without checking the Destination Address. 
This is useful when implementing a monitoring sta- 
tion to capture all frames for analysis. 


A useful method of capturing frame headers is to 
use the Simplified memory mode, configure the 
82596 to Save Bad Frames, and configure the 
82596 to Promiscuous mode with space in the RFD 
allocated for specific number of receive data bytes. 
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The 82596 will receive all frames and put them in the 
RFD. Frames that exceed the available space in the 
RFD will be truncated, the status will be updated, 
and the 82596 will retrieve the next RFD. This allows 
the user to capture the initial data bytes of each 
frame (for instance, the header) and discard the re- 
mainder of the frame. 


The 82596 also has a monitor mode for network 
analysis. During normal operation the receive func- 
tion enables the 82596 to receive frames that pass 
address filtering. These frames must have the Start 
of Frame Delimiter (SFD) field and must be longer 
than the absolute minimum frame length of 5 bytes 
(6 bytes in case of Multicast address filtering). Con- 
tents and status of the received frames are trans- 
ferred to memory. The monitor function enables the 
82596 to simply evaluate the incoming frames. The 
82596 can monitor the frames that pass or do not 
pass the address filtering. It can also monitor frames 
which do not have the SFD fields. The 82596 can be 
configured to only keep statistical information about 
monitor frames. Three options are available in the 
Monitor mode. These options are selected by the 
two monitor mode configuration bits available in the 
configuration command. 


When the first option is selected, the 82596 receives 
good frames that pass address filtering and trans- 
fers them to memory while monitoring frames that 
do not pass address filtering or are shorter than the 
minimum frame size (these frames are not trans- 
ferred to memory). When this option is used the 
82596 updates six counters: CRC errors, alignment 
errors, no resource errors, overrun errors, short 
frames and total good frames received. 


When the second option is selected, the receive 
function is completely disabled. The 82596 monitors 
only those frames that pass address filterings and 
meet the minimum frame length requirement. When 
this option is used the 82596 updates six counters: 
CRC errors, alignment errors, total frames (good and 
bad), short frames, collisions detected and total 
good frames. 


When the third option is selected, the receive func- . 
tion is completely disabled. The 82596 monitors all 
frames, including frames that do not have a Start 
Frame Delimiter. When this option is used the 82596 
updates six counters: CRC errors, alignment errors, 
total frames (good and bad), short frames, collisions 
detected and total good frames. 
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STATION 
DIAGNOSTICS 
AND SELF-TEST 


The 82596 provides a large set of diagnostic and 
network management functions. These include inter- 
nal and external loopback and time domain reflec- 
tometry for locating fault points in the network cable. 
The 82596 ensures software reliability by dumping 
the contents of the 82596 internal registers into sys- 
tem memory. The 82596 has a self-test mode that 
enables it to run an internal self-test and place the 
results in system memory. 


82586 SOFTWARE 
COMPATIBILITY 


The 82596 has a software-compatible state in which 
all its memory structures are compatible with the 
82586 memory structure. This includes all the Action 
Commands, the Receive Frame Area (including the 
RFD, Buffer Descriptors, and Data Buffers), the Sys- 
tem Control Block, and the initialization procedures. 
There are two minor differences between the 82596 
in the 82586-Compatible memory structure and the 
82586. 
• When the internal and external loopback bits in 
the Configure command are set to 11 the 82596 
is in external loopback and the LPBK pin is acti- 
vated; in the 82586 this situation would produce 
internal loopback. 


• During a Dump command both the 82596 and 
82586 dump the same number of bytes; however, 
the data format is different. 
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INITIALIZING 
THE 82596 


A Reset command is issued to the 82596 to prepare 
it for normal operation. The 82596 is initialized 
through two data structures that are addressed by 
two 
pointers, the 
System Configuration 
Pointer 


(SCP) and the Intermediate System Configuration 
Pointer (ISCP). The initialization procedure begins 
when a Channel Attention signal is asserted after 
RESET. The 82596 uses the address of the double 
word 
that 
contains 
the 
SCP 
as 
a 
default- 


00FFFFF4h. Before the CA signal is asserted this 
default address can be changed to any other avail- 
able address by asserting the PORT pin and provid- 
ing the desired address over the 031-04 pins of the 
address bus. Pins 03-00 must be 0010; i.e., any 
alternative address must be aligned to 
16-byte 


boundaries. All addresses sent to the 82596 must be 
word aligned, which means that all pointers and 
memory structures must start on an even address 
(Ao=zero). 


SYSTEM 
CONFIGURATION 
POINTER 
(SCP) 


The SCP contains the sysbus byte and the location 4 
of the next structure of the initialization process, the 
ISCP. The following parameters are selected in the 
SYSBUS. 


• The 82596 operation mode. 
• The Bus Throttle timer triggering method. 


• Lock enabled. 
• Interrupt polarity. 
• Big Endian 32-bit entity mode. 


Byt~rdering 
is determined by the LE/BE 
pin. 


LE/BE = 1 selects Little Endian byte ordering and 
LE/BE = 0 selects Big Endian byte ordering. 


NOTE: 
In the following, X indicates a bit not checked 
82586 mode. This bit must be set to 0 in all other 
modes. 
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The following 
diagram 
illustrates 
the format 
of the SCP. 
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0 o 
OFFFFF4h 


X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X xix 
X 
X 
X 
X 
X 
X xix 
X 
X 
X 
X 
X 
X 
X 
OFFFFF8h 


A31 ................ 
A24 
A23 
ISCP ADDRESS 
AO OFFFFFCh 


A31 .....•.......... 
A24 
are not checked 
in 82586 mode 
X ........•........... 
X areas are not checked 
in 82586 mode; they must be 0 in all other modes. 


23 
16 


SYS8US 
I BE I 
1 
liNT ILOCK ITRG I 1011 I woI x Il 


W 
L:NOT CHECKED 


Loo 
: 82586 
modo 
o 1 : 32-8it 
Segmonted modo 
1 0 : Linear 
mode 
1 1 : R.s.rv~ 


o : lnternol 
triggering 
of the 
Bus Throttle timers 
1 : ext.rnal triggering of the 
Bus Throttle timers 


0- 
Th. 32-blt 
address 
pointers 
in linear 
mode are treated ~ 
08 two 16-bit big endlcn entities. 
This is identical to 
tho 82596 
Al 
.topping 
doflnitlon. 
1 - 
The 32-blt 
address pointers 
in linear 
mod. or. 
treated 
as 32-bit 
big endian 
entiti... 
This mode i. only 
supported 
In tho 82596 
B stopping. 
In thl. 
modo tho SCB ab.oluto 
address and statistical 
counters are still treated 
08 two 
16-blt 
big endien entitles. 


Interrupt 
polarity 
o - Interrupt 
pin is active 
high 
1 - 
Interrupt 
pin is active 


low 
L.....--- 
0 : Lock function 
enabled 
1 : Lock function 
disabled 


ISCP ADDRESS- 
The physical 
address 
of the ISCP. In the 82586 mode, bits A31-A24 
are considered 
to 
be zero. 


Figure 15. The System Configuration 
Pointer 


Writing the Sysbus 


When writing 
the sysbus byte it is important 
to pay attention 
to the byte order. 


• 
When a Little Endian processor 
is used, the sysbus byte is located at byte address OOFFFFF6h (or address 
n+ 2 if an alternative 
SCP address n was programmed). 


• 
When a processor 
using Big Endian byte ordering 
is used, the sysbus, alternative 
SCP, and ISCP addresses 
will be different. 


• 
The sysbus byte is located 
at OOFFFFF5h . 


• 
If an alternative 
SCP address 
is programmed, 
the sysbus byte should 
be at byte address n+ 1. 
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INTERMEDIATE 
SYSTEM 
CONFIGURATION 
POINTER 
(ISCP) 


The ISCP indicates 
the location 
of the System Control 
Block. Often the SCP is in ROM and the ISCP is in RAM. 
The CPU loads the SCB address 
(or an equivalent 
data structure) 
into the ISCP and asserts CA. This Channel 
Attention 
signal causes 
the 82596 
to begin its initialization 
procedure 
and to get the SCB address 
from the 
ISCP and SCP. In 82586 and 32-bit Segmented 
modes the SCP base address 
is also the base address 
of all 
Command 
Blocks, 
Frame Descriptors, 
and Buffer Descriptors 
(but not buffers). 
All these data structures 
must 
reside in one 64-KB segment; 
however, 
in Linear mode no such limitation 
is imposed. 


The following 
diagram 
illustrates 
the ISCP format. 


ODD WORD 
EVENWORD 
8 7 
o 
16 15 
31 
SCBOFFSET 
A23 
A15 
AO 
BUSY 
SCB BASE ADDRESS 
r 
X X X X X X X X -in 
82586 mode 
A31 ... ,' , , , , , , , , , , ,A24 - 
in 32·bit segmented mode, 


BUSY 
- 
Indicates 
that the 82596 is being initialized, 
The CPU sets the ISCP to 01 h before it gives 
the first CA to the 82596, The ISCP is cleared 
by the 82596 after the SCB base and offset 
are read. Note that the most significant 
byte of the first word of the ISCP is not modified 
when BUSY is cleared, 


SCB OFFSET-This 
16-bit quantity 
specifies 
the offset 
portion 
of the address 
of the SCB, 


SCB BASE 
- 
Specifies 
the base portion of the address of the SCB, The base of SCB is also the base of 
all 82596 
Command 
Blocks, 
Frame 
Descriptors 
and, Buffer 
Descriptors. 
In the 
82586 
mode, bits A31-A24 
are considered 
to be zero, 


Figure 16.The Intermediate System Configuration 
Polnter-82586 
and 32-Blt Segmented Modes 


EVENWORD 
31 
16 15 
8 7 
0 
ODD WORD 


1~~=3=1 
O~=O~==,'='=''='=''='='=''='=''='=''='=''='='=''='='~=~=~=~='~=~=~='~=~T=' 
~='=~'=~=~'=~=~ss=' 
'='=',=~o=~o=~o~I~=====~B~U~S~Y=====A=O 
I::~: + 4 


BUSY 
- 
Indicates 
that the 82596 is being initialized. 
The ISCP is set to 01 h by the CPU before its 
first CA to the 82596, 
It is cleared 
by the 82596 after the SCB address 
is read, 


SCB ADDRESS- 
This 32-bit quantity 
specifies 
the physical 
address 
of the SCB, 


Figure 17.The Intermediate System Configuration 
Pointer-Linear 
Mode. 


INITIALIZATION 
PROCESS 


The CPU sets up the SCP, ISCP, and the SCB structures, 
and, if desired, 
an alternative 
SCP address, 
It also 
sets BUSY to 01 h. The 82596 is initialized when a Channel Attention 
signal follows 
a,Reset signal, causing the 
82596 
to access 
the System 
Configuration 
Pointer, 
The sysbus ~ 
the operational 
mode, the bus throttle 
timer triggering 
method, 
the interrupt 
polarity, 
and the state of LOCK are read, After 
reset the Bus Throttle 
timers are essentially 
disabled-the 
T-ON value is infinite, the T-OFF value is zero, After the SCP is read, the 
82596 
reads the ISCP and saves the SCB address. 
In 82586 
and 32-bit Segmented 
modes 
this address 
is 
represented 
as a base address 
plus the offset 
(this base address 
is also the base address 
of all the control 
blocks), 
In Linear mode the base address 
is also an absolute 
address, 
The 82596 clears BUSY, sets CX and 
CNR to equal 1 in the SCB, clears the SCB command 
word, sends an interrupt to the CPU, and awaits another 
Channel 
Attention 
signal. RESET configures 
the 82596 to its default 
state before CA is asserted, 
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CONTROLLING 
THE 82596CA 


The host CPU controls 
the 82596 with the commands, 
data structures, 
and methods 
described 
in this section. 


The CPU and the 82596 communicate 
through 
shared 
memory 
structures. 
The 82596 contains 
two indepen- 
dent units: the Command 
Unit and the Receive 
Unit. The Command 
Unit executes 
commands 
from the CPU, 


and the Receive 
Unit handles 
frame 
reception. 
These 
two units are controlled 
and monitored 
by the CPU 
through 
a shared 
memory 
structure 
called the System 
Control 
Block (SCB). The CPU and the 82596 
use the 
CA and INT signals to communicate 
with the SCB. 


82596 CPU ACCESS 
INTERFACE 
(PORT) 


The 82596 
has a CPU access 
interface 
that allows the host CPU to do four things. 


• 
Write an alternative 
System 
Configuration 
Pointer address. 


• 
Write an alternative 
Dump area pointer 
and perform 
Dump. 


• 
Execute 
a software 
reset. 


• 
Execute 
a self-test. 


The following 
events 
initiate the CPU access 
state. 


• 
Presence 
of an address 
on the 031-04 
data bus pins. 


• 
The 03-00 
pins are used to select 
one of the four functions. 


• 
The PORT input pin is asserted, 
as in a regular write cycle. 


NOTE. 
The SCP Dump and Self-Test 
addresses 
must be 16-byte aligned. 


The 82596 requires two 16-bit write cycles for a port command. 
The first write holds the internal 
machines 
and 
reads the first 16 bits; the second 
activates 
the PORT command 
and reads the second 
16 bits. 


The PORT Reset is useful when only the 82596 needs to be reset. The CPU must wait for 10-system 
and 5-se- 
rial clocks 
before 
issuing another 
CA to the 82596; this new CA begins a new initialization 
process. 


The Damp function 
is useful for troubleshooting 
No Response 
problems. 
If the chip is in a No Response 
state, 
the P 
RT Dump operation 
can be executed 
and a PORT Reset can be used to reinitialize 
the 82596 without 


disturbing 
the rest of the system. 


The Self-Test 
function 
can be used for board testing; the 82596 will execute 
a self-test 
and write the results to 


memory. 


Table 2. PORT Function 
Selection 


031 .................................. 
04 ............................. 
00 


Function 
Addresses 
and Results 
03 
02 
01 
DO 


Reset 
A31 
Don't Care 
A4 
0 
0 
0 
0 


Self-Test 
A31 
Self-Test 
Results Address 
A4 
0 
0 
0 
1 


SCP 
A31 
Alternative 
SCP Address 
A4 
0 
0 
1 
0 


Dump 
A31 
Dump Area Pointer 
A4 
0 
0 
1 
1 


MEMORY 
ADDRESSING 
FORMATS 


The 82596 
accesses 
memory 
by 32-bit addresses. 
There 
are two types of 32-bit addresses: 
linear and seg- 
mented. 
The type of address 
used depends 
on the 82596 operating 
mode and the type of memory 
structure 
it 
is addressing. 
The 82596 
has three operating 
modes. 
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• 
82586 
Mode 


• 
A Linear address 
is a single 24-bit entity. Address 
pins A31-A24 
are always zero. 


• 
A Segmented 
address 
uses a 24-bit base and a 16-bit offset. 


• 
32-bit Segmented 
Mode 


• 
A Linear address 
is a single 32-bit entity. 


• 
A Segmented 
address 
uses a 32-bit base and a 16-bit offset. 


NOTE: 
In the previous 
two 
memory 
addressing 
modes, 
each 
command 
header 
(CB, TBO, 
AFO, 
ABO, and SCB) 
must wholly 
reside within one segment. 
If the 82596 encounters 
a memory 
structure 
that does not follow 
this 
restriction, 
the 82596 will fetch the next contiguous 
location 
in memory 
(beyond 
the segment). 


• 
Linear Mode 


• 
A Linear address 
is a single 32-bit entity. 


• 
There are no Segmented 
addresses. 


Linear addresses 
are primarily 
used to address 
transmit 
and receive 
data buffers. 
In the 82586 
and 32-bit 
Segmented 
modes, segmented 
addresses 
(base plus offset) are used for all Command 
Blocks, Buffer Oescrip- 
tors, 
Frame 
Oescriptors, 
and 
System 
Control 
Blocks. 
When 
using 
Segmented 
addresses, 
only 
the 
offset 
portion 
of the entity being addressed 
is specified 
in the block. The base for all offsets 
is the same-that 
of the 
SCB. See Table 
1. 


LITTLE 
ENDIAN 
AND BIG ENDIAN 
BYTE ORDERING 


The 82596 supports 
both Little Endian and Big Endian byte ordering 
for its memory 
structures. 


The 82596 A 1 stepping 
supports 
Big Endian byte ordering 
for word and byte entities. 
Dword entities 
are not 
supported 
with 82596 
A 1 Big Endian 
byte ordering. 
This results 
in slightly 
different 
82596A 1 memory 
struc- 
tures for Big Endian operation. 
These structures 
are defined 
in the 32-8it 
LAN Components 
User's 
Manual. •• 


The 82596 
B stepping 
supports 
Big Endian 
byte ordering 
for Linear mode only. All 82596 
B 32-bit address 
pointers 
are treated 
as 32-bit Big Endian entities, 
however, 
the SCB absolute 
address 
and statistical 
counters 
are treated 
as two 16-bit Big Endian entities. This 32-bit Big Endian entity support 
is configured 
through 
bit 7 in 
the SYSBUS 
byte. 


The 82596 C-step 
has a New Enhanced 
Big Endian Mode where 
in Linear Addressing 
mode, true 32-bit Big 
Endian functionality 
is achieved. 
New Enhanced 
Big Endian Mode is enabled 
exactly the same as the B-step, 
by setting bit 7 of the SYSBUS 
byte. This mode is software 
compatible 
with the big endian mode of the B-step 
with one exception-no 
32-bit addresses 
need to be swapped 
by software 
in the C-step. In this new mode, the 
82596 C-step treats 32-bit address 
pointers as true 32-bit entities and the SCB absolute 
address and statistical 
counters 
are still treated as two 16-bit big endian entities. 
Not setting this mode will configure 
the 82596 C-step 
to be 100% 
compatible 
to the A1-step 
big endian 
mode. 


NOTE: 
All 82596 
memory 
entities 
must be word or dword 
aligned, 
except 
the transmit 
buffers 
can be byte aligned 
for the 82596 
B or C-steppings. 


An example 
of a dword entity is a frame descriptor 
command/status 
dword, whereas 
the raw data of the frame 
are byte entities. 
Both 32- and 16-bit buses are supported. 
When a 16-bit bus is used with Big Endian memory 
organization, 
data lines 
015-00 
are used. The 82596 
has an internal 
crossover 
that 
handles 
these 
swap 
operations. 
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COMMAND 
UNIT (CU) 


The Command 
Unit is the logical unit that executes 
Action Commands 
from a list of commands 
very similar to 
a CPU program. 
A Command 
Block is associated 
with each Action Command. 
The CU is modeled 
as a logical 
machine 
that takes, at any given time, one of the following 
states. 


• 
Idle. The CU is not executing 
a command 
and is not associated 
with a CB on the list. This is the initial state. 


• 
Suspended. 
The CU is not executing 
a command; 
however, 
it is associated 
with a CB on the list. 


• 
Active. 
The 'CU is executing 
an Action 
Command 
and pointing 
to its CB. 


The CPU can affect 
CU operation 
in two ways: 
by issuing 
a CU Control 
Command 
or by setting 
bits in the 
Command 
word of the Action 
Command. 


When 
programming 
the 82596 
CU, it is important 
to consider 
the asynchronous 
way the 82596 
processes 
commands. 
If a command 
is issued to the 82596 CU, it may be busy processing 
other commands. 
In order to 
avoid asynchronous 
race conditions, 
the following 
guidelines 
are recommended 
to the 82596 
programmer: 


• 
If the CU is already 
in the Active 
state, 
and another 
command 
needs 
to be executed, 
it is unwise 
to 
immediately 
issue another 
CU Start 
command. 
If a new command 
(or list of commands) 
needs 
to be 
started, 
first issue a CU Suspend 
command, 
wait for the CU to become 
Suspended, 
then issue the new CU 
Start. This will insure that all commands 
are processed 
correctly. 


• 
In general, 
it is a good 
idea to make 
sure any CU command 
has been accepted 
and executed 
before 
issuing a new control 
command 
to the CU. 
I 


RECEIVE 
UNIT (RU) 


The Receive 
Unit is the logical unit that receives 
frames and stores them in memory. The RU is modeled 
as a 
logical 
machine 
that takes, at any given time, one of the following 
states. 


• 
Idle. The RU has no memory 
resources 
and is discarding 
incoming 
frames. 
This is the initial state. 
• No Resources. 
The RU has no memory 
resources 
and is discarding 
incoming 
frames. 
This state differs 
from Idle in that the RU accumulates 
statistics 
on the number of discarded 
frames. 


• 
Suspended. 
The RU has memory 
available 
for storing 
frames, 
but is discarding 
them. The suspend 
state 
can only be reached 
if the CPU forces 
this through 
the SCB or sets the suspend 
bit in the RFD. 


• 
Ready. 
The RU has memory 
available 
and is storing 
incoming 
frames. 


The CPU can affect 
RU operation 
in three ways: by issuing an RU Control 
Command, 
by setting 
bits in the 
Frame Descriptor 
Command 
word of the frame being received, 
or by setting the EL bit of the current 
buffer's 
Buffer Descriptor. 


When 
programming 
the 82596 
RU, it is important 
to consider 
the asynchronous 
way the 82596 
processes 
receive frames. 
If an RU Start is issued to the 82596 RU, it may be busy processing 
other incoming 
packets. 
In 
order to avoid asynchronous 
race conditions, 
the following 
guidelines 
are recommended 
to the 82596 
pro- 
grammer: 


• 
If the RU is already in the Ready state, and a new RFA is required to be started, 
it is unwise to immediately 
issue another 
RU Start command. 
If the new RFA needs to be started, 
first issue an RU Suspend 
com- 
mand, wait for the RU to become Suspended, 
then issue the new RU Start. This will insure that all incoming 
frames 
are received 
correctly. 


• 
In general, 
it is' a good 
idea to make 
sure any RU command 
has been 
accepted 
and executed 
before 
issuing a new control 
command 
to the RU. 
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SYSTEM, CONTROL 
BLOCK (SCB) 


The SCB is a memory 
block that plays a major role in communications 
between 
the CPU and the 82596. Such 
communications 
include 
the following. 


• 
Commands 
issued by the CPU 


• 
Status reported 
by the 82596 


Control 
commands 
are sent to the 82596 
by writing 
them 
into the SCB and then asserting 
CA. The 82596 
examines 
the command, 
performs 
the 
required 
action, 
and then 
clears 
the SCB command 
word. 
Control 
commands 
perform 
the following 
types of tasks. 


• 
Operation 
of the Command 
Unit (CU). The SCB controls 
the CU by specifying 
the address 
of the Command 
Block List (CBl) 
and by starting, 
suspending, 
resuming, 
or aborting 
execution 
of CBl 
commands. 


• 
Operation 
of the Bus Throttle. 
The SCB controls 
the Bus Throttle 
timers by providing 
them with new values 
and sending the load 
and Start timer commands. 
The timers can be operated 
in both the 32-bit Segmented 
and Linear modes, 


• 
Reception 
of frames 
by the Receive 
Unit (RU). The SCB controls 
the RU by specifying 
the address 
of the 
Receive 
Frame Area and by starting, 
suspending, 
resuming, 
or aborting 
frame reception. 


• 
Acknowledgment 
of events 
that cause interrupts. 


• 
Resetting 
the chip. 


The 82596 
sends 
status 
reports 
to the CPU via the System 
Control 
Block. The SCB contains 
four types of 
status 
reports. 


• 
The cause 
of the current 
interrupts. 
These 
interrupts 
are caused 
by one or more of the following 
82596 


events. 
11 
• 
The Command 
Unit completes 
an Action 
Command 
that has its I bit set. 
~ 


• 
The Receive 
Unit receives 
a frame. 


• 
The Command 
Unit becomes 
inactive. 


• 
The Receive 
Unit becomes 
not ready. 


• 
The status of the Command 
Unit. 


• 
The status of the Receive 
Unit. 


• 
Status reports 
from the 82596 
regarding 
reception 
of corrupted 
frames. 


I 
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Events can be cleared only by CPU acknowledgment. If some events are not acknowledged by the ACK field 
the Interrupt signal (INT) will be reissued after Channel Attention (CA) is processed. Furthermore, if a new 
event occurs while an interrupt is set, the interrupt is temporarily cleared to trigger edge-triggered interrupt 
controllers. 


The CPU uses the Channel Attention line to cause the 82596 to examine the SCB. This signal is trailing-edge 
triggered-the 
82596 latches CA on the trailing edge. The latch is cleared by the 82596 before the SCB 
control command is read. 


ODD WORD 
EVEN WORD 
31 
16 
15 
o 


SCB 


SCB + 4 


SCB + 8 


SCB + 12 


ACK 
Ixl 
CUC 
IRI 
RUC 
Ix 
x 
X 
X 
STAT 
101 
CUS 
I 0 I 
RUS 
I 0 
0 0 
0 


RFAOFFSET 
CBLOFFSET 


ALIGNMENT 
ERRORS 
CRCERRORS 


OVERRUN 
ERRORS 
RESOURCE 
ERRORS 


Figure 18. SCB-82586 
Mode 


°In rnorutor 
mode these 
counters 
change 
function 


ODD WORD 
EVEN WORD 
31 
16 
15 
o 


SCB 


SCB + 4 


SCB + 8 


SCB + 12 


SCB + 16 


SCB + 20 


SCB + 24 


SCB + 28 


SCB + 32 


ACK 
101 
CUC 
1R 1 
RUC 
10 
o 0 
01 
STAT 
101 
CUS 
I 
RUS 
ITlo 
0 
0 


RFAOFFSET 
1 
CBLOFFSET 


CRCERRORS 


ALIGNMENT 
ERRORS 


RESOURCE 
ERRORS 
(0) 


OVERRUN 
ERRORS 
(0) 


RCVCDT 
ERRORS 
(0) 


SHORT 
FRAME 
ERRORS 


T-ONTIMER 
1 
T-OFFTIMER 


Figure 19. SCB-32-Blt 
Segmented Mode 


31 
ODD WORD 
EVEN WORD 
16 
15 
o 


SCB 


SCB + 4 


SCB + 8 


SCB + 12 


SCB + 16 


SCB + 20 


SCB + 24 


SCB + 28 


SCB + 32 


SCB + 36 


ACK 
101 
CUC 
I RI 
RUC 
10.0 
0 0 I 
STAT 
101 
CUS 
I 
RUS 
ITlo 
0 0 


COMMAND 
BLOCK ADDRESS 


RECEIVE 
FRAME AREA ADDRESS 


CRCERRORS 


ALIGNMENT 
ERRORS 


RESOURCE 
ERRORS 
(0) 


OVERRUN 
ERRORS 
(0) 


RCVCDT 
ERRORS 
(0) 


SHORT 
FRAME 
ERRORS 


T-ONTIMER 
1 
T-OFFTIMER 


°In MONITOR 
mode these 
counters 
change 
function 


Figure 20. SCB-L1near Mode 
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Command 
Word 
31 


82596CA 


16 


'-----'--_A--'?'-K----J'-----'QJ __ 
:'---C_UC----'--:-----l0__ :'---R_U_C 
L: -----l~ 
SCB 
+ 2 


These bits specify the action to be performed 
as a result of a CA. This word is set by the CPU and cleared 
by 


the 82596. Defined 
bits are: 


Bit 31 ACK-CX 


Bit 30 ACK-FR 


Bit 29 ACK-CNA 


Bit 28 ACK-RNR 


Bits 24-26 CUC 


Bit 23 RESET 


Bits 20-22 RUC 


I 


- 
Acknowledges 
that the CU completed 
an Action 
Command. 


- 
Acknowledges 
that the RU received 
a frame. 


- 
Acknowledges 
that the Command 
Unit became 
not active. 


- 
Acknowledges 
that the Receive 
Unit became 
not ready. 


- 
(3 bits) This field contains 
the command 
to the Command 
Unit. Valid values 
are: 
o 
- 
NOP (does not affect 
current 
state of the unit). 


1 
- 
Start execution 
of the first command 
on the CBL. If a command 
is executing, 


complete 
it before starting 
the new CBL. The beginning 
of the CBl 
is in CBl 


OFFSET 
(address). 


2 
- 
Resume the operation 
of the Command 
Unit by executing 
the next command. 


This 
operation 
assumes 
that 
the Command 
Unit has been 
previously 
sus- 


pended. 


3 
- 
Suspend 
execution 
of commands 
on CBl 
after current command 
is complete. 


4 
- 
Abort current 
command 
immediately. 


5 
- 
loads 
the Bus Throttle 
timers 
so they will be initialized 
with their new values 
after the active timer (T-ON or T-OFF) 
reaches 
Terminal 
Count. If no timer is 
•• 
active 
new values 
will be loaded 
immediately. 
This command 
is not valid in 
~ 
82586 mode. 


6 
- 
loads 
and immediately 
restarts 
the Bus Throttle 
timers with their new values. 


This command 
is not valid in 82586 mode. 


7 
- 
Reserved. 


- 
Reset chip (logically 
the same as hardware 
RESET). 


- 
(3 bits) This field contains 
the command 
to the Receive 
Unit. Valid values 
are: 


o 
- 
NOP (does not alter current 
state of unit). 


1 
- 
Start reception 
of frames. 
The beginning 
of the RFA is contained 
in the RFA 


OFFSET 
(address). 
If a frame 
is being 
received 
complete 
reception 
before 
starting. 


2 
- 
Resume 
frame reception 
(only when in suspended 
state). 


3 
- 
Suspend 
frame reception. 
If a frame is being received 
complete 
its reception 
before 
suspending. 


4 
- 
Abort receiver 
operation 
immediately. 


5-7 - 
Reserved. 
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Status Word 
15 
o 


_----'-_S_T,'_AT_--'-------'~'_ 
_ 
_'_:_GU_S--L:_-'~L- 
_ 
_'_:_RU_S--L: 
__ 
~ 
SGB 


82586 
mode 


15 
o 


__ ~ 
R~y_S 
__ ~_~ 
SGB 
I 
: 
ST,AT: 
I~ 
:GUS: 


32-Bit Segmented 
and Linear mode. 


Indicates 
the status of the 82596. This word is modified 
only by the 82596. 
Defined 
bits are: 


Bit 15 CX 


Bit 14 FR 


Bit 13 CNA 


Bit 12 RNR 


Bits 8-10 
CUS 


Bits 4-7 
RUS 


Bit 3 T 


- 
The CU finished 
executing 
a command 
with its I (interrupt) 
bit set. 


- 
The RU finished 
receiving 
a frame. 


- 
The Command 
Unit left the Active 
state. 


- 
The Receive 
Unit left the Ready state. 


- 
(3 bits) This field contains 
the status of the command 
unit. Valid values are: 
o 
-Idle 


1 
- 
Suspended 


2 
-Active 


3-7 
- 
Not used 


- 
This field contains 
the status of the receive 
unit. Valid values are: 


Oh (0000) 
- 
Idle 


1h (0001) 
- 
Suspended 


2h (0010) 
- 
No Resources. 
This 
bit indicates 
both 
no resources 
due to lack 
of 
RFDs in the RDL and no resources 
due to lack of RBDs in the FBL. 


4h (0100) 
- 
Ready, 


Ah (1010) 
- 
No resources 
due to no more RBDs (not in the 82586 
mode). 


Ch (1100) 
- 
No more RBDs (not in 82586 mode) 


No other combinations 
are allowed 


- 
Bus Throttle 
timers 
loaded 
(not in 82586 
mode). 


SCB OFFSET 
ADDRESSES 


CBl 
Offset (Address) 


In 82586 
and 32-bit Segmented 
modes this 16-bit quantity 
indicates 
the offset 
portion 
of the address 
for the 
first Command 
Block on the CBL. In Linear mode it is a 32-bit linear address 
for the first Command 
Block on 
the CBL. It is accessed 
only if CUC equals Start. 


RFA Offset (Address) 


In 82586 
and 32-bit Segmented 
modes this 16-bit quantity 
indicates 
the offset 
portion 
of the address 
for the 
Receive 
Frame Area. In Linear mode it is a 32-bit linear address 
for the Receive 
Frame Area. It is accessed 
only if RUC equals Start. 
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see STATISTICAL 
COUNTERS 


Statistical Counter Operation 


• The CPU is responsible for clearing all error counters before initializing the 82596. The 82596 updates 
these counters by reading them, adding 1, and then writing them back to the SCB. 
• The counters are wraparound counters. After reaching FFFFFFFFh the counters wrap around to zero. 
• The 82596 updates the required counters for each frame. It is possible for more than one counter to be 


updated; multiple errors will result in all affected counters being updated. 


• The 82596 executes the read-counter/incrementlwrite-counter 
operation without relinquishing the bus 


(locked operation). This is to ensure that no logical contention exists between the 82596 and the CPU due 
to both attempting to write to the counters simultaneouslYClnthe dual-port memory configuration the CPU 
should not execute any write operation to a counter if LO K is asserted. 


• The counters are 32-bits wide and their behavior is fully compatible with the IEEE 802.3 standard. The 


82596 supports all relevant statistics (mandatory, optional, and desired) through the status of the transmit 
and receive header'and directly through SCB statistics. 


CRCERRS 


This 32-bit quantity contains the number of aligned frames discarded because of a CRC error. This counter is 
updated, if needed, regardless of the RU state. 


ALNERRS 


This 32-bit quantity contains the number of frames that both are misaligned (i.e., where CRS deasserts on a 
nonoctet boundary) and contain a CRC error. The counter is updated, if needed, regardless of the RU state. 


SHRTFRM 


This 32-bit quantity contains the number of received frames shorter than the minimum frame length. 


The last three counters Changefunction in monitor mode. 


RSCERRS 


This 32-bit quantity contains the number of good frames discarded because there were no resources to 
contain them. Frames intended for a host whose RU is in the No Receive Resources state, fall into this 
category. This counter is updated only if the RU is in the No Resources state. When in Monitor mode this 
counter counts the total number of frames-good 
and bad. 


I 


•• 
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OVRNERRS 


This 32-bit quantity 
contains 
the number 
of frames 
known 
to be lost because 
the local system 
bus was not 
available. 
If the traffic 
problem 
lasts longer than the duration 
of one frame, the frames 
that follow the first are 
lost without 
an indicator, 
and they are not counted. 
This counter 
is updated, 
if needed, 
regardless 
of the RU 
state. 


This 32-bit counter 
contains 
the number of collisions 
detected 
during frame reception. 
This counter 
will only be 
updated 
if at least 64 bytes of data are received 
before 
the collision 
occurs. 
If a collision 
occurs 
before 
64 
bytes of data are received, 
the frame is counted 
as a short frame. 
If the collision 
occurs 
in the preamble, 
no 
counters 
are incremented. 


ACTION 
COMMANDS 
AND OPERATING 
MODES 


This section 
lists all the Action Commands 
of the Command 
Unit Command 
Block List (CBl). 
Each command 
contains 
the Command 
field, the Status 
and Control 
fields, 
the link to the next Action 
Command, 
and any 
command-specific 
parameters. 
There 
are three 
basic types 
of action 
commands: 
82596 
Configuration 
and 
Setup, Transmission, 
and Diagnostics. 
The following 
is a list of the actual commands. 


• 
NOP 
• 
Transmit 


• 
Individual 
Address 
Setup 
• 
TOR 


• 
Configure 
• 
Dump 


• 
MC Setup 
• 
Diagnose 


The 82596 has three addressing 
modes. 
In the 82586 mode all the Action 
Commands 
look exactly 
like those 
of the 82586. 


• 
82586 
Mode. The 82596 software 
and memory 
structure 
is compatible 
with the 82586. 


• 
32-Blt Segmented 
Mode. The 82596 can access the entire system memory and use the two new memory 
structures-Simplified 
and Flexible-while 
still using the segmented 
approach. 
This does not require any 
significant 
changes 
to existing 
software. 


• 
Linear Mode. The 82596 operates 
in a flat, linear, 4 gigabyte 
memory 
space without 
segmentation. 
It can 
also use the two new memory 
structures. 


In the 32-bit Segmented 
mode there are some differences 
between 
the 82596 and 82586 action commands, 


mainly in programming 
and activating 
new 82596 features. 
Those 
bits marked 
"don't 
care" 
in the compatible 
mode are not checked; 
however, 
we strongly 
recommend 
that those 
bits all be zeroes; 
this will allow future 
enchancements 
and extensions. 


In the Linear 
mode 
all of the address 
offsets 
become 
32-bit 
address 
pointers. 
All new 82596 
features 
are 
accessible 
in this mode, and all bits previously 
marked 
"don't 
care" 
must be zeroes. 


The Action Commands, 
and all other 82596 memory structures, 
must begin on even byte boundaries, 
l.e., they 
must be word aligned. 
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NOP 


This command 
results in no action by the 82596 except for those performed 
in the normal command 
process- 
ing. It is used to manipulate 
the eBl 
manipulation. 
The format 
of the NOP command 
is shown 
in Figure 21. 


N0P-82586 
and 32·Blt Segmented Modes 


ODD WORD 
16 15 
EVEN WORD 
o 


XXXXXXX 
OCBOKOOOOOOO 
0000000 
x 
X 
X 
X 
X 
X 
X 
X 
A15 
LINK OFFSET 
A04 


ODD WORD 
EVEN WORD 
0 
o 
0 
0 
0 
0 
0 
0 
00000000000000 


AO 4 


Figure 21 


where: 


LINK POINTER 
- 
In the 82586 
or 32-bit 
Segmented 
modes 
this is a 16-bit offset 
to the next Command 
Block. 
In the Linear mode this is the 32-bit address 
of the next Command 
Block. 


El 
- 
If set, this bit indicates 
that this command 
block is the last on the CBL. 


S 
- 
If set to one, suspend 
the CU upon completion 
of this CB. 


I 
- 
If set to one, the 82596 
will generate 
an interrupt 
after 
execution 
of the command 
is 
• 
complete. 
If I is not set to one, the CX bit will not be set. 


CMD (bits 16-18) 
- 
The NOP command. 
Value: Oh. 


Bits 19-28 
- 
Reserved 
(zero in the 32-bit Segmented 
and Linear modes). 


C 
- 
This bit indicates 
the execution 
status of the command. 
The CPU initially resets it to zero 
when the Command 
Block is placed on the CBL. Following 
a command 
Completion, 
the 


82596 will set it to one. 


B 
- 
This bit indicates 
that the 82596 
is currently 
executing 
the NOP command. 
It is initially 


reset to zero by the CPU. The 82596 sets it to one when execution 
begins and to zero 
when execution 
is completed. 
This bit is also set when the 82596 
prefetches 
the com- 
mand. 


NOTE: 
The C and B bits are modified 
in one operation. 


OK 
- 
Indicates 
that the command 
was executed 
without 
error. If set to one no error occurred 


(command 
executed 
OK). If zero an error occured. 


Individual Address Setup 


This command 
is used to load the 82596 with the Individual 
Address. 
This address 
is used by the 82596 for 
inserting 
the Source 
Address 
during transmission 
and recognizing 
the Destination 
Address 
during reception. 
After 
RESET, and prior to Individual 
Address 
Setup Command 
execution, 
the 82596 assumes 
the Broadcast 


Address 
is the Individual 
Address 
in all aspects, 
i.e.: 


• 
This will be the Individual 
Address 
Match 
reference. 


• 
This will be the Source 
Address 
of a transmitted 
frame 
(for Al-lOC=O 
mode only). 


I 
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The format 
of the Individual 
Address 
Setup command 
is shown 
in Figure 22. 


lA Setup--82586 
and 32-Blt Segmented 
Modes 


31 
ODD WORD 
16 15 
EVEN WORD 
0 


ELlsl 
I Ix 
x 
x 
x 
x 
x 
x 
x 
x 
xlo 
0 
1 
cl 
B IOKI A 10 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
o 0 


INDIVIDUAL 
ADDRESS 
1st byte A15 
LINK OFFSET 
AO 4 


6th byte 
5th byte 
4th byte 
3rd byte 
8 


lA Setup-Linear 
Mode 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl S II 
10 
0 
0 
0 
0 
0 
0 
0 
0 
010 
0 
1 
cl 
B IOKI A 10 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
o 0 


A31 
LINK ADDRESS 
AO 4 


4th byte 
3rd byte 
INDIVIDUAL 
ADDRESS 
1st byte 
8 


6th byte 
5th byte 
C 


Figure 22 


where: 


LINK ADDRESS, 
EL, B, C, I, S 


A 


- 
As per standard 
Command 
Block (see the NOP command 
for details) 


- 
Indicates 
that the command 
was abnormally 
terminated 
due to CU Abort 
control 
command. 
If one, then the command 
was aborted, 
and if necessary 
it should 
be 
repeated. 
If this bit is zero, the command 
was not aborted. 


Bits 19-28 
- 
Reserved 
(zero in the 32-bit Segmented 
and Linear modes). 


CMD (bits 16-18) 
- 
The Address 
Setup command. 
Value: 1h. 


INDIVIDUAL 
ADDRESS 
- 
The individual 
address 
of the node, 0 to 6 bytes long. 


The least significant 
bit of the Individual 
Address 
must be zero for Ethernet 
(see the Command 
Structure). 


However, 
no enforcement 
of 0 is provided 
by the 82596. 
Thus, 
an Individual 
Address 
with 
1 as its least 
significant 
bit is a valid Individual 
Address 
in all aspects. 


The default 
address 
length 
is 6 bytes long, as in 802.3. 
If a different 
length 
is used the lA Setup command 
should 
be executed 
after the Configure 
command. 


Configure 


The 
Configure 
command 
loads 
the 82596 
with 
its operating 
parameters. 
It allows 
changing 
some 
of the 
parameters 
by specifying 
a byte count less than the maximum 
number of configuration 
bytes (11 in the 82586 
mode, 
14 in the 32-Bit 
Segmented 
and 
Linear 
modes). 
The 82596 
configuration 
depends 
on its mode 
of 
operation. 
When configuring 
the 12th byte (Byte 11 undefined) 
in 82586 
mode this byte should 
be all ones. 


• 
In the 82586 
mode the maximum 
number 
of configuration 
bytes is 12. Any number 
larger than 
12 will be 
reduced 
to 12 and any number 
less than 4 will be increased 
to 4. 


• 
The additional 
features 
of the serial side are disabled 
in the 82586 
mode. 


• 
In both the 32-Bit Segmented 
and Linear modes 
there are four additional 
configuration 
bytes, which 
hold 
parameters 
for additional 
82596 features. 
If these parameters 
are not accessed, 
the 82596 will follow 
their 
default 
values. 


• 
For more detailed 
information 
refer to the 32-Bit 
LAN Components 
User's 
Manual. 
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The format 
of the Configure 
command 
is shown 
in Figure 23, 24 and 25. 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl S I1 
I X 
X 
X 
X 
X 
X 
X 
X 
X 
xlo 
1 
o 
cl 
B IOKI A I 0 ·0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
o 0 


Byte 1 
Byte 0 
A15 
LINK OFFSET 
AO4 


Byte 5 
Byte 4 
Byte 3 
Byte 2 
8 


Byte 9 
Byte 8 
Byte 7 
Byte 6 
12 


X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
Byte 10 
16 


Figure 23. cONFIGURE-82586 
Mode 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl S I1 
10 
0 
0 
0 
0 
0 
0 
0 
0 
010 
1 
o 
cl 
B IOKI A I 0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
o 0 


Byte 1 
Byte 0 
A15 
LINK OFFSET 
AO4 


Byte 5 
Byte 4 
Byte 3 
Byte 2 
8 


Byte 9 
Byte 8 
Byte 7 
Byte 6 
12 


Byte 13 
Byte 12 
Byte 11 
Byte 10 
16 


Figure 24. cONFIGURE-32-Blt 
Segmented Mode 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl S I1 
10 
0 
0 
0 
0 
0 
0 
0 
0 
010 
1 
o 
cl 
B IOKI A I 0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
o 0 


A31 
LINK ADDRESS 
AO4 


Byte 3 
Byte 2 
Byte 1 
Byte 0 
8 


Byte 7 
Byte 6 
Byte 5 
Byte 4 
12 


Byte 11 
Byte 10 
Byte 9 
Byte 8 
16 


X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
Byte 13 
Byte 12 
20 


Figure 25. CON FIGURE-LInear 
Mode 


LINK ADDRESS, 
- 
As per standard 
Command 
Block (see the NOP command 
for details) 
EL, B,C, I, S 


A 
- 
Indicates 
that the command 
was abnormally 
terminated 
due to a CU Abort control 
com- 
mand. If 1, then the command 
was aborted 
and if necessary 
it should be repeated. 
If this 
bit is 0, the command 
was not aborted. 


Bits 19-28 
- 
Reserved 
(zero in the 32-Bit Segmented 
and Linear Modes) 


CMD (bits 16-18) 
- 
The CON FIGURE 
command. 
Value: 2h. 


The interpretation 
of the fields follows: 


7 
6 
5 
3 
2 
1 
4 
o 


BYTE?OUNT 
p 
X 
X 
X 


BYTE 0 


BYTE CNT (Bits 0-3) 
Byte Count. Number 
of bytes, including 
this one, that hold pa- 
rameters 
to be configured. 


Enable 
the 
82596 
to write 
the 
prefetched 
bit in all prefetch 
RBDs. 
PREFETCHED 
(Bit 7) 


I 


• 
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____________ 
o 
o 
o 
o 
.•.. 
~R_E_S_UM 
__~ __R9~ 
__ o 
1 
I 
SAVBF 


BYTE 
2 


SAV BF (Bit 7) 


DEFAULT: 
40h 


RESUMLRD 
(Bit 1) 


7 


o 


O-Received 
bad frames 
are not saved in the memory. 


1-Received 
bad frames 
are saved in the memory. 


o - 
The 82596 does not reread the next CB on the list when a CU Resume 
Control 
Command 
is issued. 


1 - 
The 
82596 
will 
reread 
the 
next 
CB on the 
list when 
a CU Resume 
Control 
Command 
is issued. This is available 
only on the 82596B 
step- 
ping. 


PREAMBLE 
LENGTH 


o 


LOOP 
BACK 
MODE 
ADDRESS 
LENGTH 


BYTE 
3 


ADR LEN (Bits 0-2) 


NO SCR ADD INS (Bit 3) 


PREAM LEN (Bits 4-5) 


LP BCK MODE (Bits 6-7) 


DEFAULT: 
26h 


Address 
length 
(any kind). 


No Source 
Address 
Insertion. 
In the 82586 this bit is called AL LOC. 


Preamble 
length. 


Loopback 
mode. 


7 
0 


, BOF 
METD 
I 
E_X_P_O.J...~_EN_T_IA_L_P_R...J.IP_R_ITY 
O__ 
---I. 
Ll..L..~E_A_R_P_R_IO_R_IL..rv 
1 


BYTE 4 


UN PRIO (Bits 0-2) 


EXP PRIO (Bits 4-6) 


BOF METD (Bit 7) 


DEFAULT: 
OOh 


Linear Priority. 


Exponential 
Priority. 


Exponential 
Backoff 
method. 


7 
0 


, 
--'- 
'-- 
--'-- 
.•..: I_N_TE_R_F_R_A.L¥E_S_P_A_C_IN_G....l.: 
--'- 
---ll 


BYTE 5 


INTERFRAME 
SPACING 


DEFAULT: 
60h 
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7 
0 
I 
SLOT TI¥E 
- LOW 
I 
_------L------'_----'--_.'-------'--_'--------'--_ 
BYTE 6 
SLOT TIME (L) 
DEFAULT: OOh 
Slot time, low byte. 


7 
0 
I 
...J~_AX_IM_U_M_R_~L...R_Y_N_U_M_B_ERL: 
O_-'- 
S_LL:O_T_TI_M_E_- 
H_1G.L: H 
I 


BYTE 7 
SLOT TIME (H) 
(Bits 0-2) 
RETRY NUM (Bits 4-7) 


DEFAULT: F2h 
7 


Slot time, high part. 


Number of transmission retries on collision. 


BYTE 8 
PRM (Bit 0) 
BC DIS (Bit 1) 
MANCH/NRZ (Bit 2) 


Promiscuous mode. 
Broadcast disable. 
Manchester or NRZ encoding. See specific timing require- 
ments for TXC in Manchester mode. 
Transmit on no CRS. 
No CRC insertion. 
CRC type. 
Bit stuffing. 
Padding. 


TONO CRS (Bit 3) 
NOCRC INS (Bit 4) 
CRC-16/CRC-32 (Bit 5) 
BIT STF (Bit 6) 
PAD (Bit 7) 
DEFAULT: OOh 


7 
0 
I CDT SRC I 
COLLI~ION 
DETECT:FILTER 
I CRS SRC I 
CAR~IER 
SENSE 
F:ILTER 
I 


BYTE 9 
CRSF (Bits 0-2) 
Carrier Sense filter (length). 
CRS SRC (Bit 3) 
Carrier Sense source. 


CDTF (Bits 4-6) 
Collision Detect filter (length). 
COTSRC (Bit 7) 
Collision Detect source. 


DEFAULT: OOh 


I 


82596CA 


o 
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7 
° 
: MINIMUM 
FR~ME 
LENGTH 
: 


BYTE 10 


MIN FRAME LEN 


DEFAULT: 
40h 


7 


BYTE 11 


PRECRS (Bit 0) 


LNGFLD 
(Bit 1) 


CRCINM 
(Bit 2) 


AUTOTX 
(Bit 3) 


CDBSAC 
(Bit 4) 


MC-.ALL 
(Bit 5) 


MONITOR 
(Bits 6-7) 


DEFAULT:FFH 


Minimum 
frame 
length. 


° 
MC-ALL 
I 
CDBSAC 
I 
AUTOTX 
CRCINM 
I 
LNGFLD 
I 
PRECRS 
, 


Preamble 
until Carrier Sense 


Length field. Enables 
padding 
at the End-of-Carrier 
framing 
(802.3). 


Rx CRC appended 
to the frame 
in memory. 


Auto retransmit 
when a collision 
occurs 
during the preamble. 


Collision 
Detect 
by source 
address 
recognition. 


Enable to receive 
all MC frames. 


Receive 
monitor 
options. 


7 
° 


1 
O __ 
'--_F_D_X 
o 
o__ 
'-- __ o 
o 
o 
o__ 
1 


BYTE 12 


FOX (Bit 6) 


DEFAULT: 
OOh 


7 
I 
DIS_BOF 
I 
MUL T_lA 


BYTE 13 


MUL T_lA 
(Bit 6) 


DIS_BOF 
(Bit 7) 


DEFAULT:3Fh 
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A reset (hardware 
or software) 
configures 
the 82596 according 
to the following 
defaults. 


I 


Table 4. Configuration 
Defaults 


Parameter 
Default Value 
Units/Meaning 


ADDRESS 
LENGTH 
"6 
Bytes 


AIL 
FIELD LOCATION 
0 
Located in FD 
• 
AUTO RETRANSMIT 
1 
Auto Retransmit 
Enable 
BITSTUFFING/EOC 
0 
EOC 
BROADCAST 
DISABLE 
' 
0 
Broadcast 
Reception 
Enabled 
• 
CDBSAC 
1 
Disabled 
CDTFILTER 
0 
Bit Times 
CDTSRC 
0 
External Collision 
Detection 
• 
CRC IN MEMORY 
1 
CRC Not Transferred 
to Memory 
CRC-16/CRC-32 
··0 
CRC-32 
CRSFILTER 
0 
o Bit Times 
CRSSRC 
0 
External CRS 
• 
DISBOF 
0 
Backoff 
Enabled 
EXT LOOPBACK 
0 
Disabled 
EXPONENTIAL 
PRIORITY 
··0 
802.3 Algorithm 
EXPONENTIAL 
BACKOFF 
METHOD 
··0 
802.3 Algorithm 
• 
FULL DUPLEX (FDX) 
0 
CSMAlCD 
Protocol 
(No FDX) 
FIFO THRESHOLD 
8 
TX: 32 Bytes, RX: 64 Bytes 
INT LOOPBACK 
0 
Disabled 
INTERFRAME 
SPACING 
··96 
Bit Times 
11 


LINEAR PRIORITY 
··0 
802.3 Algorithm 
• 
LENGTH 
FIELD 
1 
Padding Disabled 
MIN FRAME LENGTH 
··64 
Bytes 
• 
MC ALL 
1 
Disabled 
• 
MONITOR 
11 
Disabled 
MANCHESTERINRZ 
0 
NRZ 
• 
Ml!LTI 
lA 
0 
Disabled 
NUMBER 
OF RETRIES 
··15 
Maximum 
Number of Retries 
NO CRC INSERTION 
0 
CRC Appended 
to Frame 
PREFETCH 
BIT IN RBD 
0 
Disabled 
(Valid Only in New Modes) 
PREAMBLE 
LENGTH 
··7 
Bytes 
· 
Preamble 
Until CRS 
1 
Disabled 
PROMISCUOUS 
MODE 
0 
Address 
Filter On 
PADDING 
0 
No Padding 
SLOT TIME 
··512 
Bit Times 
SAVE BAD FRAME 
0 
Discards 
Bad Frames 
TRANSMIT 
ON NO CRS 
0 
Disabled 


NOTES: 
1. This configuration 
setup 
is compatible 
with the IEEE 802.3 
specification. 


2. The Asterisk 
••••• signifies 
a new configuration 
parameter 
not available 
in the 62586. 


3. The default 
value of the Auto retransmit 
configuration 
parameter 
is enabled(1). 


4. Double 
Asterisk 
•••••• signifies 
IEEE 602.3 requirements. 
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Multicast-Setup 


This command 
is used to load the 82596 with the Multicast-IDs 
that should be accepted. 
As noted previously, 
the filtering 
done 
on the 
Multicast-IDs 
is not perfect 
and some 
unwanted 
frames 
may be accepted. 
This 
command 
resets the current 
filter and reloads 
it with the specified 
Multicast-IDs. 
The format 
of the Multicast- 
addresses 
setup command 
is: 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl S I1 
I X 
X 
X 
X 
X 
X 
X 
X 
X 
xlo 
1 
1 
cl 
B IOKI A 10 
o 
0 0 0 0 000 
o 
0 
0 


xlxl 
MC COUNT 
A15 
LINK OFFSET 
AO 


4th byte 
1st byte 


Nth byte 


MULTICAST AiDRESSES LIST 


Figure 26. MC Setup-82586 
and 32-Blt Segmented 
Modes 


31 
ODD WORD 
16 15 
EVENWORD 
0 


ELl S I1 10 0 0 
o 
0 
0 0 
o 
0 010 
1 
1 I cl 
B IOKI A I 0 
o 
0 0 0 0 0 
0 
0 
o 
0 
0 


A31 
LINK ADDRESS 
AO 


2nd byte 
1st bytei X I X I 
MC COUNT 


Nth byte 
MULTICASTAiDRESSES LIST 


where: 


LINK ADDRESS, 
F;L, B, C, I, S 


A 


Bits 19-28 


CMD (bits 16-18) 


MC-CNT 


MC LIST 


Figure 27. MC Setup-Linear 
Mode 


- 
As per standard 
Command 
Block (see the NOP command 
for details) 


- 
Indicates 
that the command 
was abnormally 
terminated 
due to a CU Abort control 
command. 
If one, then the command 
was aborted 
and if necessary 
it should 
be 
repeated. 
If this bit is zero, the command 
was not aborted. 


- 
Reserved 
(0 in bolh the 32-Bit Segmented 
and Linear Modes). 


- 
The MC SETUP command 
value: 3h. 


This 14-bit field indicates 
the number 
of bytes in the MC LIST field. The MC CNT 
must be a multiple of the ADDR LEN; otherwise, 
the 82596 reduces the MC CNT to 
the 
nearest 
ADDR 
LEN multiple. 
MC CNT =0 implies 
resetting 
the 
Hash 
table 
which 
is equivalent 
to disabling 
the Multicast 
filtering 
mechanism. 


- 
A list of Multicast 
Addresses 
to be accepted 
by the 82596. The least significant 
bit 
of each MC address 
must be 1. 


NOTE: 
The list is sequential; 
i.e., the most significant 
byte of an address 
is immediately 
followed 
by the least signifi- 
cant byte of the next address. 


- 
When the 82596 
is configured 
to recognize 
multiple 
Individual 
Address 
(Multi-lA), 
the MC-Setup 
command 
is also used to set up the Hash table 
for the individual 
address. 


The least significant 
bit in the first byte of each lA address 
must be O. 
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Transmit 


This command 
is used to transmit 
a frame of user data onto the serial link. The format of a Transmit 
command 
is as follows. 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl S II 
I X 
X 
X 
X 
X 
X 
X 
X 
X 
XI1 
0 
0 
C I BI 
STATUS BITS 
I 
MAXCOLL 
0 


A15 
TBDOFFSET 
AOA15 
LINK OFFSET 
AO4 


4th byte 
DESTINATIONADDRESS 
tst byte 8 


LENGTH FIELD 
6th byte 
12 


Figure 28. TRANSMIT --82586 
Mode 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELlsl 
1I010101010101010INCIsFI1 
0 
0 
C I BI 
STATUS BITS 
I 
MAXCOLL 
0 


A15 
TBDOFFSET 
AOA15 
LINK OFFSET 
A04 
o 
0 
0 
0 
0 
0 
o 
0 
0 
o 
0 
0 
o 
0 
0 
o eor] o ] 
TCBCOUNT 
8 


4th byte 
DESTINATION ADDRESS 
1st byte 12 


LENGTH FIELD 
6th byte 
16 


OPTIONAL DATA 


Figure 29. TRANSMIT -32-Blt 
Segmented 
Mode 


31 
ODD WORD 
1615 
EVENWORD 
0 


ELl S I I I 0 I 0 I 0 I 0 I 0 I 0 I 0 I 0lNCIsFI1 
o 
0 
C I BI 
STATUS BITS 
I 
MAXCOLL 
0 


A31 
LINK ADDRESS 
AO4 


A31 
TRANSMIT BUFFER DESCRIPTORADDRESS 
AO8 
o 
0 
0 
0 
0 
0 
o 
0 
0 
0 
0 
0 
o 
0 
0 
o EOFI 0 I 
TCBCOUNT 
12 


4th byte 
DESTINATION ADDRESS 
1st byte 16 


LENGTH FIELD 
6th byte 
20 


OPTIONAL DATA 


Figure 30. TRANSMIT-LInear 
Mode 


31 
COMMAND WORD 
16 


IELI S I I I 0 I 0 I 0 I 0 I 0 I 0 I 0 I 0lNCIsFI1 
0 
012 
i i 
0: No CRC Insertion disable; when the 
0: Simplified Mode, all the Tx data is in 
configure command is configured to 
the Transmit Command Block. The 
not insert the CRC during 
Transmit Buffer Descriptor Address 


transmission the NC bit has no 
field is all 1s. 


effect. 
1: Flexible Mode. Data is in the TCB and 
1: No CRC Insertion enable; when the 
in a linked list of TBDs. 


configure command is configured to 
insert the CRC during transmission 
the CRC will not be inserted when 
NC = 1. 


I 


• 
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where: 


EL, B,C,I,S 


OK (Bit 13) 


A (Bit 12) 


Bits 19-28 


CMD (Bits 16-18) 


Status Bit 11 


Status Bit 10 


Status Bit 9 


Status Bit 8 


Status Bit 7 


Status Bit 6 


Status Bit 5 


Status Bit 4 


MAX-COL 
(Bits 3-0) 


LINK OFFSET 


TBDPOINTER 


DEST ADDRESS 


LENGTH 
FIELD 


TCBCOUNT 


EOF Bit 
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- 
As per standard 
Command 
Block (see the NOP command 
for details). 


- 
Error free completion. 


- 
Indicates 
that the command 
was abnormally 
terminated 
due to CU Abort 
control 
command. 
If 1, then 
the command 
was aborted, 
and if necessary 
it should 
be 
repeated. 
If this bit is 0, the command 
was not aborted. 


- 
Reserved 
(0 in the 32-bit Segmented 
and Linear modes). 


- 
The transmit 
command: 
4h. 


- 
Late collision. 
A late collision 
(a collision 
after the slot time is elapsed) 
is detected. 


- 
No Carrier 
Sense 
signal 
during 
transmission. 
Carrier 
Sense 
signal 
is monitored 
from the end of Preamble transmission 
until the end of the Frame Check Sequence 
for TONOCRS = 1 (Transmit 
On No Carrier Sense mode) it indicates 
that transmis- 


sion 
has 
been 
executed 
despite 
a lack 
of CRS. 
For TONOCRS = 0 (Ethernet 


mode), 
this 
bit also 
indicates 
unsuccessful 
transmission 
(transmission 
stopped 
when lack of Carrier Sense has been detected). 


- 
Transmission 
unsuccessful 
(stopped) 
due to Loss of CTS. 


- 
Transmission 
unsuccessful 
(stopped) 
due to DMA Underrun; 
Le., the system 
did 
not supply data for transmission. 


- 
Transmission 
Deferred, 
Le., transmission 
was not immediate 
due to previous 
link 
activity. 
. 


- 
Heartbeat 
Indicator, 
Indicates 
that after a previously 
performed 
transmission, 
and 
before 
the most 
recently 
performed 
transmission, 
(Interframe 
Spacing) 
the CDT 
signal was monitored 
as active. This indicates 
that the Ethernet 
Transceiver 
Colli- 


sion 
Detect 
logic 
is performing 
properly. 
The 
Heartbeat 
is monitored 
during 
the 
Interframe 
Spacing 
period. 


- 
Transmission 
attempt was stopped 
because the number of collisions 
exceeded 
the 
maximum 
allowable 
number 
of retries. 


- 
0 (Reserved). 


- 
The number of Collisions 
experienced 
during this frame. Max Col = 0 plus S5 = 1 
indicates 
16 collisions. 


- 
As per standard 
Command 
Block (see the NOP Command 
for details) 


- 
In the 82586 
and 32-bit Segmented 
modes this is the offset 
of the first Tx Buffer 
Descriptor 
containing 
the data to be transmitted. 
In the Linear mode this is the 32- 


bit address of the first Tx Buffer Descriptor 
on the list. If the TBD POINTER 
is all 1s 
it indicates 
that no TBD is used. 


- 
Contains 
the Destination 
Address 
of the frame. The least significant 
bit (MC) indi- 
cates the address 
type. 


MC = 0: Individual 
Address. 


MC = 1: Multicast 
or Broadcast 
Address, 


If the Destination 
Address 
bits are all 1s this is a Broadcast 
Address. 


- 
The contents 
of this 2-byte field are user defined. 
In 802.3 it contains 
the length of 
the data field. It is placed 
in memory 
in the same order it is transmitted; 
i.e., most 


significant 
byte first, least significant 
byte second. 


- 
This 14-bit counter 
indicates 
the number of bytes that will be transmitted 
from the 
Transmit 
Command 
Block, starting 
from the third byte after the TCB COUNT field 


(address n+ 12 in the 32-bit Segmented 
mode, N+ 16 in the Linear 
mode). 
The 
TCB COUNT field can be any number of bytes (including 
an odd byte), this allows 
the user to transmit 
a frame 
with a header 
having an odd number 
of bytes. The 


TCB COUNT field is not used in the 82586 
mode. 


- 
Indicates 
that 
the whole 
frame 
is kept 
in the Transmit 
Command 
Block. 
In the 
Simplified 
memory 
model it must be always asserted. 


I 
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The interpretation 
of what is transmitted 
depends 
on the No Source Address 
insertion 
configuration 
bit and the 
memory 
model being used. 


NOTES: 


1. The Destination 
Address 
and the Length 
Field are sequential. 
The Length 
Field immediately 
follows 
the 
most significant 
byte of the Destination 
Address. 


2. In case 
the 82596 
is configured 
with No Source 
Address 
insertion 
bit equal 
to 0, the 82596 
inserts 
its 
configured 
Source 
Address 
in the transmitted 
frame. 


• 
In the 82586 mode, or when the Simplified 
memory 
model is used, the Destination 
and Length fields of the 
transmitted 
frame are taken from the Transmit 
Command 
Block. 


• 
If the FLEXIBLE 
memory 
model is used, the Destination 
and Length fields of the transmitted 
frame can be 
found 
either in the TCB or TBD, depending 
on the TCB COUNT. 


3. If the 82596 is configured 
with the Address/Length 
Field Location 
equal to 1, the 82596 does not insert its 
configured 
Source 
Address 
in the transmitted 
frame. 
The first (2 
X 
Address 
Length) + 2 bytes 
of the 
transmitted 
frame are interpreted 
as Destination 
Address, 
Source Address, 
and Length fields respectively. 
The location 
of the first transmitted 
byte depends 
on the operational 
mode of the 82596: 


• 
In the 82586 
mode, it is always the first byte of the first Tx Buffer. 


• 
In both the 32-bit Segmented 
and Linear modes 
it depends 
on the SF bit and TCB COUNT: 


- 
In the Simplified 
memory 
mode the first transmitted 
byte is always the third byte after the TCB COUNT 
field. 


- 
In the Flexible 
mode, if the TCB COUNT 
is greater than 0 then it is the third byte after the TCB COUNT 
field. If TCB COUNT equals 0 then it is first byte of the first Tx Buffer. 


• 
Transmit 
frames 
shorter than six bytes are invalid. The transmission 
will be aborted 
(only in 82586 
mode) 
because 
of a DMA Underrun. 


4. Frames 
which 
are aborted 
during transmission 
are jammed. 
Such an interruption 
of transmission 
can be 
caused 
by any reason 
indicated 
by any of the status bits 8, 9, 10 and 12. 


Jamming Rules 


1. Jamming 
will not start before completion 
of preamble 
transmission. 


2. Collisions 
detected 
during transmission 
of the last 11 bits will not result in jamming. 


The format 
of a Transmit 
Buffer Descriptor 
is: 


82586 Mode 
31 
ODD WORD 
1615 
13 
EVEN WORD 
0 


NEXT TBD OFFSET 
IEOFI x ] 
SIZE (ACT COUNn 
0 
x x x x x x x xl 
TRANSMIT 
BUFFER 
ADDRESS 
4 


32-Bit 
Segmented 
Mode 
31 
ODD WORD 
1615 
13 
EVEN WORD 
0 


NEXT TBD OFFSET 
IEOFI 
0 I 
SIZE (ACT COUNT) 
0 


TRANSMIT 
BUFFER 
ADDRESS 
4 


Linear 
Mode 
31 
ODD WORD 
1615 
13 
EVEN WORD 
0 


0 
0 
0 
0 
0 
0 
0 
010 
0 
0 
0 
0 
0 
0 
o IEOFI 
0 I 
SIZE (ACT COUNT) 
0 


NEXT TBD ADDRESS 
4 


TRANSMIT 
BUFFER 
ADDRESS 
B 


Figure 
31 
I 


•• 
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where: 


EOF 
- 
This bit indicates 
that this TBD 
is the last one associated 
with the frame 
being 
transmitted. 
It is set by the CPU before transmit. 
' 


SIZE (ACT COUNT) 
- 
This 
14-bit quantity 
specifies 
the number 
of bytes 
that 
hold 
information 
for the 
current 
buffer. 
It is set by the CPU before transmission. 


NEXT TBD ADDRESS 
- 
In the 82586 and 32-bit Segmented 
modes, 
it is the offset 
of the next TBD on the 
list. In the Linear mode this is the 32-bit address 
of the next TBD on the list. It is 
meaningless 
if EOF = 1. 


BUFFER ADDRESS 
- 
The starting 
address 
of the memory 
area that contains 
the data to be sent. In the 
82586 
mode, this is a 24-bit address 
(A31-A24 
are considered 
to be zero). In the 
32-bit 
Segmented 
and Linear modes 
this -is a 32-bit address. 
This buffer 
can be 
byte aligned for the 82596 
B step. 


TOR 


This operation 
activates 
Time Domain 
Reflectomet, 
which is a mechanism 
to detect 
open or short circuits 
on 


the link and their distance 
from 
the diagnosing 
station. 
The TOR command 
has no parameters. 
The TOR 


transmit 
sequence 
was changed, 
compared 
to the 82586, to form a regular transmission. 
The TOR command 


is designed 
to be used statically. 
Make 
sure that 
both the CU and 
RU are idle before 
attempting 
a TOR 
command. 
The TOR bit stream 
is as follows. 


- 
Preamble 


- 
Source 
address 


- 
Another 
Source 
address 
(the TOR frame 
is transmitted 
back to the sending 
station, 
so DEST ADR = SRC ADR). 


- 
Data field containing 
7Eh patterns. 


- 
Jam Pattern, 
which 
is the inverse CRC of the transmitted 
frame. 


Maximum 
length of the TOR frame is 2048 bits. If the 82596 senses collision 
while transmitting 
the TOR frame 


it transmits 
the jam pattern 
and stops the transmission. 
The 82596 then triggers 
an internal 
timer (STC); the 


timer 
is reset 
at the beginning 
of transmission 
and reset 
if CRS is returned. 
The timer 
measures 
the time 
elapsed 
from the start of transmission 
until an echo is returned. The echo is indicated 
by Collision 
Detect going 
active or a drop in the Carrier Sense signal. The following 
table lists the possible 
cases that the 82596 is able 


to analyze. 


Transceiver 
Type 
Ethernet 
Non Ethernet 
Condition 


Carrier Sense was inactive for 2048-bit-time 
Short or Open on the 
NA 
periods 
Transceiver 
Cable 


Carrier Sense signal dropped 
Short on the Ethernet 
cable 
NA 


Collision 
Detect went active 
Open on the Ethernet 
cable 
Open on the Serial Link 


The Carrier Sense Signal did not drop or the 
No Problem 
No Problem 
Collision 
Detect did not go active within 
2048-bit 
time period 


Conditions 
of TOR as Interpreted 
by the 82596 


An Ethernet 
transceiver 
is defined 
as one that returns transmitted 
data on the receive 
pair and activates 
the 
Carrier Sense Signal while transmitting. 
A Non-Ethernet 
Transceiver 
is defined 
as one that does not do so. 
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The format 
of the Time Domain 
Reflectometer 
command 
is: 


82586 and 32-Blt Segmented 
Modes 
31 
ODD WORD 
1615 
EVEN WORD 
a 


EL 
S 
I 
X 
X 
X 
X 
X 
X 
X 
X 
X 
XI1 
a 
1 
cl 
S IOKI a 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 


LNK XVR ET 
se:TI x I 
TIME 
A15 
LINK OFFSET 
Aa 
OK PRS OPN 
(11 bits) 


LInear Mode 
31 
ODD WORD 
1615 
EVEN WORD 
a 


Ell 
si 
I 1a 
a 
0 
o 
a 
0 
0 
a 
a 
a 11 
a 
1 1c 
S IOKI a 
0 
0 
o 
0 
0 
a 
a 
a 
o 
a 
a 
0 


A31 
LINK ADDRESS 
Aa 


o 
a 
a 
a 
0 
0 
a 
a 
a 
a 
a 
a 
a 
0 
0 
o I LNK I XVR I 
ET 
I 
ET 
I X I 
TIME 
OK 
PRS 
OPN 
SRT 
(11 bits) 


where: 


LINK ADDRESS, 
EL, B,C,I,S 


A 


Bits 19-28 


CMD (Bits 16-18) 


TIME 


LNK OK (Bit 15) 


XCVR PRB (Bit 14) 


ET OPN (Bit 13) 


ET SRT (Bit 12) 


I 


Figure 32. TOR 


- 
As per standard 
Command 
Block (see the NOP command 
for details). 


- 
Indicates 
that the command 
was abnormally 
terminated 
due to CU Abort 
control 
command. 
If one, then the command 
was aborted, 
and if necessary 
it should 
be 
repeated. 
If this bit is zero, the command 
was not aborted. 
4 


- 
Reserved 
(0 in the 32-bit Segmented 
and Linear Modes). 


- 
The TOR command. 
Value: 5h. 


- 
An 11-bit field that specifies 
the number of TxC cycles that elapsed 
before an echo 
was observed. 
No echo 
is indicated 
by a reception 
consisting 
of "1s" 
only. 
Be- 
cause the network 
contains 
various 
elements 
such as transceiver 
links, transceiv- 
ers, Ethernet, 
repeaters 
etc., the TIME is not exactly 
proportional 
to the problems 
distance. 


- 
No link problem 
identified. 
TIME=7FFh. 


- 
Indicates 
a Transceiver 
problem. 
Carrier Sense was inactive 
for 2048-bit 
time peri- 
od. LNK OK=O. 
TIME=7FFh. 


- 
The transmission 
line is not properly 
terminated. 
Collision 
Detect 
went active and 
LNK OK=O. 


- 
There is a short circuit on the transmission 
line. Carrier Sense Signal dropped 
and 
LNK OK=O. 
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Ell S 1 I 
1 x x x x x x x x x 
X 11 
1 
0 1cl B IOKI 0 
o 
0 0 0 0 0 0 0 0 0 0 0 


LINK ADDRESS 
AO 
A31 


BUFFER 
ADDRESS 
AO 
A31 


Figure 33. Dump 


where: 


LINK ADDRESS, 
EL, B,C, I,S 


OK 


Bits 19-28 


CMD (Bits 16-18) 


BUFFER 
POINTER 


- 
Indicates 
error free completion. 


- 
Reserved 
(0 in the 32·bit Segmented 
and Linear Modes). 


'- 
The Dump command. 
Value: 6h. 


- 
In the 82586 
and 32·bit Segmented 
modes 
this is the 16·bit-offset 
portion 
of the 
dump area address. 
In the Linear mode this is the 32-bit linear address of the dump 
area. 


- 
As per standard 
Command 
Block (see the NOP command 
for details). 


Dump Area Information 
Format 


• 
The 82596 
is not Dump compatible 
with the 82586 
because 
of the 32-bit internal 
architecture. 
In 82586 
mode the 82596 will dump the same number 
of bytes as the 82586. 
The compatible 
data will be marked 
with an asterisk. 


• 
In 82586 
mode the dump area is 170 bytes. 


• 
The DUMP area format 
of the 32-bit Segmented 
and Linear modes 
is described 
in Figure 35. 


• 
The size of the dump area of the 32-bit Segmented 
and Linear modes 
is 304 bytes. 


• 
When the Dump is executed 
by the Port command 
an extra word will be appended 
to the Dump Area. The 
extra word is a copy of the Dump Area status word (containing 
the C, B, and OK Bits). The C and OK Bits 
are set when the 82596 
has completed 
the Port Dump command. 
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15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
0 


DMA CONTROL 
REGISTER 
00 


CONFIGURE 
BYTES' 
3, 2 
02 
'The 
82596 
is not 
Dump 
compatible 
with 


CON FIGURE 
BYTES' 
5,4 
04 
the 
82586 
because 
of the 
32·bit 
internal 
ar- 


CONFIGURE 
BYTES' 
7, 6 
06 
chitecture. 
In 82586 
mode 
the 
82596 
will 


dump 
the 
same 
number 
of 
bytes 
as 
the 


CONFIGURE 
BYTES' 
9, 8 
08 
82586. 


CONFIGURE 
BYTES' 
10 
OA 
"These 
bytes 
are 
not 
user 
defined, 
results 


LA. BYTES 
1, 0' 
QC 
may 
vary 
from 
Dump 
command 
to 
Dump 


command. 


LA. BYTES 
3, 2' 
OE 


LA. BYTES 
5, 4' 
10 


LAST T.X. STATUS' 
12 


T.X. CRC BYTES 
1, 0' 
14 


T.X. CRC BYTES 
3, 2' 
16 


R.X. CRC BYTES 
1, 0' 
18 


R.X. CRC BYTES 
3, 2' 
lA 


R.X. TEMP 
MEMORY 
1,0' 
lC 


R.X. TEMP 
MEMORY 
3, 2' 
lE 


R.X. TEMP 
MEMORY 
5, 4' 
20 


LAST RECEIVED 
STATUS' 
22 


HASH REGISTER 
BYTES 
1, 0' 
24 


HASH 
REGISTER 
BYTES 
3, 2' 
26 


HASH REGISTER 
BYTES 
5, 4' 
28 


HASH 
REGISTER 
BYTES 
7, 6' 
2A 


SLOT TIME COUNTER' 
2C 


WAIT TIME COUNTER' 
2E 


MICRO 
MACHINE" 
30 


REGISTER 
FILE 


60 BYTES 
6A 


MICRO 
MACHINE 
LFSR" 
6C 


MICRO 
MACHINE" 
6E 


FLAG ARRAY 


14 BYTES 
7A 


QUEUE 
MEMORY" 
7C 


CUPORT 
8 BYTES 
82 


MICRO 
MACHINE 
ALU" 
84 


RESERVED" 
86 


M.M. TEMP A ROTATE 
R" 
88 


M.M. TEMP 
A" 
8A 


T.X. DMA BYTE COUNT" 
8C 


M.M. INPUT 
PORT ADDRESS" 
8E 


T.X. DMA ADDRESS 
90 


M.M. OUTPUT 
PORT" 
92 


R.X. DMA BYTE COUNT" 
94 


M.M. OUTPUT 
PORT ADDRESS 
REGISTER" 
96 


R. DMAADDRESS" 
98 


RESERVED" 
9A 


BUS THROTTLE 
TIMERS 
9C 


DIU CONTROL 
REGISTER" 
9E 


RESERVED" 
AO 


DMA CONTROL 
REGISTER" 
A2 


BIU CONTROL 
REGISTER" 
A4 


M.M. DISPATCHER 
REG." 
A6 


M.M. STATUS 
REGISTER" 
A8 


Figure 34. Dump Area Format-82586 
Mode 


I 


11 
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31 
0 


CON FIGURE 
BYTES 
5, 4, 3, 2 
00 
The 
82596 
is not Dump 
compatible 
with 
the 


82586 
because 
of the 
32-M 
internal 
archi- 


CONFIGURE 
BYTES 
9, 8, 7, 6 
04 
tecture. 
In 82586 
mode 
the 82596 
will dump 


CONFIGURE 
BYTES 
13, 12, 11,10 
08 
the same 
number 
of bytes 
as the 82586. 


"These 
bytes 
are 
not user 
defined, 
results 


lA 
BYTES 
1, 0 
X 
X 
X 
X 
X 
X 
X 
X 
OG 
may 
vary 
from 
Dump 
command 
to 
Dump 


lA 
BYTES 
5, 2 
10 
command. 


TX CRC BYTES 
0, 1 
LAST T.X. STATUS 
14 


RX CRC BYTES 
0, 1 
TX CRC BYTES 
3, 2 
18 


RX TEMP 
MEMORY 
1,0 
RX CRC BYTES 
3, 2 
1C 


R.X. TEMP 
MEMORY 
5, 2 
20 


HASH 
REGISTERS 
1, 0 
LAST R.X. STATUS 
24 


HASH 
REGISTER 
BYTES 
5, 2 
28 


SLOT TIME COUNTER 
HASH 
REGISTERS 
7, 6 
2C 


RECEIVE 
FRAME 
LENGTH 
WAIT-TIME 
COUNTER 
30 


MICRO 
MACHINE" 
34 


REGISTER 
FILE 


128 BYTES 
BO 


MICRO 
MACHINE 
LFSR" 
B4 


MICRO 
MACHINE" 
B8 


FLAG ARRAY 


28 BYTES 
DO 


M.M. INPUT 
PORT" 


D4 


16 BYTES 
EO 


MICRO 
MACHINE 
ALU" 
E4 


RESERVED" 
E8 


M.M. TEMP A ROTATE 
R." 
EC 


M.M. TEMP 
A" 
FO 


T.X. DMA BYTE COUNT" 
F4 


M.M. INPUT 
PORT ADDRESS 
REGISTER" 
F8 


T.X. DMA ADDRESS" 
FC 


M.M. OUTPUT 
PORT REGISTER" 
100 


R.X. DMA BYTE COUNT" 
104 


M.M. OUTPUT 
PORT ADDRESS 
REGISTER" 
108 


R.X. DMA ADDRESS 
REGISTER" 
10G 


RESERVED" 
110 


BUS THROTTLE 
TIMERS 
114 


DIU CONTROL 
REGISTER" 
118 


RESERVED" 
11C 


DMA CONTROL 
REGISTER" 
120 


BIU CONTROL 
REGISTER" 
124 


M.M. DISPATCHER 
REG." 
128 


M.M. STATUS 
REGISTER" 
12C 


Figure 35_ Dump Area Format-Linear 
and 32-Blt Segmented 
Mode 


4-48 
I 


82596CA 


Diagnose 


The Diagnose 
Command 
triggers 
an internal 
self-test 
procedure 
that checks 
internal 
82596 
hardware, 
which 
includes: 


• 
Exponential 
Backoff 
Random 
Number 
Generator 
(Linear Feedback 
Shift Register). 


• 
Exponential 
Backoff 
Timeout 
Counter. 


• 
Slot Time Period Counter. 
' 


• 
Collision 
Number 
Counter. 


• 
Exponential 
Backoff 
Shift Register. 


• 
Exponential 
Backoff 
Mask Logic. 


• 
Timer Trigger 
Logic. 


This procedure 
checks 
the operation 
of the Backoff 
block, which 
resides 
in the serial side and is not easily 
controlled. 
The Diagnose 
command 
is performed 
in two phases. 


The format 
of the 82596 
Diagnose 
command 
is: 


ODD WORD 
82586 and 32-Blt Segmented 
Modes 
1615 
EVEN WORD 
o 


x x x x x x x x x x 
1CBOKOFOOOOO 
000 
000 
x x x x x x x x x x 
X 
A15 
LINK OFFSET 
AO 


ODD WORD 
EVEN WORD 
0 


o 
0 
0 
0 
0 
0 
0 
o 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


AO 


Figure 36. Diagnose 


where: 


LINK ADDRESS, 
EL, B,C,I,S 


Bits 19-28 


CMD (bits 16-18) 


OK (bit 13) 


F(bit11) 


- 
As per standard 
Command 
Block (see the NOP command 
for details). 


- 
Reserved 
(0 in the 32-bit Segmented 
and Linear Modes). 


- 
The Diagnose 
command. 
Value: 7h. 


- 
Indicates 
error free completion. 


- 
Indicates 
that the self-test 
procedure 
has failed. 


I 
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1. The received 
frame 
is truncated. 


2. The No Resource 
error counter 
is updated. 


3. If the 82596 is configured 
to Save Bad Frames the RFD is not reused; otherwise, 
the same RFD is used to 
hold the next received 
frame, 
and the only action 
taken 
regarding 
the truncated 
frame 
is to update 
the 
counter. 


4. The 82596 continues 
to receive 
the next frame 
in the next RFD. 


290218-15 


L 
SCB 
RECEIVE FRAt.4E AREA 


r-- f-e 
RFD 1 


RFA 
STATUS 
STATUS U 


STATUS lJ 


STATUS If 


POINTER 
I 
STATISTICS 
- 
- 
- 
-~ 
-"- 


TO 
Ot.4t.4AND 
RECEIVE 
VALID 
Et.4PTY 
Et.4PTY 
Et.4PTY 


BLOCK 
FRAt.4E 
PARAt.4ETERS 


LIST 
DESCRIPTORS 
L 
RBD1 
RBD2 
I 
RBD3 
RBD4 
RBDS 


OIACT-:Lr 
1 I ACT-cnt If 
01 
ACT-cnt U 
01 
ACT-cnt 


Lr 
01 ACT-cnl If 
- 
- 
- 
- 
RECEIVE 
BUFFER 
DESCRIPTORS 
, 
, 
, 
, 
, 
J- 
J- 
.r. 
J- 
J- 


RECEIVE 
VALID 
VALID 


BUFFERS 
DATA 
DATA 
'--- 
'--- 
- 
'--- 
'--- 


BUFFER 
1 
BUFFER 2 
BUFFER 3 
BUFFER 4 
BUFFER S 


_ 
RECEIVE FRAt.4E LIST 
FREE FRAt.4E LIST 


C 


Figure 37. The Receive 
Frame Area 
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Note 
that 
this sequence 
is very useful 
for monitoring. 
If the 82596 
is configured 
to Save 
Bad Frames, 
to 
receive in Promiscuous 
mode, and to use the Simplified 
memory structure, 
any programmed 
length of received 


data can be saved in- memory. 


The Simplified 
memory 
structure 
is shown 
in Figure 38. 


:+-- RECEIVE FRA~E LIST ----t~~:.4------- 
FREE FRA~E 
LIST ----------t~~, 


SCB 


STATUS .r'TO CO~~AND 
LIST 


CBl 
4 
RECEIVE FRA~E AREA 
~ 
POINTER 


RFA 
h 
FDI 
FD2 
FD3 
FD4 
POINTER 
STATUS 
STATUS Y 


STATUS tr 


STATUS J: 
STATISTICS 
I 
- 
- 
- 
I 
BUS 
I 
I 
THROTIlE 
I 
I 
I 
._-----_. 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
RECEIVE 
VARIABLE 
I 
I 
1 
1 
1 
1 
FRA~E 
DATA 
1 
E~PTY 
I 
I 
E~PTY 
1 
1 
E~PTY 
1 
DESCRIPTORS 
FIELD 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 


'1 
1 
1 
1 
1 
1 
I 
I 
I 
1 
1 
1 
..----_. 
._----_. 
..----_ . 
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Figure 38. RFA Simplified 
Memory 
Structure 


Flexible Memory Structure 


The second 
structure 
is the Flexible 
memory 
structure, 
the data structure 
of the received 
frame 
is stored 
in 
both the RFD and in a linked list of Receive Buffers-Receive 
Buffer Descriptors. 
The received 
frame is placed 


in the RFD as configured 
in the Size field. Any remaining 
data is placed 
in a linked list of RBDs. 


The Flexible 
memory 
structure 
is shown 
in Figure 39. 


l 
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SCB S 
TO CO•••• AND LIST 


STATUS 
• 
RECEIVE FRA•• E AREA 
• 


CBL 


POINTER 
FD1 
FD2 
FD3 
FD4 


RFA 
STATUS 
STATUS -.r 
STATUS r 


STATUS Lt: 
POINTER 
I 
- 
- 
- 


STATISTICS 
- 
- 
I 
I 
I-- 
- 


I 
BUS 
I 


I 
THROTILE 
I 
CONTROL 
I 
I 
I 
I 
I 
I 
._-----_ 
.. 
FIELD 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
RECEIVE 
VARIABLE 
I 
E•• PTY 
I 
I 
E•• PTY 
I 
I 
E•• PTY 
I 
FRA ••E 
DATA 
I 
I 
I 
I 
I 
I 


DESCRIPTORS 
FIELD 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
._----_. 
._----_ . 
._---_ .. 
l 


RBD1 
RBD2 
I 
RBD3 
RBD4 
RBD5 
lr 
J.r 
lr 
.r 
u: 
- 
- 
- 
- 
- 


RECEIVE 
BUFFER 
T 
T 
T 
T 
DESCRIPTORS 
T.-L 
J- 
J- 
.-L 
J- 


V~L1D 
E•• PTY 
E•• PTY 
E•• PTY 
RECEIVE 
DATA 


BUFFERS 
- 
'---- 
'---- 
--- 
'---- 


BUFFER 
1 
BUFFER 2 
BUFFER 3 
BUFFER 4 
BUFFER 
5 


,+-- 
RECEIVE FRA•• E LIST 
~:4 
FREE FRA ••E LIST 
~: 
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Figure 39. RFA Flexible Memory 
Structure 


Buffers 
on the receive 
side can be different 
lengths. 
The 82596 will not place more bytes into a buffer than 


indicated 
in the associated 
RBO. The 82596 will fetch 
the next 
RBO before 
it is needed. 
The 82596 will 
attempt 
to receive 
frames 
as long as the FBL is not exhausted. 
If there 
are no more 
buffers, 
the 82596 
Receive 
Unit will enter the No Resources 
state. Before starting the RU, the CPU must place the FBL pointer in 


the RBO pointer field of the first RFO. All remaining 
RBO pointer fields for subsequent 
RFOs should be "1 s." If 


the Receive 
Frame Oescriptor 
and the associated 
Receive 
Buffers 
are not reused 
(e.g., the frame is properly 
received 
or the 82596 is configured 
to Save Bad Frames), 
the 82596 writes the address 
of the next free RBO 


to the RBO pointer 
field of the next RFO. 


Receive Buffer Descrlptor 
(RBD) 


The RBOs are used to store received 
data in a flexible 
set of linked buffers. 
The portion 
of the frame's 
data 


field that is outside the RFO is placed 
in a set of buffers chained 
by a sequence 
of RBOs. The RFO points to 


the first RBO, and the last RBO is flagged 
with an EOF bit set to 1. Each buffer 
in the linked 
list of buffers 
related 
to a particular 
frame 
can be any size up to 214 bytes but must be word aligned 
(begin 
on an even 
numbered 
byte). This ensures 
optimum 
use of the memory 
resources 
while 
maintaining 
low overhead. 
All 
buffers 
in a frame are filled with the received 
data except for the last, in which the actual count can be smaller 
than the allocated 
buffer space. 
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31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl S I x 
x 
x 
x 
x 
x 
x 
x 
x 
x 
x 
x 
x 
x 
cl 
S IOKI 
0 I 
STATUS 
SITS 
10 
0 
0 
0 
0 
o 
0 


A15 
RSDOFFSET 
AO A15 
LINK OFFSET 
AO 4 


4th byte 
DESTINATION 
ADDRESS 
1st byte 8 


SOURCE 
ADDRESS 
1st byte 6th byte 
12 


6th byte 
4th byte 
16 


X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
LENGTH 
FIELD 
20 


Figure 40. Receive 
Frame Descrlptor-82586 
Mode 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl si 
0 
0 
0 
0 
000 
o 
0 
o ISFI 
0 
0 
0 
C 
I sIoKI 
STATUS 
BITS 
0 


A15 
RSDOFFSET 
AO A15 
LINK OFFSET 
AO 4 


0101 
SIZE 
EOFI F I 
ACTUAL 
COUNT 
8 


4th byte 
DESTINATION 
ADDRESS 
tst byte 12 


SOURCE 
ADDRESS 
1st byte 6th byte 
16 


6th byte 
4th byte 
20 


LENGTH 
FIELD 
24 


OPTIONAL 
DATA AREA 


Figure 41. Receive 
Frame Descrlptor-32-Blt 
Segmented 
Mode 


31 
ODD WORD 
1615 
EVEN WORD 
0 


ELl si 0 
0 
0 
0 
0 
0 
0 
0 
0 
OISFlo 
0 
0 
C 
I sIoKI 
STATUS 
SITS 
0 


A31 
LINK ADDRESS 
A04 


A31 
RECEIVE 
SUFFER 
DESCRIPTOR 
ADDRESS 
AO 8 


0101 
SIZE 
EOFI F I 
ACTUAL 
COUNT 
12 


4th byte 
DESTINATION 
ADDRESS 
tst byte 16 


SOURCE 
ADDRESS 
1st byte 
6th byte 
20 


6th byte 
4th byte 
24 


LENGTH 
FIELD 
28 


OPTIONAL 
DATA AREA 


Figure 42. Receive 
Frame Descrlptor-Llnear 
Mode 
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where: 
EL 
S 
SF 


C 


B 


OK (bit 13) 


STATUS 


LINK ADDRESS 


RBDPOINTER 


EOF 
F 
SIZE 
ACT COUNT 
MC 
DESTINATION 
ADDRESS 
SOURCE ADDRESS 


LENGTH FIELD 


4-54 


- 
When set, this bit indicates that this RFD is the last one on the RDL. 


- 
When set, this bit suspends the RU after receiving the frame. 
- 
This bit selects between the Simplified or the Flexible mode. 


0- 
Simplified mode, all the RX data is in the RFD. RBD ADDRESS field is all 
u1s." 


1- 
Flexible mode. Data is in the RFD and in a linked list of Receive Buffer De- 
scriptors. 
. 


- 
This bit indicates the completion of frame reception. It is set by the 82596. 


- 
This bit indicates that the 82596 is currently receiving this frame, or that the 82596 
is ready to receive the frame. It is initially set to 0 by the CPU. The 82596 sets it to 
1 when reception set up begins, and to 0 upon completion. The C and B bits are 
set during the same operation. 


- 
Frame received successfully, without errors. RFDs with bit 13 equal to 0 are possi- 
ble only if the save bad frames, configuration option is selected. Otherwise all 
frames with errors will be discarded, although statistics will be collected on them. 
- 
The results of the Receive operation. Defined bits are, 
Bit 12: 
Length error if configured to check length 
Bit 11: 
CRC error in an aligned frame 
Bit 10: 
Alignment error (CRC error in misaligned frame) 
Bit 9: 
Ran out of buffer space-no 
resources 
Bit 8: 
DMA Overrun failure to acquire the system bus. 


Bit 7: 
Frame too short. 


Bit 6: 
No EOP flag (for Bit stuffing only) 
Bit 5: 
When the SF bit equals zero, and the 82596 is configured to save bad 
frames, this bit signals that the receive frame was truncated. Otherwise it 
is zero. 
Bits 2-4: 
Zeros 
Bit 1: 
When it is zero, the destination address of the received frame matches 
the lA address. When it is a 1, the destination address of the received 
frame did not match the individual address. For example, a multicast 
address or broadcast address will set this bit to a 1. 


Bit 0: 
Receive collision. A collision is detected during reception and the colli- 
sion occurred after the destination address was received. 
- 
A 16-bit offset (32-bit address in the Linear mode) to the next Receive Frame 
Descriptor. The Link Address of the last frame can be used to form a cyclical list. 


- 
The offset (address in the Linear mode) of the first RBD containing the received 
frame data. An RBD pointer of all ones indicates no RBD. 
- 
These fields are for the Simplified and Flexible memory models. They are exactly 
the same as the respective fields in the Receive Buffer Descriptor. See the next 
section for detailed explanation of their functions. 


- 
Multicast bit. 


- 
The contents of the destination address of the receive frame. The field is 0 to 6 
bytes long. 


- 
The contents of the Source Address field of the received frame. It is 0 to 6 bytes 
long. 
- 
The contents of this 2-byte field are user defined. In 802.3 it contains the length of 
the data field. It is placed in memory in the same order it is received, i.e., most 
significant byte first, least significant byte second. 
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NOTES 
1. The Destination 
address, 
Source 
address 
and Length fields are packed, 
i.e., one field immediately 
follows 
the next. 
2. The affect 
of Address/Length 
Location 
(No Source 
Address 
Insertion) 
configuration 
parameter 
while 
re- 
ceiving 
is as follows: 


- 
82586 
Mode: The Destination 
address, 
Source 
address 
and Length field are not used, they are placed 
in 
the RX data buffers. 


- 
32-Bit Segmented 
and Linear Modes: when the Simplified 
memory model is used, the Destination 
address, 


Source 
address 
and Length fields reside in their respective 
fields in the RFD. When the Flexible 
memory 
strucrture 
is used the Destination 
address, 
Source address, 
and Length field locations 
depend 
on the SIZE 
field of the RFD. They can be placed in the RFD, in the RX data buffers, or partially in the RFD and the rest 
in the RX data buffers, 
depending 
on the SIZE field value. 


82586 Mode 
31 
ODD WORD 
1615 
EVEN WORD 
0 


A15 
NEXT RBD OFFSET 
AolEOFI 
F I 
ACTUAL 
COUNT 
0 


X 
X 
X 
X 
X 
X 
X 
X IA23 
RECEIVE 
BUFFER 
ADDRESS 
AO 4 


X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X 
X I EL I X I 
SIZE 
6 


32-Blt Segmented 
Mode 
31 
ODD WORD 
1615 
EVEN WORD 
0 


A15 
NEXT RBD OFFSET 
AolEOFI 
F I 
ACTUAL 
COUNT 
0 


A31 
RECEIVE 
BUFFER 
ADDRESS 
AO 4 


o 
0 
0 
0 
0 
o 
0 
0 
0 
0 
0 
0 
0 
0 
o 
01 EL I pi 
SIZE 
6 


Linear Mode 
31 
ODD WORD 
1615 
EVEN WORD 
0 


o 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
o IEOFI 
F I 
ACTUAL 
COUNT 
0 


A31 
NEXT RBD ADDRESS 
AO 4 


A31 
RECEIVE 
BUFFER 
ADDRESS 
AO 6 


o 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
o I EL I pi 
SIZE 


Figure 43. Receive 
Buffer Descrlptor 


I 
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where: 


EOF 


F 


ACTCOUN 


NEXT BD ADDRESS 


BUFFER ADDRESS 


EL 


P 


SIZE 
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- 
Indicates 
that this is the last buffer 
related 
to the frame. 
It is cleared 
by the CPU 
before 
starting 
the RU, and is written 
by the 82596 at the end of reception 
of the 
frame. 


- 
Indicates 
that this buffer has already been used. The Actual Count has no meaning 
unless the F bit equals one. This bit is cleared 
by the CPU before starting 
the RU, 


and is set by the 82596 after the associated 
buffer has been. This bit has the same 
meaning 
as the Complete 
bit in the RFD and CB. 


- 
This 
14-bit quantity 
indicates 
the number 
of meaningful 
bytes 
in the buffer. 
It is 
cleared 
by the CPU before 
starting 
the RU, and is written 
by the 82596 
after the 
associated 
buffer 
has already 
been used. 
In general, 
after 
the buffer 
is full, the 
Actual 
Count value equals the size field of the same buffer. 
For the last buffer 
of 
the frame, Actual 
Count can be less than the buffer size. 


- 
The offset 
(absolute 
address 
in the Linear mode) of the next RBD on the list. It is 
meaningless 
if EL = 1. 


- 
The starting 
address 
of the memory 
area that contains 
the received 
data. 
In the 
82586 
mode, this is a 24-bit address 
(with pins A24-A31 
=0). 
In the 32-bit 
Seg- 
mented 
and Linear modes 
this is a 32-bit address. 


- 
Indicates 
that the buffer associated 
with this RBD is last in the FBL. 


- 
This bit indicates 
that the 82596 has already prefetched 
the RBDs and any change 
in the RBD data will be ignored. 
This bit is valid only in the new 82596 
memory 


modes, 
and 
if this 
feature 
has 
been 
enabled 
during 
configure 
command. 
The 
82596 Prefetches 
the RBDs in locked cycles; after prefetching 
the RBD the 82596 
performs 
a write cycle where the P bit is set to one and the rest of the data remains 
unchanged. 
The CPU is responsible 
for resetting 
it in all RBDs. The 82596 will not 


check this bit before 
setting 
it. 


- 
This 14-bit quantity 
indicates 
the size, in bytes, of the associated 
buffer. This quan- 


tity must be an even number. 
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PGA PACKAGE 
THERMAL 
SPECIFICATION 


NOTICE: This is a production data sheet. The specifi- 
cations are subject to change without notice. 


Parameter 
Thermal 
Resistance 


8JC 
3·C/W 


8JA 
24·C/W 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause permanent 
damage. 


These are stress ratings 
only. Operation 
beyond 
the 


"Operating 
Conditions" 
is not recommended 
and ex- 
tended 
exposure 
beyond 
the 
"Operating 
Conditions" 


may affect device reliability. 


ELECTRICAL 
AND TIMING 
CHARACTERISTICS 


Absolute 
Maximum Ratings 


• 
Storage Temperature 
- 65·C to + 150·C 


• 
Case Temperature 
under Bias -: 65·C to + 11O·C 


• 
Supply Voltage 
with Respect to VSS 
-0.5V 
to + 6.5V 


• 
Voltage on Other Pins 
-0.5V 
to vcc + 0.5V 


DC Characteristics 


Tc = 0·C-85·C. 
vcc = 5V ± 10% LE/BE 
have MOS levels (see VMll. 
VMIH). 


All other signals have TTL levels (see Vll. 
VIH. VOL. VOH). 


Symbol 
Parameter 
Mln 
Max 
Units 
Notes 


Vll 
Input Low Voltage 
(TTL) 
-0.3 
+0.8 
V 
J 


VIH 
Input High Voltage 
(TTL) 
2.0 
vcc + 0.3 
V 


VMll 
Input Low Voltage 
(MOS) 
-0.3 
+0.8 
V 


VMIH 
Input High Voltage 
(MOS) 
3.7 
vcc + 0.3 
V 


VOl 
Output 
Low Voltage 
(TTL) 
0.45 
V 
IOl = 4.0mA 


Vcu, 
RXC. TXC Input Low Voltage 
-0.5 
0.6 
V 


VCIH 
RXC. TXC Input High Voltage 
3.3 
Vcc+0.5 
V 


VOH 
Output High Voltage 
(TTL) 
2.4 
V 
IOH = 0.9 mA-1 
mA 


lu 
Input Leakage Current 
±15 
/LA 
o $; VIN s vcc 


ILO 
Output Leakage Current 
±15 
/LA 
0.45 < VOUT < vcc 


CIN 
Capacitance 
of Input Buffer 
10 
pF 
FC = 1 MHz 


COUT 
Capacitance 
of Input/Output 
12 
pF 
FC = 1 MHz 
Buffer 


CClK 
CLK Capacitance 
20 
pF 
FC = 1 MHz 


Icc 
Power Supply 
200 
mA 
At 25 MHz 
Icc Typical = 100 mA 


Icc 
Power Supply 
300 
mA 
At 33 MHz 
lee Typical = 150 mA 
. 
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AC Characteristics 


82596CA 
C-STEP 
INPUT IOUTPUT 
SYSTEM 
TIMINGS 


Tc = 0·C-+85·C, 
vcc = 5V ± 10%. These timing assume 
the CL on all outputs 
is 50 pF unless otherwise 
specified. 
CL can be 20 pF to 120 pF however 
timings 
must be derated. 
All timing requirements 
are given in 
nanoseconds. 


16MHz 
Symbol 
Parameter 
Mln 
Max 
Notes 


Operating 
Frequency 
12.5 MHz 
16 MHz 
1X ClK 
Input 


T1 
ClK 
Period 
62.5 
80 


T1a 
ClK 
Period Stability 
0.1% 
Adjacent 
ClK 
/l. 


T2 
ClK 
High 
20 
2.0V 


T3 
ClK 
low 
20 
0.8V 


T4 
ClK 
Rise Time 
8 
0.8Vt02.0V 


T5 
ClKFallTime 
8 
2.0Vto 
0.8V 


T6 
BEn, lOCK, 
and A2-A31 
Valid Delay 
3 
23 


T6a 
BLAST, PCHK Valid Delay 
3 
32 


T7 
BEn, U5CK, BLAST, A2-A31 
Float Delay 
3 
39 


T8 
WIR and ADS Valid Delay 
3 
23 


T9 
WIR and ADS Float Delay 
3 
39 


T10 
00-031, 
DPn Write Data Valid Delay 
3 
27 


T11 
00-031, 
DPn Write Data Float Delay 
3 
39 


T12 
HOLD Valid Delay 
2 
. 30 


T13 
CA and BREQ Setup Time 
11 
1,2 


T14 
CA and BREQ Hold Time 
6 
1,2 


T15 
BS16 Setup Time 
12 
2 


T16 
BS16 Hold Time 
5 
2 


T17 
BRDY, RDY Setup Time 
12 
2 


T18 
BRDY, RDY Hold Time 
5 
2 


T19 
00-031, 
DPn READ Setup Time 
10 
2 


T20 
00-031, 
DPn READ Hold Time 
6 
2 


T21 
AHOlD 
and HlDA 
Setup Time 
15 
1,2 


T22 
AHOlD 
Hold Time 
5 
1,2 


T22a 
HlDA 
Hold Time 
5 
1,2 


T23 
RESET Setup Time 
14 
1,2 


T24 
RESET Hold Time 
5 
1,2 


T25 
INT liNT 
Valid Delay 
1 
23 


T26 
CA and BREQ, PORT Pulse Width 
2T1 
1,2,3 


T27 
00-031 
CPU PORT Access Setup Time 
10 
2 


T28 
00-031 
CPU PORT Access 
Hold Time 
6 
2 


T29 
PORi Setup Time 
11 
2 


T30 
PORT Hold Time 
5 
·2 


T31 
BC5FF Setup Time 
12 
2 


T32 
BOFF Hold Time 
5 
2 


"TImings 
shown 
are for the 82596CA 
C-Stepplng. 
For information 
regarding 
timings 
for the 82596CA 
A 1 or B-Step, 
contact 
your local Intel representative. 
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AC Characteristics 
(Continued) 


82596CA 
C-STEP 
INPUT IOUTPUT 
SYSTEM 
TIMINGS 


Tc = O·C- + 85·C, vcc = 5V ± 10%. These timing assume 
the CL on all outputs 
is 50 pF unless otherwise 
specified. 
CL can be 20 pF to 120 pF however 
timings 
must be derated. 
All timing requirements 
are given in 
nanoseconds. 


20MHz 
Symbol 
Parameter 
Mln 
Max 
Notes 


Operating 
Frequency 
12.5 MH.z 
20MHz 
1X ClK 
Input 


T1 
ClK 
Period 
50 
80 


T1a 
ClK 
Period Stability 
0.1% 
Adjacent 
ClK 
~ 


T2 
ClK 
High 
16 
2.0V 


T3 
ClKlow 
16 
0.8V 


T4 
ClK 
Rise Time 
6 
0.8Vt02.0V 


T5 
ClKFallTime 
6 
2.0V to 0.8V 


T6 
BEn, LC5CK, and A2-A31 
Valid Delay 
3 
20 


T6a 
BLAST, PCHK Valid Delay 
3 
25 


T7 
BEn, LC5CK, BLAST, A2-A31 
Float Delay 
3 
34 


T8 
WIR and ADS Valid Delay 
3 
20 


T9 
W/R and ADS Float Delay 
3 
34 


T16 
00-031, 
DPn Write Data Valid Delay 
3 
23 


T11 
00-031, 
DPn Write Data Float Delay 
3 
34 


T12 
HOLD Valid Delay 
2 
25 


T13 
CA and BREa 
Setup Time 
10 
1,2 


T14 
CA and BREa 
Hold Time 
6 
1,2 


T15 
BS 16 Setup Time 
12 
2 


T16 
BS16 Hold Time 
4 
2 


T17 
BRDY, RDY Setup Time 
12 
2 


T18 
BRDY, RDY Hold Time 
4 
2 


T19 
00-031, 
DPn READ Setup Time 
6 
2 


T20 
00-031, 
DPn READ Hold Time 
5 
2 


T21 
AHOlD 
and HlDA 
Setup TIme 
15 
1,2 


T22 
AHOlD 
Hold Time 
4 
1,2 


T22a 
HlDA 
Hold Time 
, 
5 
1,2 


T23 
RESET Setup Time 
12 
1,2 


T24 
RESET HoldTime 
4 
1,2 


T25 
INT liNT 
Valid Delay 
1 
23 


T26 
CA and BREa, 
PORT Pulse Width 
2T1 
1,2,3 


T27 
00-031 
CPU PORT Access Setup Time 
6 
2 


T28 
00-031 
CPU PORT Access 
Hold Time 
5 
2 


T29 
PORT Setup Time 
10 
2 


T30 
PORT Hold Time 
5 
2 


T31 
BOFF Setup Time 
12 
2 


T32 
BOFF Hold Time 
4 
2 


"Timings 
shown 
are for the 82596CA 
C-Stepping. 
For information 
regarding 
timings 
for the 82596CA 
A 1 or B-Step, 
contact 
your local 
Intel representative. 
' 


L 
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T3 
CLKLow 
14 
0.8V 


T4 
CLK Rise Time 
4 
0.8Vto2.0V 


T5 
CLKFallTime 
4 
2.0Vto 
0.8V 


T6 
BEn Valid Delay 
3 
17 


T6a 
BLAST Valid Delay 
3 
20 


T6b 
LOCK Valid Delay 
3 
18 


T6c 
A2-A31 
Valid Delay 
3 
18 


T6d 
PCHK Valid Delay 
3 
24 


T7 
BEn, LOCK, BLAST, A2-A31 
Float Delay 
3 
30 


T8 
W/R and ADS Valid Delay 
3 
19 


T9 
W/R and ADS Float Delay 
3 
30 


T10 
00-031, 
DPn Write Data Valid Delay 
3 
20 


T11 
00-031, 
DPn Write Data Float Delay 
3 
30 


T12 
HOLD Valid Delay 
3 
19 


T13 
CA and BREQ Setup Time 
7 
1,2 


T14 
CA and BREQ Hold Time 
3 
1,2 


T15 
BS16 Setup Time 
8 
2 


T16 
BS16 Hold Time 
3 
2 


T17 
BRDY Setup Time 
9 
2 


T17a 
RDY Setup Time 
' 
8 
2 


T18 
BRDY, RDY Hold Time 
3 
2 


T19 
00-031, 
DPn READ Setup Time 
6 
2 


T20 
00-031, 
DPn READ Hold Time 
4.5 
2 


T21 
AHOLD and HLDA Setup Time 
10 
1,2 


T22 
AHOLD 
Hold Time 
3 
1,2 


T22a 
HLDA Hold Time 
3 
1,2 


T23 
RESET Setup Time 
10 
1,2 


T24 
RESET Hold Time 
3 
1,2 


T25 
INT liNT 
Valid Delay 
1 
20 


'TImings 
shown 
are for the 82596CA 
C-Stepplng. 
For information 
regarding 
timings 
for the 82596CA 
A 1 or B-Step, 
contact 
your local 
Intel representative. 
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AC Characteristics 
(Continued) 


82596CA 
C-STEP 
INPUT/OUTPUT 
SYSTEM 
TIMINGS 


Tc = o°C_ + 85°C, vcc = 5V ± 10%. These timing assume the CL on all outputs 
is 50 pF unless otherwise 
specified. 
CL can be 20 pF to 120 pF however 
timings 
must be derated. 
All timing requirements 
are given in 
nanoseconds. 
' 


25MHz 
Symbol 
Parameter 
Notes 
Mln 
Max 


T26 
CA and BREQ, PORT Pulse Width 
2T1 
1,2,3 


T27 
00-031 
CPU PORT Access Setup Time 
6 
, 
2 


T28 
00-031 
CPU PORT Access 
Hold Time 
4.5 
2 


T29 
PORT Setup Time 
7 
2 


T30 
PORT Hold TIme 
3 
2 


T31 
BOFF Setup Time 
10 
2 


T32 
BOFF Hold Time 
3 
2 


"Timings 
shown 
are for the 82596CA 
C-Stepping. 
For information 
regarding 
timings 
for the 82596CA 
A 1 or B-Step, 
contact 
your local Intel representative. 


I 


• 
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AC Characteristics 
(Continued) 


82596CA 
C·STEP 
INPUT IOUTPUT 
SYSTEM 
TIMINGS 


TC = O°C to + 85°C, vcc = 5V ±5%. 
These timing assume 
the Cl on all outputs 
is 50 pF unless otherwise 
specified. 
Cl can be 20 pF to 120 pF, however 
timings 
must be derated. 
All timing requirements 
are given in 
nanoseconds. 


33MHz 
Symbol 
Parameter 
Notes 
Mln 
Max 


Operating 
Frequency 
12.5 MHz 
33MHz 
1XClK 
Input 


T1 
ClKPeriod 
30 
80 


T1a 
ClK 
Period Stability 
0.1% 
Adjacent 
ClK 
!:J. 


T2 
ClK 
High 
11 
2.0V 


T3 
ClKlow 
11 
0.8V 


T4 
ClK 
Rise Time 
3 
0.8V to 2.0V 


T5 
ClK 
Fall Time 
3 
2.0V to 0.8V 


T6 
BEn Valid Delay 
3 
17 


T6a 
BLAST Valid Delay 
3 
20 


T6b 
lOCK 
Valid Delay 
3 
16 


T6c 
A2-A31 
Valid Delay 
3 
18 


T6d 
PCHK Valid Delay 
3 
23 


T7 
BEn, IOCR, BLAST, A2-A31 
Float Delay 
3 
20 


T8 
W/R and ADS Valid Delay 
3 
16 


T9 
W/R and ADS Float Delay 
3 
20 


T10 
DO-D31, 
DPn Write Data Valid Delay 
3 
19 


T11 
DO-D31, 
DPn Write Data Float Delay 
3 
20 


T12 
HOLD Valid Delay 
3 
19 


T13 
CA and BREQ Setup Time 
7 
1,2 


T14 
CA and BREQ Hold Time 
3 
1,2 


T15 
BS16 Setup Time 
7 
2 


T16 
B516 Hold Time 
3 
2 


T17 
BRDY Setup Time 
9 
2 


T17a 
RDY Setup Time 
8 
2 


T18 
BRDY, RDY Hold Time 
3 
2 


T19 
DO-D31, 
DPn READ Setup Time 
6 
2 


T20 
DO- D31, DPn READ Hold Time 
4.5 
2 


T21 
AHOlD 
Setup Time 
10 
1,2 


T21a 
HlDA 
Setup Time 
8 
1,2 


T22 
AHOlD 
Hold Time 
3 
1,2 
.. 
"Timings 
shown 
are for the 82596CA 
C-Stepplng. 
For Information 
regarding 
timings 
for the 82596CA 
A 1 or B-Step. 
contact 
your local 
Intel representative. 


4-62 
I 


82596CA 


AC Characteristics 
(Continued) 


82596CA 
C-STEP 
INPUT IOUTPUT 
SYSTEM 
TIMINGS 


CL on all outputs 
is 50 pF unless otherwise 
specified. 
All timing requirements 
are given in nanoseconds. 


33MHz 
Symbol 
Parameter 
Mln 
Max 
Notes 


T22a 
HLOA Hold Time 
3 
1.2 


T23 
RESET Setup Time 
9 
1.2 


T24 
RESET Hold Time 
3 
1.2 


T25 
INT /INT Valid Oelay 
1 
20 


T26 
CA and BREQ. PORT Pulse Width 
2T1 
1.2.3 


T27 
00-031 
CPU PORT Access Setup Time 
6 
2 


T28 
00-031 
CPU PORT Access 
Hold Time 
4.5 
2 


T29 
PORT Setup Time 
7 
2 


T30 
PORT Hold Time 
3 
2 


T31 
BOFF Setup Time 
10 
2 


T32 
BOFF Hold Time 
3 
. 
2 


NOTES: 
"Timings 
shown 
are for the 82596CA 
C-stepping. 
For information 
regarding 
timings 
for the 82596CA 
A 1 or B-step. 
contact 
your local Intel representative. 
. 
4 


1. RESET. 
HLDA. and CA are internally 
synchronized. 
This timing 
is to'guarantee 
recognition 
at next clock 
for RESET. 
HLDA 
and CA. 
2. All set-up. 
hold and delay timings 
are at maximum 
frequency 
specification 
Fmax. and must be derated 
according 
to the 
following 
equation 
for operation 
at lower frequencies: 


Tderated 
= (Fmax/Fopr) 
x T 
where: 
Tderate 
= Specifies 
the value to derate 
the specification. 
Fmax = Maximum 
operating 
frequency. 
Fopr = Actual 
operating 
frequency. 


T = Specification 
at maximum 
frequency. 
This calculation 
only 
provides 
a rough 
estimate 
for derating 
the frequency. 
For more 
detailed 
information. 
contact 
your 
Intel Sales Office 
for the data sheet 
supplement. 


::I. CA pulse width 
need only be 1 T1 wide if the set up and hold times 
are met; BREQ 
must meet setup and hold times 
and 
need only be 1 T1 wide. 


TRANSMIT/RECEIVE 
CLOCK 
PARAMETERS 


Symbol 
Parameter 
20MHz 
Notes 
Mln 
Max 


T36 
TxCCycle 
50 
1.3 


- 
T38 
TxC Rise Time 
5 
1 


T39 
TxC Fall Time 
5 
1 


T40 
TxC High Time 
19 
1.3 


T41 
TxCLowTime 
18 
1.3 


T42 
TxO Rise Time 
10 
4 


T43 
TxO Fall Time 
10 
4 


T44 
TxO Transition 
20 
2.4 


T45 
TxC Low to TxO Valid 
25 
4.6 


T46 
TxC Low toTxO 
Transition 
25 
2.4 


T47 
TxC High to TxO Transition 
25 
2.4 


T48 
TxC Low to TxO High (At End of Transition) 
25 
4 
I 
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T54 
RXC Rise Time 
5 
1 


T55 
RXC Fall Time 
5 
1 


T56 
RXC High Time 
19 
1 


T57 
RXCLowTime 
18 
1 


RECEIVED 
DATA PARAMETERS 


T58 
RXD Setup Time 
20 
6 


T59 
RXD Hold Time 
10 
6 


T60 
RXD Rise Time 
10 


T61 
RXDFallTime 
10 


CRS AND CDT PARAMETERS 


T62 
CDT Low to TXC HIGH 
20 
External Collision 
Detect Setup Time 


T63 
TXC High to CDT Inactive, CDT Hold Time 
10 


T64 
CDT Low to Jam Start 
10 


T65 
CRS Low to TXC High, 
20 
Carrier Sense Setup Time 


T66 
TXC High to CRS Inactive, CRS Hold Time 
10 
(Internal Collision 
Detect) 


T67 
CRS High to Jamming 
Start, 
12 


T68 
Jamming 
Period 
11 


T69 
CRS High to RXC High, 
30 
CRS Inactive Setup Time 


T70 
RXC High to CRS High, 
10 
CRS Inactive Hold, Time 
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TRANSMIT 
IRECEIVE 
CLOCK 
PARAMETERS 
(Continued) 


Symbol 
I 


Parameter 
I 
20MHz 
I 
Notes 
I 
Mln 
I 
Max 
I 


INTERFRAME 
SPACING 
PARAMETERS 


T71 
I 
Interframe 
Delay 
I 
I 
I 
9 


EXTERNAL 
LOOPBACK-PIN 
PARAMETERS 


T72 
I 
TXC Low to LPBK Low 
I 


, 
• 
I 
T36 
I 
4 
T73 
I 
TXC Low to LPBK High 
I 
I 
T36 
I 
4 


NOTES: 
1. Special 
MOS levels. Vcu, = 0.9V and VCIH = 3.0V. 


2. Manchester 
only. 


3. Manchester. 
Needs 
50% 
duty cycle. 


4. 1 TTL load + 50 pF. 
5. 1 TTL load + 100 pF. 
6. NRZ only. 
7. Abnormal 
end of transmission-CTS 
expires 
before 
RTS. 
8. Normal 
end to transmission. 


9. Programmable 
value: 
T71 = NIFS. 
T36 
where: 
NIFS = the IFS configuration 
value 
(if NIFS is less than 
12 then 
NIFS is forced 
to 12). 


10. Programmable 
value: 


T64 
= (NCDF. 
T36) + x» T36 
(If the collision 
occurs 
after the preamble) 
where: 
NCDF = the collision 
detect 
filter configuration 
value, 


and 
x= 
12,13, 
14,or15 


11. T68 = 32.T36 
12. Programmable 
value: 


T67 = (NCSF. 
T36) + x» T36 
where: 
NCSF = the Carrier 
Sense 
Filter configuration 
value, 
and 
x = 12,13,14, 
or 15 


13. To guarantee 
recognition 
on the next clock. 


I 
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82596CA 
BUS OPERATION 


The following 
figures 
show the 82596CA 
basic bus cycle and basic burst cycle. 


Please refer to the 32-8it 
LAN Component 
User's Manual 


T1 
T2 
tlDlE 
T2 
T1 
T2 
T1 
T2 
tlDlE 
T1 


ClK 


ADS 


A31-A2 
I 
I 
BEO-3 


w/fi. 
, 
! 


RDY 


, 
, 
, 
, 


BLAST 
en en 
, 
, 
, 
, 
, 0 


, 


< 
DATA 
,,,, 


PCHK 
Ll1 


READ 
WRITE 


I 
I 


'\-""';""-..;,......J! 


, 
, 
cb cb c 
, 
, 
, 
, 
~-C:::r-)+-' ---(O~----' CJ- 


Ll1 


READ 
,WRITE 
290218-40 


Figure 44. Basic 82596CA 
Bus Cycle 
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Figure 45. Basic 82596CA 
Burst Cycle 
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SYSTEM INTERFACE A.C. TIMING CHARACTERISTICS 


The measurements should be done at: 
• TC = OOGto +85°G, VCC~= 5V ±10%, G = 50 pF unless otherwise specified. 
• A.G. testing inputs are driven at 2.4V for a logic "1" and 0.45V for a logic "0". 


• Timing measurements are made at 1.5V for both logic "1" and "0". 
• Rise and Fall time of inputs and outputs signals are measured between 0.8V and 2.0V respectively unless 
otherwise specified. 
• All timings are relative to GlK crossing the 1.5V level. 


• All A.G. parameters are valid only after 100 p.sfrom power up. 


2.• V ~ 
1.5V Test PoInty- 
M5V --1\.:. 
~ 
290218-18 


t---- T1----I 
290218-19 


Figure 46. ClK Timings 


Two types of timing specifications are presented below: 
1. Input Timing-minimum 
setup and hold times. 
2. Output Timings--output delays and float times from GlK rising edge. 


Figu're47 defines how the measurements should be done: 


elK 


1.5V 


LEGEND: 
Ts = Input Setup Time 
Th = Input Hold Time 
Tn = Minimum 
output 
delay or Mininum 
float delay 
Tx = Maximum 
output 
delay or Maximum 
float delay 


290218-20 


Figure 47. Drive levels and Measurements Points for A.C. Specifications 


Ts = T13, T15, T17, T19, T21, T23, T27, T29, T31 
Th = T14, T16, T18, T20, T22, T22a, T24, T28, T30, T32 
Tn = T6, T6a, T7, TB, T9, T10, T11, T12, T25 
Tx = T6, T6a, T7, TB, T9, T10, T11, T12, T25 
I 


• 
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J 
~2596CA 


~NPUTWAVEFORMS 


ClK 


290218-21 


BREQ 
CA 


Figure 48. CA and BREQ Input Timing 
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Figure 49. INT liNT 
Output Timing 
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Figure 50. HOLD/HLDA 
Timings 
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Figure 51. Input Setup and Hold Time 
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w/fl, 
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Figure 52. Output Valid Delay Timing 
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Figure 53. Output Float Delay Timing 
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Figure 54. PORT Setup and Hold Time 
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RESET 
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Figure 55. RESET Input Timing 


SERIAL 
AC TIMING 
CHARACTERISTICS 
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Figure 56. Serial Input Clock Timing 
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Figure 57. Transmit 
Data Waveforms 
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Figure 58. Transmit Data Waveforms 
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Figure 59. Receive Data Waveforms (NRZ) 
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Figure 60. Receive Data Waveforms (CRS) 
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Family: Ceramic 
Pin Grid Array Package 


Symbol 
Mllllmeters 
Inches 


Mln 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
0.f40 
0.180 


A1 
0.76 
1.27 
Solid Lid 
0.030 
0.050 
Solid Lid 


A2 
2.67 
3.43 
Solid Lid 
0.105 
0.135 
Solid Lid 


A3 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


0 
36.45 
37.21 
1.435 
1.465 


01 
32.89 
33.15 
1.295 
1.305 


61 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
132 
132 


S1 
1.27 
2.54 
0.050 
0.100 
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Intel Case Outline 
Drawings 
Plastic 
Quad Flat Pack (PQFP) 
0.025 Inch (0.635mm) 
Pitch 


Symbol 
Description 
Mln 
Max 
Mln 
Max 
Mln 
Max 
Mln 
Max 
Mln 
Max 
Mln 
Max 


N 
Leadcount 
68 
84 
100 
132 
164 
196 


A 
Package 
Height 
0.160 
0.170 
0.160 
0.170 
0.160 
0.170 
0.160 0.170 
0.160 
0.170 
0.160 
0.170 


A1 
Standoff 
0.020 
0.030 
0.020 0.030 
0.020 
0.030 
0.020 
0.030 
0.020 
0.030 
0.020 
0.030 


D,E 
Terminal 
Dimension 
0.675 
0.685 
0.775 
0.785 
0.875 
0.885 
1.075 
1.085 
1.275 
1.285 
1.475 
1.485 


D1, E1 
Package 
Body 
0.547 
0.553 
0.647 
0.653 
0.747 
0.753 
0.947 
0.953 
1.147 
1.153 
1.347 
1.353 


D2,E2 
Bumper Distance 
0.697 
0.703 
0.797 
0.803 
0.897 
0.903 
1.097 
1.103 
1.297 
1.303 
1.497 
1.503 


D3,E3 
Lead Dimension 
0.400 REF 
0.500 REF 
0.600 REF 
0.800 REF 
1.000 REF 
1.200 REF 


D4,E4 
Foot Radius Location 
0.623 
0.637 
0.723 
0.737 
0.823 
0.837 
1.023 
1.037 
1.223 
1.237 
1.423 
1.437 


L1 
Foot Length 
0.020 
0.030 
0.020 0.030 
0.020 
0.030 
0.020 0.030 
0.020 
0.030 
0.020 
0.030 
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INCH 


Symbol 
Description 
Mln 
Max 
Mln 
Max 
Mln 
Max 
Mln 
Max 
Mln 
Max 
Mln 
Max 


N 
Leadcount 
68 
84 
100 
132 
164 
196 


A 
Package 
Height 
4.06 
4.32 
4.06 
4.32 
4.06 
4.32 
4.06 
4.32 
4.06 
4.32 
4.06 
4.32 


A1 
Standoff 
0.51 
0.76 
0.51 
0.76 
0.51 
0.76 
0.51 
0.76 
0.51 
0.76 
0.51 
0.76 


D,E 
Terminal 
Dimension 
17.15 
17.40 
19.69 
19.94 22.23 
22.48 
27.31 
27.56 
32.39 
32.64 
37.47 
37.72 


D1, E1 
Package 
Body 
13.89 
14.05 
16.43 
16.59 
18.97 
19.13 24.05 
24.21 
29.13 
29.29 
34.21 
34.37 


D2,E2 
Bumper Distance 
17.70 
17.85 20.24 
20.39 
22.78 
22.93 
27.86 
28.01 
32.94 
33.09 
38.02 
38.18 


D3,E3 
Lead Dimension 
10.16 REF 
12.70 REF 
15.24 REF 
20.32 REF 
25.40 REF 
30.48 REF 


D4,E4 
Foot Radius Location 
15.82 
16.17 
18.36 
18.71 21.25 
21.25 
25.89 
26.33 
31.06 
31.41 
36.14 
36.49 


L1 
Foot Length 
0.51 
0.76 
0.51 
0.76 
0.51 
0.76 
0.51 
0.76 
0.51 
0.76 
0.51 
0.76 


Issue 
IWS Preliminary 
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Figure 61. Principal Dimensions 
and Datums 
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Figure 62. Molded 
Details 
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Figure 63. Terminal 
Details 


0.31 
(.012)-11- 
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<'005)@[~]A®-B®~ 
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Figure 64. Typical 
Lead 
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Figure 65. Detail M 


3. Recommendation to use only one type of buffer 
(either Simplified or Flexible) in any given linked 
list. 


4. Added detailed description regarding operation 
or RCVCDT counter. 
5. Added New Enhanced Big Endian Mode section. 
The New Enhanced Big Endian Mode applies 
only to the 82596 C-stepping. 
6. Added programming recommendations regarding 
RU and CU Start commands. These warn against 
Starting the CU while it is Active and Starting the 
RU while it is Ready. 
7. Emphasized that the TDR command is a static 
command and should not be used in an active 
network. 


8. Improved 82596CA C-step timings were added 
for all speeds. 


REVISION 
SUMMARY 


The following represents the key differences be- 
tween version 004 and version 005 of the 82596CA 
Data Sheet. 
1. Timings added for -16 MHz and -20 MHz specfi- 
cations. 


The following represents the key differences be- 
tween version 005 and version 006 of the 82596CA 
Data Sheet. 
1. A description of the 82596CA C-stepping en- 
hancements was added and the 82596CA B-step 
information was removed. 
2. Description of BOFF pin changed. BOFF may be 
asserted in T1 in the 82596 C-step. 
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QT960 EVALUATION AND PROTOTYPING 
BOARD 


270743-1 


LOW COST EVALUATION TOOL 
The QT960 products give you a 32-bit starter kit to begin software evaluation and 
hardware 
design at a low cost. The boards feature the 20 MHz 80960KB 32-bit embedded 
processor. The 80960KB has integrated 
floating point, instruction 
and register caches, 
and an on-chip interrupt 
controller. The 80960K-series are the first in a new 
architectural 
family of embedded processors from Intel built using Intel's CHMOS IVt 
process. These boards provide you with full access to the features of the 80960KB 
processor. A wire wrap prototyping area offers you easy access to board features to test 
your designs. Interleaved 
EPROM means fast execution of your code taking advantage of 
the 8096OKB's burst bus. A programmable 
wait state generator simulates different 
memory environments 
useful in evaluating the performance of your code. These features 
make the QT960 boards useful low cost tools for the 32-bit embedded designer. 


Once written, you can debug your program with NINDY, an EPROM resident debug 
monitor. NINDY enables you to download code, set seven different trace modes, display 
and modify memory or registers, and disassemble problem code sequences. 


Available separately 
from Intel are the ASM-960 (assembly language) and iC-960 (high- 
level language) products which provide you with the code development environment 
for 
the QT960 boards. 


The starter 
kit comes in two versions: the QT960F version has fast SRAM, high speed 
EPROM and Flash memory; the QT960E version has lower cost SRAM, Flash memory 
and no high speed EPROM. Each version has NINDY in either EPROM (QT960F) or 
Flash memory (QT960E), power supply cable, and the QT960 User Manual. Both versions 
also include the parts list, source code of the debug monitor, and the board data base 
(schematics) all on diskette. Armed with this starter kit you now have a system to 
evaluate and prototype your product ideas quickly and at low cost. 


November 
1990 
Order 
Number: 
270743-002 
5-1 
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FEATURES 


QT960 FEATURES 
• 20 MHz Execution 
Speed 
• 128K Bytes to Zero Wait State EPROMt 
• 128K Bytes of Flash Memory 
• 128K Bytes of Zero Wait State SRAMt 
• Programmable 
Wait State Generator 


• Prototyping 
Wire Wrap Area 
• Five Instruction 
Traces 
• Two Hardware 
Breakpoints 
Product 
Order Codes: EVQT960F20 and EVQT960E20 


• Display/Modify 
Memory and Registers 
• Code Disassembly 
• High Level Language 
Support 
• RS-232 Communications 
Link 
• The QT960E Version has 128K Bytes of Two 
Wait State SRAM and 128K Bytes Four Wait 
State Flash Memory 


tCHMOS IV is a patented InOOlproceas. 
*QT900F 
Version 
only. 


FAST AND EASY CODE UPDATES 


128K Bytes of Intel's 28F256 Flash memory provides an easy and quick method of changing 
your 
code in nonvolatile 
memory. Flash memory may be conveniently 
reprogrammed 
without 


removing it from the board while software is under development. 


FAST EPROM 


Interleaved 
fast EPROM (lntel's 27C202) on the QT960F version yields one-zero-zero-zero wait 


state code access. It efficiently 
utilizes the four word burst capabilities 
of the 80960KB bus 


maximizing 
program 
performance. 
' 


PROTOTYPING SUPPORT 


A prototyping 
wire wrap area is provided on board with access to the system's signals and buses. 


This area gives you access to the board's features 
and allows you to easily test design ideas. A 


system bus connector 
is also provided for off board prototyping. 


PROGRAMMABLE 
WAIT STATE GENERATOR 


A software programmable 
wait state generator 
enables you to quickly model various memory 


speeds. Under software control you can set over 16 different 
wait state combinations 
and evaluate 


the performance 
of your target system. 


DMA 


The board offers you eight DMA channels 
accessed through 
a NINDY library function 
using 


Intel's 82380. In addition, off board connectors 
provide DMA 110 capabilities. 


FIVE INSTRUCTION TRACES AND TWO HARDWARE 
BREAKPOINTS 


NINDY utilizes the built-in trace capabilities 
of the 80960KB to provide you with single step, 
supervisor, 
call, return, 
and branch 
instruction 
tracing offering you extensive 
debug capabilities 


for software examination 
and modification. 
Two hardware 
breakpoints 
enable you to break on 
and examine 
EPROM resident 
code. 


5-2 


infel· 


FEATURES 


HIGH LEVEL LANGUAGE SUPPORT 
NINDY is capable of downloading 
absolute object code generated 
by ASM-960 or iC-960. ASM-960 
and iC-960 may be purchased 
separately 
from Intel. 


COMMUNICATION AND SOFTWARE REQUIREMENTS 
The QT960 boards communicate 
with the host through 
the RS-232 link using an Intel82510 
UART provided on board. The boards support five baud rates: 1200,2400,9600,19200, 
and 38400. 


The default is 9600 baud. To communicate 
with the QT960 boards you must meet the following 
minimum 
software requirements: 


• Terminal 
Emulator 
• XMODEM Download Capabilities 


80960KB -I WAIT STATE I 
82380 
GENERATOR 
82510 
CPU 
SYSTE~ 
SERIAL 
SUPPORT 
f- 
CONTROLLER 
PERIPHERAL 
- 


ADDRESS 
I 
I 
T 
LATCHES 
U= 
I 
I 
I 


EPRO~ 
rLASH 
- 
128K 
BYTES 
~E~ORY 
SRA~ 


CLOCK 
128K 
BYTES 
RESET 
128K 
BYTES 
I 


WIRE 
WRAP 
PINS 


err 
BOARD 
CONNECTOR 


270743·2 


Block Diagram 
of the QT960 Board 


For information or the number of your nearest sales office call ~752 
(D.S. and Canada). 
Intel Corporeton, Literature 
Department, 
3065 Bowers Avenue. Santa Clara CA 95051. United States. Tel: 408-987-8080. • 
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MICROPROCESSOR FAMILY 
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PRODUCT OVERVIEW 
In recent years, embedded designs have grown in sophistication, 
encompassing 
many 


years of development 
and thousands 
of lines of code. Due to the size, complexity 
and 


constantly 
evolving desire to stay competitive, 
designers 
have turned 
to high-level 
languages 
for productivity 
and RISC-based processors for performance. 


Advanced 
processor technology 
such as RISC and supersca1ar 
has promised enhanced 


performance 
through 
the use of pipelining, 
multiple 
execution 
units, and generally, 
more 


efficient code execution. 
Achievement 
of this performance, 
however, has been 


constrained 
by the lack of compiler technology 
to take advantage 
of it. 


Intel's continuous 
investment 
in compiler technology 
allows user applications 
to deliver 
maximum 
performance 
from the i960 processor family. CTOOLS960 exploits the i960 


processor features 
and implements 
the most advanced optimization 
technique-profIle 


driven optimizations, 
where code optimizations 
are based on application 
runtime 


behavior, 
thus achieving superior 
performance 
compared with code generated 
by 
conventional 
global optimizations. 


i960- 
is a registered 
trademark 
of Intel Corporation. 
"Otherbrands 
and names 
are property 
of their respective 
owners. 
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C PROGRAMMING TOOLS FOR THE i960® 
MICROPROCESSOR FAMILY 


PRODUCT HIGHLIGHTS 
• CI'OOLS960 consists of a high-performance 
profiling compiler, profiling tools, an 
assembler/linker 
and utility tools 
• Fully conforms to ANSI standard, passes 
Plum Hall" and Perennial" 
tests 
• Produces superior quality, highly optimized 
code for i960 microprocessor family 
• Tightly integrated 
with DB960 source-level 
debugger and in-circuit emulators 
• Operates on a variety of host operating 
systems 
• Supports in-line assembly code in C source 
• Big-endian support (i960 CA/i960 CF 
processors) 
• Faster compile time and smaller code with 
Revision 4.0 
PROFILE-DRIVEN COMPILER 
OPTIMIZATIONS 
Profile-driven compiler optimization is a 
technique wherein the compiler reads a 
runtime profile of the source code being 
compiled, and makes code optimization 
decisions based on that profile. The runtime 
profile is gathered by instrumentation 
code 
inserted by the compiler during a preliminary 
compilation. The instrumented 
program is 
executed using typical data and the resultant 
program profile is saved. In a subsequent 
compilation, the run time profile is correlated 
to the source code and an optimized program 
generated. 


Code optimizations based on a program profile 
are more effective than traditional 
global 
optimizations. Conventional optimization 
strategies are limited by static knowledge of 
the program: the text of the program is 
extrapolated 
to its runtime behavior. Since the 
compiler has no idea which parts of the 
program influence execution time, all parts of 
the program receive equal attention. 
In contrast, profile-driven optimizations are 
based on known run time characteristics 
ofthe 
program. They go beyond traditional 
local and 
global optimizations, to become application- 
specific. The profile information 
is used by the 
compiler in a number of ways: 
• Additional resources (e.g., registers) can be 
allocated to parts of the program that 
execute more often. 


• Loops may be transformed 
to separate 
frequently executed paths from rarely 
executed ones, reducing interference 
between code segments and opening up more 
code motion opportunities. 
(Superblock 
Formation) 
• Frequently 
taken paths through the program 
code may be placed in sequence, increasing 
instruction 
cache hit rates. (Basic Block 
Rearrangement) 
• Program-level function inlining reduces call 
overhead and creates more optimization 
opportunities. 
(Inter-module Inlining) 


• Heavily used global variables may be placed 
in faster memory or in on-chip data RAM. 
All of these optimizations have been 
implemented in the Intel CI'OOLS960 
compiler tool set. 


i960® ARCHITECTURE- 
SPECIFIC OPTIMIZATIONS 
These optimizations are implemented 
to 
ensure the application can benefit from all 
features of the architecture. 
• Specialized 
instruction 
selection. 
The 
compiler makes use of the i960 architecture's 
specialized instructions 
such as clrbit, setbit, 


nand, and ornot instructions. 


• Intelligent 
register 
manager. 
It performs 
the assignment of all register operands to the 
available general purpose and floating point 
registers. 


• Code scheduling. 
The compiler modifies the 
ordering of instructions 
to increase the 
amount of parallel execution available on the 
specific i960 processor. 


• Use of on-chip 
data RAM for spill 


registers. 
When the i960 processor's 
physical registers are insufficient to meet the 
demands of a function, some registers must 
be "spilled" to some slower storage. For the 
i960 CA/i960 CF processor, you may "spill" 
into the on-chip data RAM instead of the 
slower off-chip memory. 


• Complex 
addressing 
modes (reducing 
instruction 
fetches 
and code space). By 
default, the compiler uses complex 
addressing modes that generate 
denser code, 


• 
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generating fewer instructions 
and fetches 
from memory. Users can instruct the 
compiler to retain the RISC form of 
addressing when code compaction is not 
desirable. 
• Branch 
prediction. 
The compiler uses 
profile data to set the branch prediction bit 
for the i960 CA/i960 CF processors to 
maximize the chances of a correct prediction. 
• Identification 
of leaf procedures. 
The 
iC960 compiler identifies procedures that 
contain no further calls. The linker optimizes 
the call to such leaf procedures to use the 
branch-and-link 
mechanism that does not 
allocate a new stack frame or register frame 
and executes faster than the call mechanism. 
ARCHITECTURE 
INDEPENDENT TRADITIONAL 
OPTIMIZATIONS 
The i960 processor compiler performs the 
following local and global optimizations. 


• Constant 
expression 
evaluation. 
The 
iC960 compiler directly evaluates simple 
arithmetic 
expressions containing constants 
and tracks constant values through all the 
computations 
performed. 


• Collapsing 
of arithmetic 
and bitwise 
Boolean 
identities. 
The compiler recognizes 
instances of arithmetic 
and bitwise Boolean 
operations in which one of the operands is an 
identity constant. The unnecessary operation 
is eliminated. 
• Common 
subexpression 
elimination. 
The 
compiler detects multiple occurrences ofthe 
same expression and avoids redundant 
computations by using the result left in the 
register. 


• Register 
subsumption 
or register 
coalescing. 
The compiler coalesces multiple 
registers containing the same value, thus 
eliminating 
a large number of register move 
instructions. 
• Local variable 
promotions. 
The compiler 
promotes to a register location, a variable of 
automatic or register storage class. 


• Tail-call 
elimination. 
When the last 
statement 
in a function is a call, execution 
time can be saved by replacing the call with 
an unconditional branch. 


• Automatic 
procedure 
inlining. 
The 
compiler automatically 
selects functions for 
inlining. Pragmas are also available for user 
control over inlining. 
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• Branch 
optimization&. 
The compiler 
rearranges 
branch instructions 
to minimize 
the number of branches executed. 


• Dead code elimination. 
The compiler 
eliminates code that computes values never 
used and code that cannot be reached. 


• Loop invariant 
code motion. 
The compiler 
identifies computations that do not change 
within the body of a loop and moves those 
computations to a point before the loop 
entry. 


• Induction 
variable 
elimination. 
FOR loop 
array subscripts can be simplified by the 
compiler thus minimizing address 
calculation overhead. 


ASSEMBLER AND LINKER 
The ASM960 assembler tool is based on the 
Free Software Foundation Tools (Note: 
Applications code generated by ASM960 is not 
subject to copyleft. Copyleft applies to 
application code incorporating 
ASM960 source 
code.). The assembler processes assembly code 
produced by the compiler. The i960 processor 
linker links together separately compiled 
modules, performing additional optimizations 
such as replacing calls by branch-and-link 
sequences. 
The ASM960 toolset offers other useful 
utilities such as: 


• Debugging 
aids: COFF dumper and 
mapper. 


• An archiver 
to build libraries in COFL 
format. 


• A COFF stripper 
to eliminate debug 
records from the object module. 


• A big-endian 
to little-endian 
converter 
(on big endian hosts). 


• A ROM builder 
to produce ROMable code. 


• A COFF to IEEE-695 converter. 


LIBRARY SUPPORT 
There are three types of libraries supported by 
CTOOLS960: 


• High levellibraries--contain 
over 200 
ANSI conforming and ANSI superset 
functions. 


C PROGRAMMING TOOLS FOR THE i960® 
MICROPROCESSOR FAMILY 


281434-2 


• Floating-point 
support-the 
Accelerated 
Floating Point library provides highly 
optimized, basic floating-point operations for 
i960 architectures 
without a floating-point 
unit (KA, SA, CA, CF). 
• Low-levellibraries--provide 
low-level 
support for execution on Intel-supported 
i960 
processor execution vehicles. 


Intel also offers software support which 
includes technical software information, 
automatic distributions 
of software and 
documentation 
updates, iCOMMENTS 
publication, remote diagnostic software, and a 
development tools troubleshooting 
guide . 


Intel Development Tools also offers a 30-day, 
money-back guarantee 
to customers who are 
not satisfied after purchasing any Intel 
development tool. 
WORLDWIDE SERVICE AND 
SUPPORT 
To augment its development tools, Intel offers 
field application engineering expertise and 
hotline technical support. 


ORDERING INFORMATION 


CTOOLS960 includes C compiler, assembler (ASM960), linker, utilities, C and floating point 
libraries. Note ASM960 source code is also included. 


Code 
Description 


CTOOLS960D 
Intel386TM or Intel486TM CPU DOS-based i960 processor C programming 
tools. 


CTOOLS960H 
HP9000/300 
or 700, HP-UX based i960 processor C programming 
tools. 
CTOOLS960R 
IBM RS60001AIX-based i960 processor C programming 
tools. 


CTOOLS960W 
Sun-4 based i960 processor programming 
tools. 
For more product information 
call 1-800-628-8686(U.S. and Canada). 


For literature 
call 1-800-548-4725 (Ll.S. and Canada) 


• 
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DB960 SOURCE-LEVEL RETARGETABLE 
DEBUGGER 


281433-1 


PRODUCT 
OVERVIEW 
DB960 is a source-level 
symbolic debugger supporting 
all members 
of the i960@ 
microprocessor 
family. It allows the users to debug application 
code in their own targets. 


DB960D's new user interface 
is based on the Microsoft Windows 3.1 environment 


providing greater 
debug productivity. 
DB960 shares the same debugger interface 
as 
Release 4.0 of Intel's i960 KB/SB 
processor emulators 
so users can easily transition 
debug 


work between the different 
tools. It operates 
on a DOS-based Microsoft Windcws" 3.1 


environment, 
communicating 
with the user's target system via a serial port. A 


retargetable 
monitor, MON960, is included with DB960D. The complete monitor source, 


example code and documentation 
are shipped so users can easily customize the monitor 


for the target 
system. There is no incorporation 
or royalty fee incurred 
if the monitor is 


shipped with the customers' 
products. 


FEATURES 


• Supports 
crOOL960 
R4.0 and GNU960 
R2.2 compiler development 
environment 
• Microsoft Windows 3.1 human 
interface 
with all the convenience 
ofthe 
windows 


environment, 
plus source display 


window, register 
window, watch window, 
memory window and local variable 
window to boost debugging productivity 


• Extensive 
breakpoint 
modes including 
source breakpoints, 
passpoints, 
temporary 
breakpoints 
and event-action 
breakpoints 
which trigger actions after 
the breakpoint 
is reached 
• Breakpoints 
can be defmed interactively, 
in the source window or symbolically 
using module names, procedure 
names 
and line numbers 


·WindOWB is a trademark 
of Microsoft 
Corporation. 
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DB960 Source-Level Retargetable Debugger 


• Single step program 
executions 
based on an 
assembler 
instruction, 
a C language 
statement 
or function 
• Memory and registers 
can be displayed and 
modified 
• Watch expressions 
can be defined and 
observed in the watch window as the 
program 
executes 
• Low level run-time 
library allows programs 
to access the host me system or to perform 
I/O operations 
• Symbols are demand loaded 
• DB960 R4.0 comes with a retargetable 
monitor (MON960) 


DEBUGGING FEATURES 


High-level source or disassembled 
code can be 
displayed 
in the source window. Users can 
scroll through 
the source, browse from module 
to module in a program, 
scope to any execut- 


able point in the source, or instantaneously 
relocate from a symbol name to the location 
where it was defined (hyperscope 
operation). 


S~bol 
names in the source can be highlighted 
to mspect the current 
run-time 
value of 
program 
variables. 
Call stacks can be 
examined 
to trace execution 
flow. 


A variety 
of breakpoints 
can be specified 
inclum.ng source breakpoints, 
watch points, 
passpomts, 
or event-action 
breakpoints. 


Breakpoints 
can be defined symbolically 
using 
module names, procedure 
names and line 
numbers. 
Watch points allow users to observe 
a variable 
as it changes during program 
execution. 
Passpoints 
display a message when 
a specified instruction 
is executed, giving the 
user a non-realtime 
way to track execution 
of 
key code sequences without 
halting 
instruction 
flow. The event-action 
form allows complex 
breakpoint 
c~nditions to be set up, including 
data breakpomts 
(when supported 
by on-chip 
registers). 


Users can step through 
program 
execution via 
a single assembly language 
instruction, 
a high- 
level language 
statement 
or a high-level 
function, function 
return 
statement 
or branch 
taken. Memory can be displayed or modified as 
common data types and all processor registers 
and system tables can be examined 
or changed. 


Expressions 
involving symbol names, memory 
references, 
or both, can be defined as watch 
expressions 
whose values are monitored 
in a 
Watch window as a program 
executes. 


RETARGETABLE MONITOR 


A.retargetable 
monitor, MON960, is shipped 
With DB960 for users to customize and 
incorporate 
into their target 
systems. 
Complete source code is provided with 
comprehensive 
retargeting 
instructions. 


Example code for porting to Intel evaluation 
boards, Intel82510 
UART serial controller 
chip and the 82C54 counter/timer 
are 
provided. MON960 has a well defined host 
debugger interface 
allowing users to develop a 
custom host debugger for their product. It 
supports 
big Endian, flash and interrupt- 
driven capabilities. 
MON960 also supports 
GDB960 R2.2 (debugger shipped with GNU960) 
and is available 
as a stand-alone 
product. 


HARDWARE DEBUG 
DB960 takes advantage 
of on-chip debug 
registers 
like those found on the i960 CA 
processor to provide hardware 
execution 
address and data address breakpoints. 
By 
using the available 
hardware 
breakpoint 
registers, 
the debugger can be used to find 
difficult bugs like stack overruns 
and invalid 
pointer accesses. Once the monitor has been 
retargeted 
to the target system, hardware 
designers can download initialization 
code 
read/write 
to registers, 
and examine 
memory 
or register 
contents. 


HIGH·SPEED SERIAL LINK 


Communications 
between the debugger host 
and target system is supported 
via RS-232 and 
RS-422 communication 
links. RS-232 allows 
access to industry 
standard 
serial protocols 
while the RS-422 interface 
provides higher 
speed communication 
(up to 115K baud) for 
faster code and data download. PC-AT bus- 
compatible 
RS-422 communication 
boards are 
available 
from various third party vendors. 


WORLDWIDE SERVICE AND 
SUPPORT 


Intel offers software support which includes 
technical 
software information 
a technical 
hotline, automatic 
distribution~ 
of software 
and documentation 
updates, 
iCOMMENTS 
publication, 
remote diagnostic 
software, and a 


development 
tools troubleshooting 
guide. Intel 


Development 
Tools also offers a 3O-day, money- 


back guarantee 
to customers 
who are not 
satisfied after purchasing 
any Intel 
development 
tool. 
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ORDERING INFORMATION 


Order Code 
Description 


DB960D 
DOS-based, retargetable 
software debugger for the 
i960 KA, i960 KB, i960 SA, i960 SB, i960 CA and 
i960 CF microprocessors. 
Includes host debug 
software, MON960 retargetable 
monitor, 
host 110 
libraries 
and documentation. 


MON960 
System debug monitor, source code product. 


For more product information 
call 1-800-628-8686 (V.S. and Canada). 


For more literature 
call 1-800-548-4725 (V.S. and Canada). 
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iRMK 960 
REAL-TIME KERNEL 


• 
32-Blt Real-Time 
Multl-Tasklng 
Kernel 
for the 1960 Microprocessor 
Family 


• 
Flexible, 
Modular 
Design to Ease 
System 
Integration 


• 
Fast Execution 
with Predictable 
Response 
Time for Time-Critical 
Applications 


• 
Compact 
Code Size (14 Kbytes- 
Including 
All Optional 
Modules) 


/" 


• 
Requires 
Only an 1960 KA, KB or MC 


Embedded 
Processor 


• 
Bus Independent 


• 
Easy Customlzatlon 
and Add-On 
Enhancements 


• 
Easily EPROMmable 


• 
Comprehensive 
Development 
Tool 
Support 


The iRMK 960 Real-Time 
Kernel is the 32-bit real-time 
executive 
developed 
and supported 
by Intel, the i960 


architecture 
experts. 
The kernel 
is a small, fast and highly modular 
package 
of system 
control 
software. 
It 


contains 
the basic software 
building 
blocks 
that act as the foundation 
in .using the key features 
of the i960 


microprocessor. 
The iRMK 960 software 
is fully supported 
by an array of tools that work in the most popular 


development 
environments 
(i.e., DOS', 
VAXIVMS', 
SUW). 


The iRMK 960 Real-Time 
Kernel is available 
off-the-shelf. 
The kernel 
reduces 
the cost and risk of designing 


and maintaining 
software 
for numerous 
real-time 
applications 
such as, embedded 
control 
systems 
and dedi- 


cated 
real-time 
subsystems 
in multiprocessor 
environment. 
Use of the kernel can save man years that might 


otherwise 
be spent developing 
or porting another 
real-time 
kernel. This means reduced 
time to market for the 
user. 


*DOS 
is a registered 
trademark 
of Microsoft 
Corporation. 
VAXIVMS 
is a trademark 
of Digital 
Equipment 
Corporation. 


SUN is a trademark 
of Sun Microsystems. 


October 1991 
Order Number: 281006-001 


• 
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A Full Set of Real-Time 
Building Blocks 


The kernel provides a full set of services for real- 
time applications including task management, time 
management, synchronization of and communica- 
tions between tasks, and memory pool manage- 
ment. 


and task priority change. Task handlers can be used 
for a wide range of functions, including saving and 
restoring the state of coprocessor registers on task 
switch, masking interrupts based on task priority or 
implementing statistical and diagnostic monitors. 


INTERRUPT 
MANAGEMENT 


iRMK 960 interrupts are managed by immediately 
switching control to user-written interrupt handlers 
when an interrupt occurs. 


HARDWARE 


APPLICATION 


LANGUAGE 
INTERFACE 
LIBRARIES 


KERNEL 


CORE 


IoIODULES 


KERNEL 


OPTIONAL 


IoIODULES 


KERNEL 


SUPPLIED 


DEVICE 


IoIANAGERS 


USER- 


SUPPLIED 
SYSTEIoI 


ROUTINES 


Figure 1. iRMK 960 Real-Time 
Architecture 
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Response to interrupts is both fast and predictable. 
Most of the kernel's system calls can be executed 
directly from interrupt handlers. 


TIME 
MANAGEMENT 


The time management features included in the ker- 
nel provide single-shot alarms, repetitive alarms and 
a real-time clock. In addition, alarms can be reset. 


These time management facilities can solve a wide 
range of real-time programming problems. Single- 
shot alarms, for example, can be used to handle 
timeouts. If the timeout occurs, the alarm invokes a 
user-written handler; if the event occurs before the 
timeout, the application simply deletes the alarm. 
Other uses for the kernel's time management facili- 
ties including polling devices with repetitive alarms, 
putting tasks to sleep for specified periods of time, 
or implementing a time-of-day clock. 


INTERTASK 
SYNCHRONIZATION 
AND 
COMMUNICATION 


Semaphores, regions and mailboxes are the key 
mechanisms the kernel uses for synchronizing tasks 
and communicating between tasks. 


Semaphores are objects used for intertask signaling 
and 
synchronization. 
Tasks 
exchange 
abstract 
"units" with semaphores as a means of becoming 
synchronized. A task requests 
a unit from a sema- 
phore to gain access to a resource. If the resource is 
available, the semaphore will have a unit to give to 
the task, enabling the task to proceed. A task sends 
a unit to a semaphore to indicate that it has released 
a previously obtained resource. 


A special binary type of semaphore is called a Re- 
gion. Regions are used to ensure mutual exclusion, 
thus preventing deadlock when tasks contend for 
control of system resources. A task holding a re- 
gion's unit runs at the priority of the highest priority 
task waiting in queue for the region's unit. 


Mailboxes are queues that can hold any number of 
messages and are used to exchange data between 
tasks. Either data or pointers can be sent using mail- 
boxes. The kernel allows mailbox messages to be of 
any length. High priority messages can be placed 
Oammed)at the front of the message queue to en- 
sure that they are received and processed before 
other messages queued at the mailbox. 


To ensure that high priority tasks are not blocked by 
lower priority tasks, the kernel allows tasks to queue 
at semaphores and mailboxes in priority order. The 
kernel also supports first-in, first-out task queueing. 
I 
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MEMORY 
POOL MANAGEMENT 


The iRMK 960 kernel uses the concept of memory 
pools to efficiently divide and manage blocks of 
memory. The memory pool manager provides for 
both fixed and variable block allocation. 


Memory can be divided into any number of pools. 
Multiple memory pools might be created for different 
speed memories, or for allocating different size 
blocks. The times to allocate and de-allocate fixed- 
size areas from within a pool have a fixed upper 
bound. 


The kernel-supplied memory manager works with 
flat memory architecture. Users can also write their 
own memory manager to provide different memory 
management policies or support virtual memory. 


Hardware 
Requirements 
and Support 


The kernel requires only an i960 microprocessor and 
sufficient memory for itself and its application. The 
kernel's design, however, recognizes that many sys- 
tems use additional programmable peripheral devic- 
es and coprocessors. The kernel provides optional 
device managers for: 
• The 82380 Integrated System Peripheral (ISP) 
chip 


• The 8254 Programmable Interval Timer (PIT) chip 


An application can supply managers for other devic- 
es and coprocessors in addition to or in replacement 
of the devices listed above. 


The openness of the iRMK 960 kernel is a major 
benefit to the OEM. The kernel is designed to be 
programmed into PROM or EPROM, making it easy 
to use in embedded designs. In addition, it can be 
used with any system bus, including those of MULTI- 
SUS I and MULTISUS 11 bus architectures. 


A Modular Architecture 
for Easy 
Customization 


The kernel is designed for maximum flexibility. It can 
be customized for any application. Each major func- 
tion, mailboxes for example, is implemented as a 
separate module. The kernel's modules have not 
been linked together and are supplied individually. 
(See Table 1 for the list of kernel modules, and their 
approximate sizes.) 


The user links only the modules needed for his appli- 
cation. Any module not used does not need to be 
linked in, and does not increase the size of the ker- 
nel in your application. The user can also replace 


• 
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any optional kernel module with one that imple- 
ments specific features required by the application. 
For example, the user might want to replace the ker- 
nel's memory manager with one that supports virtual 
memory. 


Table 1. IRMK 960 Kernel Modules 
and Approximate 
Sizes 


Core Modules 
Task Manager 
Interrupt Manager 
Time Manager 
Scheduler 
Initialization 


Bytes 
2600 
150 
3000 
1700 
50 


Optional 
Modules 
Mailbox Manager 
Semaphore Manager 
Memory Manger 
Fault Handler Manager 
Miscellaneous 


1250 
2900 
1260 
50 
300 


Optional 
Device Manager 
82380 Integrated System Peripheral 
8254 Programmable Interval Timer 
4200 
1200 


Total size of the (entire) kernel (minus device man- 
agers) is about 13.5 Kbytes. 


Developing 
with the iRMK 960 
Real-Time 
Kernel 


Kernel applications can be written using any lan- 
guage or compiler that produces code that executes 
on the i960 microprocessor. This independence is 
achieved by using an interface library. This library 
works with the idiosyncracies of a particular lan- 
guage-for 
example, the ordering of parameters. 


The interface library translates the calls provided by 
the language into a standard format expected by the 
kernel. Intel provides an interface library for our iC 
960 compiler. The source code of this library is in- 
cluded, so that the user can modify it to support oth- 
er compliers. 


Because the kernel is supplied as unlinked object 
modules, applications can be developed on any sys- 
tem that hosts the development tools needed. 


Comprehensive 
Development 
Tool 
Support 


Intel provides a complete line of 80960 development 
tools for writing and debugging iRMK 960 applica- 
tions. 
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These tools include: 
Software: 
ASM 960 assembler iC 960 
compiler 
NOTE: 
These tools are available for DOS, 
VAXIVMS", MicroVAX", SUN" and 
EVA960KB 4MB environment 


Debuggers: 
ICETM960 
In-Circuit Emulator for the i960 mi- 
croprocessor 
SMD 960 
System Debug Monitor for the i960 
microprocessor 
Evaluation 
Vehicles: 
EVA960KB 
AT Bus-Compatible Board 
A960KB4MB AT Bus-Compatible Board with 
4 Mbytes of Memory 
QT960 
Standalone Evaluation Vehicle 


Intel Support, Consulting and Training 


With iRMK 960 kernel software, the developer has 
available the total Intel i960 architecture and real- 
time expertise of Intel's support engineers. Intel pro- 
vides telephone support, on or off-site consulting, 
troubleshooting guides and updates. The kernel in- 
cludes 90 days of Intel's Technical 
Information 


Phone Service (TIPS). Extended support and con- 
sulting are also available. 


Contents 
of the iRMK 960 Kernel 
Development 
Package 


The iRMK 960 Kernel comes in a comprehensive 
package including: 


• Kernel object modules 
• Source for the kernel supplied 82380 Integrated 
System Peripheral and 8254 PIT device manag- 
ers 
• Source for the iC 960 interface library 
• Source for sample applications showing the fol- 


lowing: 


- 
Structure of kernel applications 
- 
Use of the kernel with an application written in iC 
960 language 
- 
Compile, bind and build sequences 
- 
Sample initialization code for the i960 microproc- 
essor 
- 
Applications written to execute in a flat memory 
space 


• User reference guide 
• 90 days of customer support 
I 
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LICENSING 


iRMK 960 software requires prior execution of the 
standard Intel Software License Agreement (SLA). A 
single development copy requires a Class I license 
and allows iRMK 960 software to be loaded and run 
on one single-processor system. 


SPECIFICATIONS 


Sys~emCalls 


The following items are system calls arranged by 
type: 


KERNEL 
INITIALIZATION 


IRMK 960 KERNEL 
SYSTEM 
CALLS 
LISTING 


KN~nitialize 


OBJECT 
MANAGEMENT 


TASK 
MANAGEMENT 


KN_create_task 
KN_delete_task 
KN_suspenLtask 
KN_resume_task 
KN_seL.priority 
KN_geL.priority 


INTERRUPT 
MANAGEMENT 


KN_seLinterrupt 
KN_stop_scheduling 
KN_starLscheduling 


TIME 
MANAGEMENT 


KN_sleep 
KN_create_alarm 


KN_reseLalarm 
KN_delete_alarm 


I 


Initialize kernel 


Returns a pointer to the 
area holding object 
Returns a pointer for the 
current task 


Creates a task 
Deletes a task 
Suspends a task 
Resumes a task 
Change priority of a task 
Return priority of a task 


Specify interrupt handler 
Suspend task switching 
Resume task switching 


Put calling task to sleep 
Create and start virtual 
alarm clock 
Reset an existing alarm 
Delete alarm 


KN_geLtime 
KN_seLtime 
KN_tick 


IRMK960 


Get time 
Set time 
Notify kernel that clock 
tick has occurred 


INTERTASK 
COMMUNICATION 
AND 
SYNCHRONIZATION 


KN_create_semaphore 
KN_delete_semaphore 


KN_send_unit 


KN_create_mailbox 
KN_delete_mailbox 
KN_send_data 
KN_send_priority_data 


MEMORY 
MANAGEMENT 


Create a semaphore 
Delete a semaphore 


Add a unit to a 
semaphore 


Receive a unit from a 
semaphore 
Create a mailbox 
Delete a mailbox 
Send data to a mailbox 
Place Oaril)priority 
message at head of 
message queue 
Request a message 
from a mailbox 


KN_create-pool 
Create a memory pool 


KN_delete_pool 
Delete a memory pool 11 
KN_create_area 
Create a memory area 
from a pool 
KN_d~lete_area 
Return a memory area to 
a memory pool 
KN_geLpool_attributes 
Get a memory pool's 
attributes 


PROGRAMMABLE 
INTERRUPT 
CONTROLLER 
MANAGEMENT 


KN_initialize_PICs 
KN_masLslot 


KN_new_masks 
KN_geLslot 


KN_geLinterrupt 


Initialize PlC's 
Mask out interrupts on a 
specified slot 
Unmask interrupts on a 
specified slot 
Signal the PlC that the 
interrupt on a specified 
slot has been serviced 
Change interrupt masks 
Return the most 
important active interrupt 
slot 
Get address of specified 
interrupt handler 
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PROGRAMMABLE 
INTERVAL 
CONTROLLER 
MANAGEMENT 


KN_initialize_PIT 


KN_starLPIT 


KN_geLPIT 
_interval 


Initialize the PIT 


Start PIT counting 


Return PIT interval 


PROCESSOR 
RECOGNIZED 
FAULT 
HANDLING 


KN_geLfaulLhandler 
Get address of fault 
handler currently 
associated 
with 
specified 
fault type 


KN_seLfaulLhandler 
Establish 
address of 
fault handler for the 
specified 
fault type 


I 


PROCESSOR 
INTERRUPT 
CONTROLLER 
SUPPORT 


KN_geLprocessor_ 
-priority 


KN_seLprocessor_ 
-priority 


Returns value of the 
processor 


Change the value of the 
processor 
priority 


PERFORMANCE 


The figures 
listed 
below 
were 
derived 
from 
a test 
suite running on a EVA-960 evaluation 
vehicle 
using 
an 80960KB 
running 
at 20 MHz. The EVA-960 
has 
what 
is known 
as 2-1-1-1 
wait state 
memory; 
what 
this means 
is that the first instruction 
of a four in- 
struction 
fetch takes two wait states, and each of the 
three 
successive 
instructions 
takes 
one wait state. 
The figures are the worst case values obtained 
from 
several 
sets of test runs. The code was generated 
using the iC 960 DOS hosted 
compiler, 
Version 
1.1. 


Action 
Create Pool 
Get Pool Attributes 
Delete Pool 
Create Area 
Delete Area 


Time (in /ls) 
18 
36 
1 
35 
32 
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Action 
Create Semaphore 
Delete Semaphore 
FIFa Semaphore 
Send Unit 
FIFa Semaphore 
Receive 
Unit 
Region Semaphore 
Send Unit 
Region Semaphore 
Receive 
Unit 


Create Mailbox 
Delete Mailbox 
Send Data 
Receive 
Data 


Create Alarm 
Delete Alarm 
FIFa Semaphore 
Send/Receive 
Unit with Task Switch 
Suspend Task with Task Switch 
Basic Task Switch 
Create Task 
Suspend Task 
Resume Task 
Delete Task 
Get Priority 
Set Priority 


Set Interrupt 
Get Interrupt 


MANUALS 
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Time (in /ls) 
6 
14 
7 
7 
18 
14 


19 
23 
21 
21 


29 
30 


75 
70 
50 
62 
26 
50 
50 
5 
27 


3 
3 


iRMK 
960 
User's 
Manual 
(Intel 
Order 
#463863- 
001). 


TRAINING 
INFORMATION 


Intel Customer 
Service 
Training: 


"80960 
KAlKB 
Embedded 
Processor 
Training 
Course" 


ORDERING 
INFORMATION 


Ordering Code 


RMK960 
Product Description 


iRMK 960 Real-Time 
Kernel 


I 


EV80960CA Evaluation 
Board 


270870-1 


Low Cost Processor Evaluation 
Tool 
Intel's EV80960CA evaluation 
board provides a low-cost hardware 
environment 
for code 
execution 
and software debugging. The board features 
the 8096OCA, the newest and 
highest performance 
member of Intel's family of 32-bit embedded microprocessors. 
The 
board allows a user's program 
to take full advantage 
of the power of the 80960CA and 
provides zero wait state execution 
of the user's code. 


Popular 
features 
such as single line assembler/disassembler, 
single-step 
program 
execution 
and software breakpoints 
are standard 
on the EV8096OCA's on-board monitor. 
Available 
separately, 
Intel offers a complete code development 
environment 
using the 
assembler 
(ASM-960) as well as high-level languages, 
such as Intel's iC-960 C compiler, to 
accelerate 
development 
schedules. 


The EV80960CA evaluation 
board package features 
the 80960CA System Debug Monitor 
(SDM) in EPROM, a SDM host software floppy disk, a power supply cable, a 9-pin PC/AT 
serial connector 
for terminal 
and the EV80960CA User's Manual. 
The EV80960CA 
User's Manual 
includes schematics 
of the board, a part list and programmable 
logic 
(PLD) equations. 
The board is hosted on an IBM or BIOS-compatible 
PC/AT. 


"The SRAM memory 
system 
provides 
zero wait state read (O..Q-O..O.O) 
and one wait state write 
(1-1-1-1-0) performance. 
··The 
DRAM 
memory 
system 
provides 
2.1-1-1-1 reads and writes. 


October 
1991 
Order 
Number: 
27087()'()()1 
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• rngn-Levei 
Language 
Support 
• Two RS-232s for Host and User 
Communication 
• Two iSBX 110 Connectors 
• An Expansion 
Bus to Accommodate 
Eurocard 
Form-Factor 
Prototyping 
Boards 


Fast Pipelined SRAM Memory 
System 


The pipelined-read 
memory system of the 
EV80960CA provides true zero wait state read 
and one wait state performance. 
The memory 
design utilizes the internal 
wait state 
generator 
ofthe 
80960CA. 


- 


The 80960CA System Debug Monitor (SDM) for 
the EV80960CA allows the user to read and 
modify internal 
registers 
and external 
memory 
while the user's program 
is running 
on the 
board. 


iSBX 110 Connectors and 
Expansion Interface 


The EV80960CA evaluation 
board has two 
connectors 
to support both 8- and 16-bit 
standard 
iSBX Expansion 
Modules. The board 
also provides an expansion 
bus to 
accommodate 
Eurocard 
form-factor 
prototyping 
boards. 


I/o 
EXPANSION 
WITH STANDARD 
ISBX BOARDS 
I 
I 


BUS 
BOOT I 


SENSE; I I 
iSBX I 
80960CA 
1 WBYTES 
BUFFERING 
EPROW 
SWITCHES 
EXPANSION 
CPU 
DRAW 
AND 
CONTROL 
I 
SIGNAL HBUFFER~ 
GEN. 
I 
I 
64 KBYTES 
O-WAIT 
32-BIT 
FAULT AND 
TIWER/ 
I 
I 


STATE 
EXPANSION 
USER LEDS 
COUNTERS 
UARTS 
SRAW 
BUS 
I 
I 
HOST 
USER 
INTERFACE 
PORT 
PORT 


270870-2 


Block Diagram 
of the EV80960CA 
Board 
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EV80960CA Evaluation 
Board 


Communication Link 


The EV80960CA board communicates 
with the 
host through 
the RS-232 link using an Intel 
82510 UART provided on board. The board 
supports 
seven baud rates: 300, 1200, 2400, 
4800, 9600, 19200 and 38400. 


Power Requirements 


The EV80960CA Evaluation 
Board requires 
5V 
at 2000 mA and ± 12V at 25 mA. 


Host System Requirements 


The EV80960CA Evaluation 
Board is hosted on 
an mM PC/ AT or compatibles; 
a 386-based PC 
is recommended. 
The host system must meet 
the following minimum 
requirements: 
• 512 Kbytes of Memory 
• One 1.2 Mbyte Floppy Disk Drive 
• PC-OOS 3.2 or Later 
• A Serial Port (COM1 or COM2) 
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i960 SA/SB EVALUATION BOARD 


272033-1 


i960 SA/SB EVALUATION BOARD 
The EV80960SX board is a general purpose evaluation tool for the i960 SA/SB 
embedded 
processors. This evaluation board provides a high-performance DRAM subsystem, an 
interleaved EPROM subsystem, and a robust set of peripheral devices for benchmarking 
and debugging application code written for the i960 SA/SB 
embedded processors. 


The EV80960SX is a great starter kit for your 32-bit application. The EV80960SX, 
NINDY debug environment, along with assembler and C-compiler (not provided) provide 
a seamless environment for developing code and evaluating the i960 SA/SB 
processors. 
The NINDY monitor provides code download capabilities from a number of popular 
development systems, including DOS-based PC's. Single step, breakpoints, register and 
memory display are among the full set offeatures provided by NINDY. 


The board is provided with the following 
The EPROM subsystem accommodates 
features: 
four, 32-pin or 28-pin 8-bit wide EPROMs 
• DRAM Subsystem operates at 
with up to 150 ns access times. 


1-0-0-0-0-0-0-0wait states for read and 
• Flash EPROM Subsystem reads and 
write cycles in the burst mode. The 
writes two 8-bit wide Flash EPROMS. 


DRAM subsystem runs at the maximum 
• 8259A Interrupt Controller provides 
processor frequency of 16 MHz, using 
expanded interrupt capabilities using 
100 ns fast page mode DRAMs. The 
the i960 SA/SB's interrupt controller 
DRAM subsystem can accommodate 
interface. 


from 512 Kbytes to 4 Mbytes, using 4 or 8 
• Parallel Port Input allows fast 
ZIP-packaged DRAMs. 
downloads of code or data to the 
• Interleaved EPROM Subsystem executes 
EV80960SX board. The parallel port 
burst program fetches with a 2-0-1-0-2-0- 
provides auto-busy and interrupt 
1-0wait state performance. 
capabilities, and is a full implementation 
of the Centronics standard. 


ACE5!. 
ICE and MCS. are registered 
trademarks 
of Intel Corporation. 


Ethernet 
is a registered 
trademark 
of Xerox Corporation. 


·CHMOS te a patented Intel process. 


5-20 
June 1991 
Order Number: 272033001 


infel· 


i960 SA/SB EVALUATION BOARD 


• Two serial ports provide queued and 
interrupt 
driven serial transfer 
at up to 
128000 baud. 
• 82C54 Timer/Counter 
provides a 32-bit 
counter 
and 16-bit counter, 
each with 
dedicated 
interrupts. 
• Expansion/Prototype 
Bus (XBUS) allows 
expansion 
cards and prototype 
hardware 
direct access to the i960 SA/SB's bus and 
control signals. Optionally, 
a configurable 
wait state scheme provides a no glue 
interface 
to most peripherals 
attached 
to the 
XBUS. 
• LEDs and Switches are user programmable. 
One 10-segment bar LED, a 7-segment LED 
and an 8-position switch are under program 
control. 


• Local Area Networking 
(LAN) is 
implemented 
using an 82596SX LAN 
coprocessor. 


• Laser Printer 
Control provides interfaces 
to 
TEC or Canon compatible 
laser engines. 
• Monitor and Self-test diagnostics 
are 
provided for the EV80960SX in the EPROMs 
installed 
in the board. 


The evaluation 
board comes complete with a 
design database 
included on diskette, 
the 
NINDY debug monitor on diskette 
and in 
EPROM, power and serial cables, schematics 
and user's manual. 


The EV80960SX is a public domain design. The 
hardware 
is fully documented 
and provides 
working examples of popular 
memory and 
peripheral 
interfaces 
to the i960 SA/SB 
processor. The schematic 
and PLD database 
are provided with each board. The EV80960SX 
designs are easily duplicated 
and can be used 
directly as the building blocks for custom 
designs. Custom hardware 
can be prototyped 
using the expansion 
bus (XBUS) connector . 
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